Provide stats on extracted metadata in the indexer storage api
The API should report the number of origins that were indexed, how many of them have a non-empty set of metadata, and a breakdown per metadata type.
Migrated from T1484 (view on Phabricator)
mentioned in issue #1483 (closed)
marked this issue as related to #1483 (closed)
added Indexer Metadata workflow Metrics/monitoring priority:Normal labels
Useful queries:
select count(*) from origin_intrinsic_metadata;
select count(*) from origin_intrinsic_metadata where metadata != '{"@context": "https://doi.org/10.5063/schema/codemeta-2.0"}';
(The latter is a hack. For a long-term solution, it would be better to use JSON operations to check whether there is any key other than @context.)
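The "any key other than @context" check could be sketched client-side as follows. This is only an illustration, not the indexer storage API: the function names `has_metadata` and `summarize` are hypothetical, the input is assumed to be the `metadata` column values already fetched as JSON strings or dicts, and the per-key breakdown is an assumed stand-in for "breakdown per metadata type".

```python
import json
from collections import Counter

CONTEXT_KEY = "@context"


def has_metadata(doc):
    """Return True if the document has any top-level key other than @context."""
    return any(key != CONTEXT_KEY for key in doc)


def summarize(rows):
    """Compute simple stats over metadata documents.

    `rows` is an iterable of JSON strings or already-decoded dicts
    (hypothetical input shape; adapt to how the rows are actually fetched).
    """
    total = 0
    non_empty = 0
    per_key = Counter()
    for row in rows:
        doc = json.loads(row) if isinstance(row, str) else row
        total += 1
        if has_metadata(doc):
            non_empty += 1
            # Count each top-level key once per document, ignoring @context.
            per_key.update(k for k in doc if k != CONTEXT_KEY)
    return {"total": total, "non_empty": non_empty, "per_key": dict(per_key)}
```

Unlike the string comparison in the second query, this does not depend on the exact `@context` value or on how the JSON is serialized.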
changed title from Show stats on extracted metadata to Provide stats on extracted metadata
changed title from Provide stats on extracted metadata to Provide stats on extracted metadata in the indexer storage api
marked this issue as related to swh/meta#1485
closed