Write specs about metadata workflow
(opening tasks for myself on my last day might not be the best idea, but this should be written somewhere)
The metadata workflow and strategy is about recovering descriptive metadata on the artifacts in the archive. This metadata can be found:
- in the content itself ->
intrinsic metadata
(implemented with #1232 (closed)) - not in the content ->
extrinsic metadata
-
extrinsic metadata
can be found with the content when listing or loading the content - or in a software registry (e.g Wikidata, swMath, ASCL..)
-
The different components and the storage infrastructure that was put in place to keep this information should be specified and documented.
A discussion started over the metadata_provider in swh/devel/swh-storage!112 (closed).
Migrated from T1344 (view on Phabricator)