Write Luigi tasks to generate the citation dataset
It should run similar analyzes to https://annex.softwareheritage.org/public/dataset/license-blobs/2022-04-25/ but on citation.cff
and codemeta.json
files collected here: https://annex.softwareheritage.org/public/dataset/citation-blobs/2022-04-25/
Migrated from T4714 (view on Phabricator)