datacite: Fill in the gap with github
Push the following list (from the datacite dataset) into the oneshot queue [1].
- schedule
- ingestion [2]
[2] https://grafana.softwareheritage.org/goto/tLg1I2LVz?orgId=1
Refs. #4400 (closed)
added activity::Processing label
changed milestone to %Extend archive coverage [Roadmap - Collect]
mentioned in commit swh/devel/snippets@f1b9414c
assigned to @ardumont
changed title from Fill in the gap with datacite to datacite: Fill in the gap with github
changed the description
Schedule:
$ export SWH_CONFIG_FILENAME=~/.config/swh/scheduler.yml; cat ~/downloads/priority.list.github | shuf | ./schedule_with_queue_length_check.py --queue-name oneshot-large-git
...
{'args': [], 'kwargs': {'url': 'https://github.com/kdeldycke/mail-deduplicate', 'lister_name': 'github', 'lister_instance_name': 'github', 'pack_size_bytes': 34359738368}}
{'args': [], 'kwargs': {'url': 'https://github.com/Gaulton-Lab/BetaCells_cytokines_T1D', 'lister_name': 'github', 'lister_instance_name': 'github', 'pack_size_bytes': 34359738368}}
{'args': [], 'kwargs': {'url': 'https://github.com/RViMLab/oflibnumpy', 'lister_name': 'github', 'lister_instance_name': 'github', 'pack_size_bytes': 34359738368}}
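For readers unfamiliar with the approach, here is a minimal sketch of what a "schedule with queue length check" loop could look like. It is not the actual `schedule_with_queue_length_check.py` from the snippets repository: the function names, the `get_queue_length` and `send` callables, and the `max_queue_length` threshold are all assumptions made for illustration. Only the shape of the task payload is taken from the output above.

```python
import time
from typing import Callable, Dict, Iterable

# 32 GiB, i.e. the 34359738368 value seen in the output above.
PACK_SIZE_BYTES = 32 * 1024**3


def build_task(url: str) -> Dict:
    """Build a payload shaped like the lines printed above (hypothetical helper)."""
    return {
        "args": [],
        "kwargs": {
            "url": url,
            "lister_name": "github",
            "lister_instance_name": "github",
            "pack_size_bytes": PACK_SIZE_BYTES,
        },
    }


def schedule_with_queue_length_check(
    urls: Iterable[str],
    get_queue_length: Callable[[], int],  # assumption: returns the current queue size
    send: Callable[[Dict], None],         # assumption: pushes one task to the queue
    max_queue_length: int,                # assumption: threshold chosen by the operator
    wait_seconds: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Send one task per URL, pausing while the queue is at or above the threshold."""
    sent = 0
    for line in urls:
        url = line.strip()
        if not url:
            continue  # skip blank lines from the input list
        # Back off until consumers have drained the queue below the threshold.
        while get_queue_length() >= max_queue_length:
            sleep(wait_seconds)
        send(build_task(url))
        sent += 1
    return sent
```

In this sketch the queue access is injected as plain callables, so the throttling logic can be exercised without a message broker; a real run would plug in whatever client the scheduler configuration points at.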
changed the description
mentioned in issue #4836 (closed)
changed the description
changed the description
The queue is now below 3k, which is roughly the level at which this batch should be done. The ongoing ingestion is no longer about github origins (the next fill-in-the-gap task has kicked in), so this is done.
closed