Antoine R. Dumontmarked the checklist item migrate scheduling db data (drop the date field in the extra_loader_arguments for deb listed_origins) [4] as completed
marked the checklist item migrate scheduling db data (drop the date field in the extra_loader_arguments for deb listed_origins) [4] as completed
Antoine R. Dumontmarked the checklist item deploy lister as completed
marked the checklist item deploy lister as completed
Antoine R. Dumontmarked the checklist item restart lister service as completed
marked the checklist item restart lister service as completed
Antoine R. Dumontmarked the checklist item deploy loader core as completed
marked the checklist item deploy loader core as completed
Antoine R. Dumontmarked the checklist item restart package loader services as completed
marked the checklist item restart package loader services as completed
What a mess! The existing data both in staging and production are not in the expected
shape for the loader. Hence the issue of failing the load [1]
I'm working on cleaning up that mess and trigger things correctly for the existing
debian origins we got.
After cleaning up the listed origins for visit_type 'deb' (table 'listed_origins') and
cleaning up the state of the right lister (table 'listers', name 'debian', instance_name
'Debian'). The listing is fine [2].
And the loading finally starts to give positive results. [3]
After some fighting to untangle the mess we had in the scheduling dbs:
wrong task type used
wrong data format in old entries
And some more work from @anlambert to actually have the last_update filled in [3] by the lister
(so the scheduler can do its work alone...)
I've finally managed to have deb origins scheduled in production the now "standard" way
(service swh-scheduler-schedule-recurrent).
And workers are happily ingesting those!
[1]
Dec 03 16:03:19 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:Skewed fetch for visit type deb with policy already_visited_order_by_lag: fetched 0.7530120481927711, requested 0.5Dec 03 16:03:19 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:Skewed fetch for visit type deb with policy never_visited_oldest_update_first: fetched 0.2469879518072289, requested 0.5Dec 03 16:03:23 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:deb: 6640 visits scheduled in queue swh.loader.package.debian.tasks.LoadDebianDec 03 16:03:29 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:Skewed fetch for visit type deb with policy already_visited_order_by_lag: fetched 1.0, requested 0.5Dec 03 16:03:29 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:Skewed fetch for visit type deb with policy never_visited_oldest_update_first: fetched 0.0, requested 0.5Dec 03 16:03:32 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:deb: 5000 visits scheduled in queue swh.loader.package.debian.tasks.LoadDebianDec 03 16:03:38 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:Skewed fetch for visit type deb with policy already_visited_order_by_lag: fetched 1.0, requested 0.5Dec 03 16:03:38 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:Skewed fetch for visit type deb with policy never_visited_oldest_update_first: fetched 0.0, requested 0.5Dec 03 16:03:39 saatchi swh[3728333]: INFO:swh.scheduler.celery_backend.recurrent_visits:deb: 813 visits scheduled in queue swh.loader.package.debian.tasks.LoadDebian
17:10:57 softwareheritage-scheduler@belvedere:5432=> select count(*) from listed_origins where visit_type='deb' and last_update is not null;+-------+| count |+-------+| 37604 |+-------+(1 row)Time: 1112.519 ms (00:01.113)
Antoine R. Dumontmarked the checklist item ensure deb loading origins are actually scheduled now [2] as completed
marked the checklist item ensure deb loading origins are actually scheduled now [2] as completed
Antoine R. Dumontmarked the checklist item ensure the nixguix loader actually finish loading now [3] as completed
marked the checklist item ensure the nixguix loader actually finish loading now [3] as completed
Antoine R. Dumontmarked the checklist item ensure the nixguix loader actually finish loading now [3] as incomplete
marked the checklist item ensure the nixguix loader actually finish loading now [3] as incomplete