2023-11-10 17:07:26 swh-scheduler@db1:5432 λ select now(), name, instance_name, visit_type, enabled, count(*) from listed_origins lo inner join listers l on l.id=lo.lister_id where lister_id in (select id from listers where name='arch') group by name, instance_name, visit_type, enabled order by count asc;
+-------------------------------+------+---------------+------------+---------+-------+
|              now              | name | instance_name | visit_type | enabled | count |
+-------------------------------+------+---------------+------------+---------+-------+
| 2023-11-10 16:07:30.073042+00 | arch | arch          | arch       | t       |   263 |
+-------------------------------+------+---------------+------------+---------+-------+
(1 row)

Time: 29.870 ms
And they are getting ingested.
But there seem to be issues, as most (if not all) origins get a checksum mismatch [2].
It seems the image is missing some compression tools, which makes the ingestion fail. [3] [4] [5]
I've deactivated the loader for now, and I'm upgrading the image.
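To confirm that the upgraded image has what it needs, a quick check along these lines could be run inside the loader container. This is only a sketch: the tool list is an assumption (the logs do not name the missing tools; zstd is included because the failing artifact is a .pkg.tar.zst), and `missing_tools` is a hypothetical helper, not part of swh.

```python
import shutil

# Hypothetical list: the exact tools the loader needs are an assumption.
# zstd is listed because the failing package is a .pkg.tar.zst archive.
CANDIDATE_TOOLS = ["zstd", "xz", "gzip", "bzip2", "tar"]


def missing_tools(tools=CANDIDATE_TOOLS):
    """Return the entries of `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("missing:", ", ".join(missing))
    else:
        print("all candidate tools found")
```

An empty result only means the binaries are on PATH; it does not prove the loader can actually unpack the archives.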
[1]
scheduler-schedule-recurrent INFO:swh.scheduler.celery_backend.recurrent_visits:arch: 263 visits scheduled in queue swh.loader.package.arch.tasks.LoadArch
[2]
swh-cassandra/loader-arch-85c4967dfd-rt2m4[loaders]: {"asctime": "2023-11-10 16:14:01,464", "threadName": "MainThread", "pathname": "/opt/swh/.local/lib/python3.10/site-packages/swh/loader/package/loader.py", "lineno": 706, "funcName": "load", "task_name": null, "task_id": null, "name": "swh.loader.package.loader", "levelname": "ERROR", "message": "Failed to load branch releases/3.9.1-2/python-3.9.1-2-x86_64.pkg.tar.zst for https://archlinux.org/packages/core/x86_64/python", "exc_info": "Traceback (most recent call last):\n File \"/opt/swh/.local/lib/python3.10/site-packages/swh/loader/package/loader.py\", line 691, in load\n res = self._load_release(p_info, origin)\n File \"/opt/swh/.local/lib/python3.10/site-packages/swh/loader/package/loader.py\", line 876, in _load_release\n dl_artifacts = self.download_package(p_info, tmpdir)\n File \"/opt/swh/.local/lib/python3.10/site-packages/swh/loader/package/loader.py\", line 420, in download_package\n download(\n File \"/opt/swh/.local/lib/python3.10/site-packages/tenacity/__init__.py\", line 289, in wrapped_f\n return self(f, *args, **kw)\n File \"/opt/swh/.local/lib/python3.10/site-packages/tenacity/__init__.py\", line 379, in __call__\n do = self.iter(retry_state=retry_state)\n File \"/opt/swh/.local/lib/python3.10/site-packages/tenacity/__init__.py\", line 314, in iter\n return fut.result()\n File \"/usr/local/lib/python3.10/concurrent/futures/_base.py\", line 451, in result\n return self.__get_result()\n File \"/usr/local/lib/python3.10/concurrent/futures/_base.py\", line 403, in __get_result\n raise self._exception\n File \"/opt/swh/.local/lib/python3.10/site-packages/tenacity/__init__.py\", line 382, in __call__\n result = fn(*args, **kwargs)\n File \"/opt/swh/.local/lib/python3.10/site-packages/swh/loader/package/utils.py\", line 139, in download\n raise ValueError(\nValueError: Failure when fetching https://archive.archlinux.org/packages/p/python/python-3.9.1-2-x86_64.pkg.tar.zst. Checksum mismatched: 32000000 != 33233037"}
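To put a number on "most (if not all)", the loader logs can be tallied. A minimal sketch, assuming each log line carries one JSON record after the pod prefix, with the `levelname` and `exc_info` keys seen in [2]; `tally_errors` is a hypothetical helper, and lines that do not parse as JSON are silently skipped.

```python
import json
from collections import Counter


def tally_errors(lines):
    """Count ERROR records, split into checksum mismatches and other errors."""
    counts = Counter()
    for line in lines:
        start = line.find("{")  # skip the "namespace/pod[container]: " prefix
        if start == -1:
            continue
        try:
            record = json.loads(line[start:])
        except json.JSONDecodeError:
            continue  # not a complete JSON record on this line
        if record.get("levelname") != "ERROR":
            continue
        exc_info = record.get("exc_info") or ""
        kind = "checksum mismatch" if "Checksum mismatched" in exc_info else "other"
        counts[kind] += 1
    return counts
```

Fed with the saved loader output, for instance `tally_errors(open("loader-arch.log"))` (the file name is a placeholder), this gives the split between checksum failures and everything else.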
2023-11-10 17:38:45 swh-scheduler@db1:5432 λ select now(), name, instance_name, visit_type, enabled, count(*) from listed_origins lo inner join listers l on l.id=lo.lister_id where lister_id in (select id from listers where name='arch') group by name, instance_name, visit_type, enabled order by count asc;
+-------------------------------+------+---------------+------------+---------+-------+
|              now              | name | instance_name | visit_type | enabled | count |
+-------------------------------+------+---------------+------------+---------+-------+
| 2023-11-10 16:38:46.488422+00 | arch | arch          | arch       | t       | 25673 |
+-------------------------------+------+---------------+------------+---------+-------+
(1 row)

Time: 76.253 ms