swh@swh-toolbox-8499ff5477-ghgpp:~$ export SWH_CONFIG_FILENAME=/etc/swh/config-scrubber-journal.yml
swh@swh-toolbox-8499ff5477-ghgpp:~$ swh db init scrubber
psql:/opt/swh/.local/lib/python3.10/site-packages/swh/core/db/sql/36-dbmodule.sql:12: NOTICE: relation "dbmodule" already exists, skipping
INFO:swh.core.cli.db:Initializing database version to 7 from the scrubber datastore
DONE database for scrubber initialized at version 7
swh-scrubber=> \d
                      List of relations
 Schema |           Name           |   Type   |    Owner
--------+--------------------------+----------+--------------
 public | check_config             | table    | swh-scrubber
 public | check_config_id_seq      | sequence | swh-scrubber
 public | checked_partition        | table    | swh-scrubber
 public | corrupt_object           | table    | swh-scrubber
 public | datastore                | table    | swh-scrubber
 public | datastore_id_seq         | sequence | swh-scrubber
 public | dbmodule                 | table    | swh-scrubber
 public | dbversion                | table    | swh-scrubber
 public | fixed_object             | table    | swh-scrubber
 public | missing_object           | table    | swh-scrubber
 public | missing_object_reference | table    | swh-scrubber
 public | object_origin            | table    | swh-scrubber
(12 rows)
Checker configuration:
swh@swh-toolbox-8499ff5477-ghgpp:~$ for type in directory snapshot release revision
do
  swh scrubber check init journal --object-type=$type --name journal-checker-$type
done
Created configuration journal-checker-directory [1] for checking directory in kafka journal
Created configuration journal-checker-snapshot [2] for checking snapshot in kafka journal
Created configuration journal-checker-release [3] for checking release in kafka journal
Created configuration journal-checker-revision [4] for checking revision in kafka journal
swh-scrubber=> \pset pager off
Pager usage is off.
swh-scrubber=> select * from datastore;
 id | package | class | instance
----+---------+-------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  1 | journal | kafka | {"brokers": ["kafka1.internal.softwareheritage.org:9094", "kafka2.internal.softwareheritage.org:9094", "kafka3.internal.softwareheritage.org:9094", "kafka4.internal.softwareheritage.org:9094"], "group_id": "swh-archive-prod-journalchecker", "prefix": "swh.journal.objects"}
(1 row)

Time: 5.210 ms
swh-scrubber=> select * from check_config;
 id | datastore | object_type | nb_partitions | check_hashes | check_references |           name            | comment
----+-----------+-------------+---------------+--------------+------------------+---------------------------+---------
  1 |         1 | directory   |          4096 | t            | f                | journal-checker-directory |
  2 |         1 | snapshot    |          4096 | t            | f                | journal-checker-snapshot  |
  3 |         1 | release     |          4096 | t            | f                | journal-checker-release   |
  4 |         1 | revision    |          4096 | t            | f                | journal-checker-revision  |
(4 rows)

Time: 7.947 ms