Skip to content
Snippets Groups Projects
  1. Oct 17, 2023
  2. Oct 16, 2023
  3. Oct 12, 2023
  4. Oct 11, 2023
  5. Oct 10, 2023
  6. Oct 04, 2023
  7. Oct 03, 2023
  8. Oct 02, 2023
  9. Sep 28, 2023
  10. Sep 27, 2023
  11. Sep 26, 2023
    • Antoine R. Dumont's avatar
      Revert "Update metrics extraction for swh-scrubber v2" · 6de0c152
      Antoine R. Dumont authored
      This reverts commit 8d603bd5.
      
      It's not working as expected [1].
      Scrubber db version 2.0.3 (top-level commit of the repo).
      Db migrated accordingly [3]
      The update scrubber prometheus query from this change is failing to execute.
      And as an unfortunate side-effect, this fails the other prometheus queries [1].
      Resulting in alerts [2].
      
      [1] production extract:
      ```
      root@belvedere:~# journalctl -xef -u prometheus-sql-exporter.service | grep "{" | cut -d':' -f4- | jq .
      ...
      {
        "caller": "job.go:205",
        "err": "pq: column checked_partition.datastore does not exist",
        "job": "12/secondary/swh-scrubber/swh_scrubber_coverage",
        "level": "warn",
        "msg": "Failed to run query",
        "query": "swh_scrubber_coverage",
        "ts": "2023-09-26T14:24:59.91138679Z"
      }
      {
        "caller": "job.go:205",
        "err": "zero rows returned",
        "job": "12/main/postgres",
        "level": "warn",
        "msg": "Failed to run query",
        "query": "pg_stat_user_tables",
        "ts": "2023-09-26T14:55:06.562692017Z"
      }
      ```
      [2]
      ```
      16:43 <+swhbot> icinga PROBLEM: service Postgresql replication lag (belvedere -> massmoca) on belvedere.internal.softwareheritage.org is UNKNOWN: UNKNOWN - unable to parse prometheus response pg replication_lag belvedere massmoca is null
      ```
      
      [3] swh/infra/sysadm-environment#4992
      6de0c152
    • David Douard's avatar
      Update metrics extraction for swh-scrubber v2 · 8d603bd5
      David Douard authored
      Adapt the SQL query for the swh_scrubber_coverage metrics and add a new
      swh_scrubber_ongoing_check one giving some insight on the currently
      running checks.
      8d603bd5
  12. Sep 25, 2023
  13. Sep 22, 2023
  14. Sep 20, 2023
  15. Sep 19, 2023
  16. Sep 13, 2023
  17. Sep 11, 2023
  18. Sep 08, 2023
  19. Sep 05, 2023
  20. Sep 04, 2023
Loading