  1. Nov 23, 2022
  2. Nov 21, 2022
  3. Nov 03, 2022
  4. Nov 02, 2022
  5. Oct 27, 2022
  6. Oct 26, 2022
  7. Oct 25, 2022
  8. Oct 24, 2022
  9. Oct 18, 2022
  10. Oct 07, 2022
  11. Sep 28, 2022
  12. Sep 27, 2022
  13. Sep 12, 2022
  14. Sep 08, 2022
    • npm: Do not generate URIs with spaces in them · 6d7efad9
      vlorentz authored
      Spaces in URIs make rdflib complain, and such URIs are invalid anyway
      v2.6.0
      6d7efad9
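One way to avoid emitting such URIs can be sketched as follows. This is a minimal illustration, not the code from the commit: the helper name `npm_package_uri` and the base URL are assumptions, and the actual change may handle these values differently (for example by percent-encoding instead of skipping them).

```python
from typing import Optional
from urllib.parse import quote

# Assumed base URL, used here only for illustration.
NPM_BASE = "https://www.npmjs.com/package/"


def npm_package_uri(name: str) -> Optional[str]:
    """Hypothetical helper: build a URI for a package name, or return None
    when the name contains whitespace and so cannot form a valid URI."""
    name = name.strip()
    if not name or any(ch.isspace() for ch in name):
        # Skip the value rather than hand an invalid URI to the RDF layer.
        return None
    # Keep '@' and '/' so scoped names such as '@scope/pkg' stay readable.
    return NPM_BASE + quote(name, safe="@/")
```

Returning `None` lets the caller drop the triple entirely, which keeps invalid identifiers out of the generated graph.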
    • Convert SWHID to str before passing to sentry_sdk.set_tag · f4e08f95
      vlorentz authored
      Sentry uses repr() by default, which does not look good in a UI
      f4e08f95
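The difference between the two representations can be seen with a stand-in class. This is only a sketch: `FakeSWHID` is a hypothetical substitute for the real SWHID type, and the tag name in the final comment is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FakeSWHID:
    """Hypothetical stand-in for a SWHID object (not the real swh.model class)."""

    object_type: str
    object_id: str

    def __str__(self) -> str:
        # Compact, human-readable identifier.
        return f"swh:1:{self.object_type}:{self.object_id}"


swhid = FakeSWHID("dir", "0" * 40)

# repr() shows the constructor form, which is noisy as a tag value.
print(repr(swhid))
# str() shows the compact identifier, which reads well in a UI.
print(str(swhid))

# With the SDK installed, the explicit conversion would look like this
# (tag name assumed for illustration):
# sentry_sdk.set_tag("swhid", str(swhid))
```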
    • Fix crash when indexing the same directory twice with non-deterministic order · b6385cec
      vlorentz authored
      persist_index_computations deduplicated rows based on their entire
      content, but PostgreSQL enforces that 'id' is unique.
      
      This was not an issue in older versions of swh-indexer, because all
      operations were deterministic for a given input directory.
      
      The recent switch to rdflib introduced non-determinism, so different
      outputs may be returned for the same directory id, and content-based
      deduplication no longer prevents duplicate ids.
      
      With this commit, deduplication is now done on 'id', as expected.
      
      As a side-effect, persist_index_computations is now more efficient
      because:
      
      1. it runs in linear time instead of quadratic in the number of
         metadata items
      2. it only compares dir ids, instead of the content of indexed metadata
         (which is arbitrarily large JSON-like data)
      b6385cec
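Deduplicating on 'id' in linear time can be sketched with a dictionary keyed by id. This is a minimal illustration under stated assumptions, not the code from the commit: rows are modelled as plain dicts with an "id" key, the function name `dedup_by_id` is hypothetical, and keeping the last row seen for each id is an arbitrary choice.

```python
from typing import Any, Dict, Hashable, Iterable, List


def dedup_by_id(rows: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep one row per 'id' in a single pass over the input."""
    by_id: Dict[Hashable, Dict[str, Any]] = {}
    for row in rows:
        # Only the id is hashed and compared; the metadata payload,
        # which may be arbitrarily large, is never inspected.
        by_id[row["id"]] = row  # assumption: the last row for an id wins
    return list(by_id.values())


rows = [
    {"id": b"dir1", "metadata": {"keywords": ["a", "b"]}},
    # Same directory, different output order from a non-deterministic step.
    {"id": b"dir1", "metadata": {"keywords": ["b", "a"]}},
    {"id": b"dir2", "metadata": {"keywords": ["c"]}},
]
unique = dedup_by_id(rows)
```

Comparing whole rows would treat the first two entries as distinct and let both reach the database, where the unique constraint on 'id' rejects the second.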
    • github: Add support for 'topics' · dd027419
      vlorentz authored
      dd027419