  1. Apr 06, 2021
  2. Mar 30, 2021
  3. Mar 26, 2021
  4. Mar 22, 2021
  5. Mar 15, 2021
    • Document the existing metadata formats · 8dd9f7b6
      vlorentz authored
      v0.25.0
      8dd9f7b6
    • content_add: Write to the objstorage before the DB or Kafka · ffc0841b
      vlorentz authored
      Must add to the objstorage before the DB and journal. Otherwise:
      1. in case of a crash the DB may "believe" we have the content, but
         we didn't have time to write to the objstorage before the crash
      2. the objstorage mirroring, which reads from the journal, may attempt to
         read from the objstorage before we finished writing it
      
      This is already done in the postgresql backend unintentionally since
      209de5db.
      
      This commit documents it, makes the cassandra backend behave that way too,
      and adds a test.
      ffc0841b
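      The write ordering described in this commit can be illustrated with a
      minimal sketch. The classes below (InMemoryObjStorage, SketchStorage) are
      hypothetical stand-ins, not the swh-storage implementation, and the
      relative order of the DB and journal writes is an assumption; the commit
      only requires that the objstorage write comes first.

```python
from typing import Dict, List


class InMemoryObjStorage:
    """Hypothetical stand-in for an object storage (object id -> raw bytes)."""

    def __init__(self) -> None:
        self.objects: Dict[str, bytes] = {}

    def add(self, data: bytes, obj_id: str) -> None:
        self.objects[obj_id] = data


class SketchStorage:
    """Hypothetical storage that records the order of its write steps."""

    def __init__(self) -> None:
        self.objstorage = InMemoryObjStorage()
        self.db: Dict[str, int] = {}  # obj_id -> length (metadata only)
        self.journal: List[str] = []  # obj_ids announced to journal consumers
        self.steps: List[str] = []  # trace of the write order

    def content_add(self, obj_id: str, data: bytes) -> None:
        # 1. Write the bytes first. If the process crashes right after this
        #    step, the DB does not reference an object that is missing.
        self.objstorage.add(data, obj_id)
        self.steps.append("objstorage")
        # 2. Record the metadata only once the bytes are stored.
        self.db[obj_id] = len(data)
        self.steps.append("db")
        # 3. Announce the object last (assumed order), so that a mirror
        #    reading the journal can already fetch it from the objstorage.
        self.journal.append(obj_id)
        self.steps.append("journal")


storage = SketchStorage()
storage.content_add("sha1:abc", b"hello")
```

      With this ordering, a crash between steps can leave an unreferenced
      object in the objstorage, but never a DB row or journal entry pointing
      at missing data.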
  6. Mar 12, 2021
    • storage: Allow to filter out branches by prefix when counting them · b565201d
      Antoine Lambert authored
      Add an optional branch_name_exclude_prefix parameter to the
      snapshot_count_branches method of the Storage interface.
      
      It filters out branches whose names start with a given prefix
      when counting.
      
      The purpose is to get accurate counters in swh-web as pull
      request branches will be filtered out by default.
      
      Related to T2782
      b565201d
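      A minimal sketch of the counting behaviour, assuming branch names are
      bytes and that counts are grouped by target type. The count_branches
      helper and the sample data are hypothetical; only the
      branch_name_exclude_prefix parameter name comes from the commit message.

```python
from collections import Counter
from typing import Dict, Optional


def count_branches(
    branches: Dict[bytes, Optional[str]],
    branch_name_exclude_prefix: Optional[bytes] = None,
) -> Dict[Optional[str], int]:
    """Count branches per target type, skipping names with the given prefix."""
    counts: Counter = Counter()
    for name, target_type in branches.items():
        if branch_name_exclude_prefix and name.startswith(branch_name_exclude_prefix):
            continue  # e.g. pull request branches
        counts[target_type] += 1
    return dict(counts)


# Hypothetical snapshot: branch name -> target type
branches = {
    b"refs/heads/master": "revision",
    b"refs/pull/1/head": "revision",
    b"refs/tags/v1.0": "release",
}
all_counts = count_branches(branches)
filtered_counts = count_branches(branches, branch_name_exclude_prefix=b"refs/pull/")
```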
    • storage: Add branch names filtering support in snapshot_get_branches · 93301a1f
      Antoine Lambert authored
      Add an optional branch_name_include_substring parameter to
      snapshot_get_branches. If provided, only branches whose names contain the
      given substring are returned.
      
      Add an optional branch_name_exclude_prefix parameter to
      snapshot_get_branches. If provided, branches whose names start with the
      given prefix are not returned.
      
      Purpose of these new features: add a search form in the branches view
      of swh-web and filter out pull request branches (whose names start with
      "refs/pull/") by default.
      
      Related to T2782
      93301a1f
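      The two filters can be sketched as follows. The filter_branches helper
      and the sample branches are hypothetical, and branch names are assumed
      to be bytes; the parameter names are the ones given in the commit
      message.

```python
from typing import Dict, Optional


def filter_branches(
    branches: Dict[bytes, str],
    branch_name_include_substring: Optional[bytes] = None,
    branch_name_exclude_prefix: Optional[bytes] = None,
) -> Dict[bytes, str]:
    """Return the branches that pass both optional name filters."""
    selected: Dict[bytes, str] = {}
    for name, target in sorted(branches.items()):
        if (
            branch_name_include_substring is not None
            and branch_name_include_substring not in name
        ):
            continue  # name does not contain the requested substring
        if (
            branch_name_exclude_prefix is not None
            and name.startswith(branch_name_exclude_prefix)
        ):
            continue  # name starts with the excluded prefix
        selected[name] = target
    return selected


# Hypothetical snapshot: branch name -> target (placeholder strings)
branches = {
    b"refs/heads/feature-x": "rev-1",
    b"refs/heads/master": "rev-2",
    b"refs/pull/7/feature": "rev-3",
}
searched = filter_branches(branches, branch_name_include_substring=b"feature")
without_pulls = filter_branches(branches, branch_name_exclude_prefix=b"refs/pull/")
both = filter_branches(
    branches,
    branch_name_include_substring=b"feature",
    branch_name_exclude_prefix=b"refs/pull/",
)
```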
  7. Mar 11, 2021
    • Add ExtID query support to the Storage · b8e10f00
      David Douard authored
      These endpoints allow adding ExtIDs and querying the storage for the known
      ExtIDs of a SWHID (typically to get the original VCS's intrinsic revision
      identifier from a SWHID).
      
      The underlying data structure is to be filled typically by loaders using
      the `extid_add()` endpoint.
      
      This only provides the PostgreSQL implementation.
      
      Related to T2849.
      b8e10f00
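      As an illustration of the add-then-query pattern, here is a minimal
      in-memory sketch. ExtIDRecord, ExtIDStore and lookup_by_swhid are
      hypothetical names, and the integer return value of extid_add() is an
      assumption; only the extid_add() endpoint name comes from the commit
      message.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class ExtIDRecord:
    """Hypothetical record mapping an external identifier to a SWHID."""

    extid_type: str  # illustrative label for the kind of identifier
    extid: bytes  # identifier as known by the original VCS
    target: str  # SWHID of the archived object


class ExtIDStore:
    """Hypothetical in-memory store, filled by loaders and queried by SWHID."""

    def __init__(self) -> None:
        self._records: Set[ExtIDRecord] = set()

    def extid_add(self, records: Iterable[ExtIDRecord]) -> int:
        """Add records and return how many were not already known."""
        new = set(records) - self._records
        self._records |= new
        return len(new)

    def lookup_by_swhid(self, swhid: str) -> List[ExtIDRecord]:
        """Return the known ExtIDs for a SWHID, in a stable order."""
        matches = (r for r in self._records if r.target == swhid)
        return sorted(matches, key=lambda r: (r.extid_type, r.extid))


store = ExtIDStore()
record = ExtIDRecord(
    extid_type="hg-nodeid",
    extid=bytes.fromhex("ab" * 20),
    target="swh:1:rev:" + "0" * 40,
)
added_first = store.extid_add([record])
added_again = store.extid_add([record])  # adding twice is a no-op
found = store.lookup_by_swhid(record.target)
```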
  8. Mar 10, 2021
  9. Mar 03, 2021
  10. Mar 01, 2021
  11. Feb 25, 2021
  12. Feb 19, 2021
    • storage: Implement visit types filtering in origin_search method · f3ef6e6a
      Antoine Lambert authored
      Enable filtering searched origins by visit type.
      
      Add a new optional visit_types parameter to the origin_search method in
      StorageInterface.
      
      Implement visit types filtering in the storage backends: an origin will be
      returned if it has any of the requested visit types.
      
      This is clearly not designed to be used in production due to performance
      issues, but rather in testing environments with a small archive dataset.
      
      Related to T2869
      v0.23.2
      f3ef6e6a
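      The "any of the requested visit types" rule can be sketched as below.
      This origin_search function is a hypothetical stand-alone helper over an
      in-memory mapping, not the StorageInterface method; its linear scan also
      hints at why such filtering suits small test datasets only.

```python
from typing import Dict, List, Optional, Set


def origin_search(
    origins: Dict[str, Set[str]],
    url_pattern: str,
    visit_types: Optional[List[str]] = None,
) -> List[str]:
    """Return matching origin URLs, optionally restricted by visit type."""
    results: List[str] = []
    for url, types in sorted(origins.items()):
        if url_pattern not in url:
            continue
        # Keep the origin if it has at least one of the requested types.
        if visit_types is not None and not types.intersection(visit_types):
            continue
        results.append(url)
    return results


# Hypothetical data: origin URL -> visit types seen for that origin
origins = {
    "https://example.org/repo-a": {"git"},
    "https://example.org/repo-b": {"hg", "git"},
    "https://example.org/repo-c": {"svn"},
}
unfiltered = origin_search(origins, "example.org")
hg_or_svn = origin_search(origins, "example.org", visit_types=["hg", "svn"])
```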
  13. Feb 17, 2021
  14. Feb 16, 2021
  15. Feb 09, 2021
  16. Feb 08, 2021
  17. Feb 04, 2021
    • buffer: ensure objects are flushed in topological order · efd8815b
      Nicolas Dandrimont authored
      This new integration test checks that, when flushing the buffer storage,
      the addition functions of the underlying storage backend are called in
      topological order (content, directory, revision, release then snapshot).
      
      This reduces the probability of "data consistency" regressions caused by
      the use of the buffering storage proxy alone.
      efd8815b
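      A minimal sketch of flushing in topological order, with a recording
      backend in the spirit of the integration test described above.
      RecordingBackend and BufferingProxy are hypothetical classes, not the
      swh-storage buffer proxy; only the object type order comes from the
      commit message.

```python
from typing import Dict, List

# Topological order given in the commit message.
OBJECT_TYPES = ("content", "directory", "revision", "release", "snapshot")


class RecordingBackend:
    """Hypothetical backend that records which add call ran, in order."""

    def __init__(self) -> None:
        self.calls: List[str] = []

    def add(self, object_type: str, objects: List[str]) -> None:
        self.calls.append(object_type)


class BufferingProxy:
    """Hypothetical proxy that buffers objects until flush() is called."""

    def __init__(self, backend: RecordingBackend) -> None:
        self.backend = backend
        self._buffers: Dict[str, List[str]] = {t: [] for t in OBJECT_TYPES}

    def add(self, object_type: str, obj: str) -> None:
        self._buffers[object_type].append(obj)

    def flush(self) -> None:
        # Iterate in dependency order, regardless of insertion order.
        for object_type in OBJECT_TYPES:
            pending = self._buffers[object_type]
            if pending:
                self.backend.add(object_type, pending)
                self._buffers[object_type] = []


backend = RecordingBackend()
proxy = BufferingProxy(backend)
# Objects are deliberately buffered out of order.
proxy.add("snapshot", "snp-1")
proxy.add("revision", "rev-1")
proxy.add("content", "cnt-1")
proxy.flush()
```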
    • Return an accurate summary from buffer's flush() method · 1526107b
      Nicolas Dandrimont authored
      The earlier implementation would only return summary data from keys that
      existed in the last `_add` backend method run, rather than collating all
      the results.
      1526107b
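      The difference between keeping only the last result and collating all
      of them can be sketched as follows. The collate_summaries helper and the
      summary keys are illustrative assumptions, not the actual flush()
      implementation.

```python
from collections import Counter
from typing import Dict, Iterable


def collate_summaries(summaries: Iterable[Dict[str, int]]) -> Dict[str, int]:
    """Sum the counters returned by each backend add call."""
    total: Counter = Counter()
    for summary in summaries:
        total.update(summary)  # adds counts key by key
    return dict(total)


# Hypothetical results of three successive backend add calls
per_call = [
    {"content:add": 2, "content:add:bytes": 10},
    {"directory:add": 1},
    {"revision:add": 3},
]
last_only = per_call[-1]  # what a "last call wins" summary would report
collated = collate_summaries(per_call)
```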
    • buffer: add support for snapshots · 5b3e6c9f
      Nicolas Dandrimont authored
      This is mostly a consistency addition, considering that most (if not
      all) loaders will only add a single snapshot.
      
      The common pattern of loading objects in topological order (content >
      directory > revision > release > snapshot), then flushing the storage,
      is now fully consistent. Without this addition, the snapshot addition
      would reach the backend storage before all other objects are added,
      leading to potential inconsistencies if the flush of other object types
      fails.
      5b3e6c9f
    • Nicolas Dandrimont's avatar
      18967ed4
  18. Feb 01, 2021