Skip to content
Snippets Groups Projects
  1. May 15, 2023
  2. Apr 26, 2023
  3. Apr 25, 2023
  4. Apr 13, 2023
    • Antoine Lambert's avatar
      package/utils: Use http_retry decorator from swh.core.retry · 900f8184
      Antoine Lambert authored
      The http_retry decorator from swh-lister has been moved to swh-core so
      we can now use it in swh-loader-core instead of duplicating retry code.
      
      Moreover, it also enables to retry HTTP requests on errors like 502, 503
      or 504 instead on simply retrying on 429.
      900f8184
  5. Apr 06, 2023
  6. Apr 05, 2023
    • Antoine Lambert's avatar
      debian: Handle invalid version string when parsing changelog · 9868dcbc
      Antoine Lambert authored
      Some packages can have an invalid version string in their changelog
      which raises a ValueError when attempting to parse it in the
      get_intrinsic_package_metadata method of the loader.
      
      As a consequence each package release with such bogus entry in the
      changelog was discarded from the snapshot created by the loader.
      
      So prefer to get the raw string version instead of parsing it to
      workaround that issue.
      
      Fix #1493.
      9868dcbc
    • Antoine R. Dumont's avatar
      Make core loader use swh.loader.core.nar module · f41a28d1
      Antoine R. Dumont authored
      This pushes options from cli to the Nar class. This also bootstraps tests reusing
      existing test cases from before (using the nix binary).
      
      Refs. swh/meta#4979
      f41a28d1
    • Antoine R. Dumont's avatar
      Add swh.loader.core.nar module · 23cb3549
      Antoine R. Dumont authored
      This also exposes a `swh nar` cli.
      
      ```
      $ Usage: swh nar [OPTIONS] DIRECTORY
      
        Compute NAR hashes on a directory.
      
      Options:
        -x, --exclude-vcs               exclude version control directories
        -H, --hash-algo [sha256|sha1]
        -f, --format-output [hex|base32|base64]
        --debug / --no-debug
        -h, --help                      Show this message and exit.
      ```
      
      Refs. swh/meta#4979
      23cb3549
  7. Apr 04, 2023
  8. Mar 27, 2023
    • Jérémy Bobbio (Lunar)'s avatar
      Fix presentation of the loader specification table · f47294fd
      Jérémy Bobbio (Lunar) authored
      After switching to the PyData Sphinx theme, the very large table
      with the loader specification became too wide to be readable.
      These changes make the table scrollable again, remove the right sidebar,
      highlight the name column and add stripe to rows.
      
      They are stopgap measure as such a table might not the best way to
      present this information.
      f47294fd
  9. Mar 08, 2023
  10. Feb 23, 2023
  11. Feb 20, 2023
  12. Feb 17, 2023
  13. Feb 13, 2023
  14. Feb 02, 2023
  15. Jan 13, 2023
    • vlorentz's avatar
      Move py.typed from swh/loader/{package,core}/ to swh/loader/ · fed8fc3e
      vlorentz authored
      There is code in swh/loader/cli.py, and swh-loader-metadata will need
      to import cli.py, causing mypy to complain when py.typed is missing.
      fed8fc3e
    • Antoine R. Dumont's avatar
      Allow partial snapshot creation during ingestion · fc1adf07
      Antoine R. Dumont authored
      This introduces a `create_partial_snapshot` parameter to the base loader constructor.
      When activated, during each call of the `store_data` method, if there are more data to
      fetch, this will create a partial snapshot (and an associated visit status).
      
      The final loop behaves as before, create the last visit with status 'full' targeting the
      final snapshot.
      
      The main difference between the 2 behaviors is that an ingestion with that parameter on
      is more verbose in terms of origin_visit_status. This, in turn, allows to be incremental
      in subsequent visits for the same origin. This may especially be interesting for cases
      when loading fail due to out of hand resources issues (e.g. large svn or git
      repositories).
      
      Related to T3625
      fc1adf07
  16. Dec 20, 2022
    • Antoine Lambert's avatar
      conda: Fix versions sorting and update release names · a63b39e5
      Antoine Lambert authored
      Release 22.0 of packaging module can no longer parse invalid Python version
      number, an exception is now raised.
      
      Conda loader used the keys of the packages dict as version numbers to sort,
      which are in the form "<arch>/<version>-<build>", but those cannot be parsed
      anymore.
      
      So extract intrinsic version numbers of packages instead to sort the list of
      versions.
      
      Also update snapshot release names to "<version>-<build>-<arch>" as each
      release for a given architecture targets a different directory.
      a63b39e5
    • Antoine Lambert's avatar
      rpm: Fix package versions sorting · b6231045
      Antoine Lambert authored
      Release 22.0 of packaging module can no longer parse invalid Python version
      number, an exception is now raised.
      
      RPM loader used the keys of the packages dict as version numbers to sort,
      which are in the form "<distribution>/<edition>/<package_version_number>",
      but those cannot be parsed anymore.
      
      So use intrinsic version numbers of packages instead to sort the list of
      versions.
      b6231045
    • vlorentz's avatar
      e7ac7a34
  17. Dec 19, 2022
  18. Nov 21, 2022
  19. Nov 16, 2022
  20. Nov 15, 2022
  21. Nov 14, 2022
  22. Nov 03, 2022
  23. Nov 02, 2022
  24. Oct 27, 2022
  25. Oct 26, 2022
Loading