Skip to content
Snippets Groups Projects
  1. Jul 10, 2023
  2. Nov 15, 2022
  3. Oct 07, 2022
    • Antoine Lambert's avatar
      rubygems: Use gems database dump to improve listing output · 108816f2
      Antoine Lambert authored
      Instead of using an undocumented rubygems HTTP endpoint that only
      gives us the names of the gems, prefer to exploit the daily PostgreSQL
      dump of the rubygems.org database.
      
      It enables to list all gems but also all versions of a gem and its
      release artifacts. For each relase artifact, the following info are
      extracted: version, download URL, sha256 checksum, release date
      plus a couple of extra metadata.
      
      The lister will now set list of artifacts and list of metadata as extra
      loader arguments when sending a listed origin to the scheduler database.
      A last_update date is also computed which should ensure loading tasks
      for rubygems will be scheduled only when new releases are available since
      last loadings.
      
      To be noted, the lister will spawn a temporary postgres instance so this
      require the initdb executable from postgres server installation to be
      available in the execution environment.
      
      Related to T1777
      108816f2
  4. Aug 09, 2022
  5. Aug 05, 2022
  6. Apr 21, 2022
  7. Dec 08, 2021
  8. Nov 29, 2021
    • Boris Baldassari's avatar
      lister: Add new maven lister · 8991c625
      Boris Baldassari authored
      The Maven lister retrieves the maven central indexes, exports them in a
      convenient text format, and parse them to identify all src archives and
      pom files in the maven repository. Then the pom files are downloaded and
      analysed to find and yield any scm reference.
      
      Note: This is a new version of the maven lister diff D6133 which takes
      into account the initial round of reviews.
      
      Related to T1724
      8991c625
  9. Feb 05, 2021
  10. Feb 02, 2021
    • Antoine Lambert's avatar
      Remove no longer used legacy Lister API and update CLI options · 89335445
      Antoine Lambert authored
      Legacy Lister classes from the swh.lister.core mdule are no longer
      used in swh-lister codebase so it is time to remove them.
      
      Also remove lister CLI options related to legacy Lister API.
      
      As a consequence, the following requirements are no longer needed:
      arrow, SQLAlchemy, sqlalchemy-stubs and testing.postgresql.
      
      Closes T2442
      89335445
    • Antoine Lambert's avatar
      gnu: Remove dependency on pytz · 82ab96ad
      Antoine Lambert authored
      UTC timezone settings can be obtained from the datetime.timezone
      module from Python standard library so remove dependency on external
      pytz module.
      82ab96ad
  11. Jan 18, 2021
    • Antoine Lambert's avatar
      lister: Add utility decorator to ease HTTP requests rate limit handling · d1fbccd9
      Antoine Lambert authored
      Add swh.lister.utils.throttling_retry decorator enabling to retry a
      function that performs an HTTP request who can return a 429 status code.
      
      The implementation is based on the tenacity module and it is assumed
      that the requests library is used when querying an URL.
      
      The default wait strategy is based on exponential backoff.
      
      The default max number of attempts is set to 5, HTTPError exception
      will then be reraised.
      
      All tenacity.retry parameters can also be overridden in client code.
      d1fbccd9
  12. Apr 11, 2020
  13. Nov 04, 2019
  14. Jun 28, 2019
    • Archit Agrawal's avatar
      swh.lister.cgit · b972a2a8
      Archit Agrawal authored
      Implemented a lister to list the repos for a given CGit instance.
      
      Closes T1659
      b972a2a8
  15. Feb 01, 2019
  16. Oct 30, 2017
  17. Sep 05, 2017
  18. Apr 12, 2017
  19. Mar 06, 2017
    • Avi Kelman's avatar
      Refactor lister code · 68d77fd4
      Avi Kelman authored
      Streamline production of new listers by aggressively moving core
      functionality into progressively inherited (A->B->C) base classes
      with the transport layer abstracted.
      This should make common individual forge listers straightforward to
      produce with minimal customization. Github and Bitbucket listers
      can be used as examples of the indexing type.
      68d77fd4
  20. Feb 09, 2017
  21. Dec 15, 2016
  22. Oct 20, 2016
  23. Oct 19, 2016
  24. Sep 13, 2016
  25. Mar 17, 2016
  26. Mar 09, 2016
  27. Sep 21, 2015
Loading