  1. Jul 10, 2023
    • Add Gitweb lister · 573958ce
      Antoine R. Dumont authored
      Instances differ, so the lister applies instance-specific heuristics. Some instances:
      - have summary pages which do not list metadata_url (so some
        computation happens to list git:// origins which are cloneable)
      - have summary pages which reference metadata_url as multiple comma-separated urls
      - list relative urls of the repository, so we need to join them with the main instance url
        to have complete cloneable origins (or summary pages)
      - list "down" http origins (cloning those won't work), so the lister lists those as cloneable https
        ones (when the main url is behind https).
      
      Refs. swh/devel/swh-lister#1800
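
The URL heuristics described in this commit message can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the lister's actual code: the function names `split_metadata_urls` and `to_cloneable_url` and the example hosts are hypothetical.

```python
from typing import List
from urllib.parse import urljoin, urlparse


def split_metadata_urls(metadata_url: str) -> List[str]:
    """Split a comma-separated metadata_url value into individual urls."""
    return [url.strip() for url in metadata_url.split(",") if url.strip()]


def to_cloneable_url(instance_url: str, repo_url: str) -> str:
    """Resolve a possibly relative repository url against the instance url.

    When the instance itself is served over https, an http origin is
    rewritten to https (assumption: the http endpoint is the "down" one).
    """
    absolute = urljoin(instance_url, repo_url)
    parsed = urlparse(absolute)
    if parsed.scheme == "http" and urlparse(instance_url).scheme == "https":
        absolute = parsed._replace(scheme="https").geturl()
    return absolute
```

For instance, `to_cloneable_url("https://git.example.org/", "projects/foo.git")` joins the relative path onto the instance url, while an absolute `http://` url on an https instance is rewritten to `https://`.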
      573958ce
  2. Jun 23, 2023
  3. Mar 14, 2023
  4. Nov 15, 2022
  5. Oct 03, 2022
  6. Sep 29, 2022
  7. Sep 27, 2022
  8. Aug 30, 2022
  9. Aug 29, 2022
  10. Aug 26, 2022
  11. Aug 19, 2022
  12. Aug 03, 2022
  13. Jun 15, 2022
    • Add arch lister module (origins from archives). · 1bf11aa2
      Franck Bret authored
      After a first attempt with D7812, this one uses a different strategy to
      retrieve origins.
      
      Fetch and extract "core.files.tar.gz", "extra.files.tar.gz" and "community.files.tar.gz" from archives.archlinux.org. That step ensures that we have a list of "official" packages.
      Parse metadata from the 'desc' file to build origin urls.
      Scrape the origin url to get artifacts metadata that lists all versions of a package.
      
      It also fetches and extracts unofficial 'arm' packages from archlinuxarm.org, but in this case we cannot get all versions of an arm package.
      
      Related T4233
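
The "parse metadata from the 'desc' file" step might look like the sketch below. It assumes the usual pacman layout, where a `%KEY%` header line is followed by one or more value lines and a blank line. The helper names `parse_desc` and `build_origin_url`, and the package-page url pattern, are assumptions made for illustration rather than the lister's own code.

```python
from typing import Dict, List


def parse_desc(text: str) -> Dict[str, List[str]]:
    """Parse a pacman 'desc' file into a mapping of field name to values."""
    fields: Dict[str, List[str]] = {}
    key = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            # A blank line closes the current field.
            key = None
        elif len(line) > 2 and line.startswith("%") and line.endswith("%"):
            key = line[1:-1]
            fields[key] = []
        elif key is not None:
            fields[key].append(line)
    return fields


def build_origin_url(fields: Dict[str, List[str]], repo: str, arch: str) -> str:
    """Build an origin url for a package (url pattern is an assumption)."""
    name = fields["NAME"][0]
    return f"https://archlinux.org/packages/{repo}/{arch}/{name}"
```

Multi-valued fields are kept as lists, so a single-valued field such as `NAME` is read with `fields["NAME"][0]`.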
      1bf11aa2
  14. Mar 28, 2022
  15. Nov 29, 2021
    • lister: Add new maven lister · 8991c625
      Boris Baldassari authored
      The Maven lister retrieves the maven central indexes, exports them in a
      convenient text format, and parses them to identify all src archives and
      pom files in the maven repository. Then the pom files are downloaded and
      analysed to find and yield any scm reference.
      
      Note: This is a new version of the maven lister diff D6133 which takes
      into account the initial round of reviews.
      
      Related to T1724
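
As a rough illustration of the last step (analysing a pom file to find an scm reference), the sketch below reads the `<scm>` element with the standard library. It assumes the pom declares the usual POM 4.0.0 namespace. The function name `find_scm_url` is hypothetical and the real lister may proceed differently.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# Default namespace declared by standard pom files.
POM_NS = {"m": "http://maven.apache.org/POM/4.0.0"}


def find_scm_url(pom_text: str) -> Optional[str]:
    """Return the first scm reference found in a pom file, if any."""
    root = ET.fromstring(pom_text)
    # Prefer the machine-readable connection, fall back to the browsable url.
    for path in ("m:scm/m:connection", "m:scm/m:url"):
        node = root.find(path, POM_NS)
        if node is not None and node.text:
            return node.text.strip()
    return None
```

The value is returned as written in the pom (for example with an `scm:git:` prefix), so a caller would still need to normalise it before using it as an origin url.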
      8991c625
  16. Jul 06, 2021
  17. May 26, 2021
    • tuleap: initialise lister. · 04c0a507
      Boris Baldassari authored
      tuleap-lister: fix args in test_task.
      
      tuleap-lister: Add rate-limiting test + fix debug and typo.
      
      tuleap-lister: code review: fix mocker + tests/setup_cli.
      
      tuleap-lister: code review: fix relister > lister.
      
      tuleap-lister: code review: fix test_task kwargs.
      
      tuleap-lister: code review: Remove authentication useless lines + fix typos.
      
      tuleap-lister: code review: improve results_simplified for svn repos.
      
      tuleap-lister: code review: add name to CONTRIBUTORS file.
      
      tuleap-lister: code review: Update tutorial for misc files to edit.
      
      tuleap-lister: code review: Update copyright to 2021 exactly.
      
      tuleap-lister: code review: Update py files perms -X.
      
      tuleap-lister: code review: minimise json files.
      
      tuleap-lister: code review: fix chmod on json files.
      
      tuleap-lister: code review: fix var names + add tests.
      
      tuleap-lister: code review: fix useless indirection.
      
      tuleap-lister: code review: Add empty repo test, minor typo fixes.
      04c0a507
  18. Mar 23, 2021
    • Add a non-incremental sourceforge lister · f7b27c69
      Raphaël Gomès authored
      Following zack's work on T735, this change introduces an actual SWH lister for
      SourceForge.
      
      SourceForge provides a main sitemap that lists sharded sitemaps, which
      themselves list pages. Each page belongs to a project (or sub-project,
      though those are rare), information about which can be found by querying
      a REST API, which gives us the list of any and all VCS used for said
      project. Both sitemaps and pages have a "last modified" timestamp that
      will be used in a future patch to implement incremental listing.
      
      More precise information can be found as inline comments or docstrings.
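
The sitemap traversal described above can be sketched as follows, assuming the sitemaps follow the standard sitemaps.org schema. The helper name `iter_locations` is hypothetical, and fetching the documents over HTTP and querying the REST API are deliberately left out.

```python
import xml.etree.ElementTree as ET
from typing import Iterator, Optional, Tuple

SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def iter_locations(
    xml_text: str, entry_tag: str
) -> Iterator[Tuple[str, Optional[str]]]:
    """Yield (location, last modified) pairs from a sitemap document.

    Use entry_tag="sitemap" for the main sitemap (which lists sharded
    sitemaps) and entry_tag="url" for a sharded sitemap (which lists pages).
    """
    root = ET.fromstring(xml_text)
    for entry in root.findall(f"sm:{entry_tag}", SITEMAP_NS):
        loc = entry.findtext("sm:loc", namespaces=SITEMAP_NS)
        lastmod = entry.findtext("sm:lastmod", namespaces=SITEMAP_NS)
        if loc:
            yield loc.strip(), lastmod
```

The `lastmod` value is returned alongside each location because it is the timestamp an incremental lister would compare against its stored state.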
      f7b27c69
  19. Sep 23, 2020
  20. Sep 17, 2020
  21. Aug 25, 2020
  22. Jun 25, 2020
  23. Jun 10, 2020
  24. Apr 29, 2020
  25. Apr 20, 2020
  26. Apr 11, 2020
  27. Apr 08, 2020
    • Enable black · 93a4d8b7
      David Douard authored
      - blackify all the python files,
      - enable black in pre-commit,
      - add a black tox environment.
      93a4d8b7
  28. Nov 22, 2019
  29. Sep 20, 2019
  30. Sep 03, 2019
    • implement listers as plugins · e3c0ea9d
      David Douard authored
      Listers are declared as plugins via the `swh.workers` entry_point.
      
      As such, the registry function is expected to return a dict with the
      `task_modules` field (as for generic worker plugins), plus:
      
      - `lister`: the lister class,
      - `models`: list of SQLAlchemy models used by this lister,
      - `init` (optional): hook (callable) used to initialize the lister's state
        (typically, create/initialize the database for this lister).
        If not set, the default implementation creates database tables (after
        optionally having deleted existing ones) according to models declared in
        the `models` register field.
      
      There is no need to explicitly add lister task modules in the main
      `conftest` module, but any new/extra lister to be tested must be registered
      (the tested lister module must be properly installed in the test environment).
      
      Also refactor the cli tools a bit:
      - add support for the standard --config-file option at the 'lister' group
        level,
      - move the --db-url to the 'lister' group,
      - drop the --lister option for the `swh lister db-init` cli tool:
        initializing (especially with --drop-tables) the database for a single
        lister is unreliable, since all tables are created using a single MetaData
        (in the same namespace).
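
A registry function following the contract described in this commit message could look like the sketch below. Every name in it (`ExampleLister`, `ExampleModel`, `init_example_db`, the `example_lister.tasks` module path) is a placeholder, and the signature of the `init` hook is an assumption, since the message only says it is a callable.

```python
from typing import Any, Dict


class ExampleLister:
    """Placeholder for a lister class."""


class ExampleModel:
    """Placeholder for an SQLAlchemy model used by the lister."""


def init_example_db(db_engine: Any, drop_tables: bool = False) -> None:
    """Optional hook initializing the lister's state (signature assumed)."""


def register() -> Dict[str, Any]:
    """Registry function exposed through the `swh.workers` entry point."""
    return {
        "task_modules": ["example_lister.tasks"],
        "lister": ExampleLister,
        "models": [ExampleModel],
        "init": init_example_db,  # optional; omit to use the default behaviour
    }


# The package would then declare the entry point, for example in setup.py
# (the entry name and module path here are hypothetical):
#   entry_points={"swh.workers": ["lister.example = example_lister:register"]}
```

Leaving out the `init` key falls back to the default behaviour described above, which creates the tables declared in `models`.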
      e3c0ea9d
  31. Jun 28, 2019
  32. May 22, 2019
  33. Feb 06, 2019
  34. Oct 30, 2018
  35. Oct 23, 2018