- Feb 17, 2022
-
-
Antoine Lambert authored
CVS projects differ from other VCS projects: they use the rsync protocol, the list of modules needs to be fetched from an info page, and multiple origin URLs can be produced for the same project. Related to T3789
-
Antoine R. Dumont authored
as the scheduler is now able to deduplicate it when recording listed origins. Related to T3945
-
Antoine R. Dumont authored
Related to T3945
-
Antoine R. Dumont authored
Prior to this commit, the listing could fail when reading either a page or the page of results (the Launchpad API raises RestfulError). The lister now retries when those kinds of exceptions happen. If the error persists (after multiple attempts with exponential backoff), the listing continues nonetheless (with warning logs). Note that if the page ends up being empty, it's no longer accounted for. This allows the listing to finish in case of issues. Related to T3945
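The retry behaviour described above can be sketched with the standard library alone. This is a minimal illustration, not the lister's actual code: `fetch_with_retry`, `get_pages`, `max_attempts`, `base_delay` and `sleep` are hypothetical names, and a generic `Exception` stands in for the Launchpad `RestfulError`.

```python
import logging
import time
from typing import Callable, Iterable, Iterator, List, Optional

logger = logging.getLogger(__name__)


def fetch_with_retry(
    fetch_page: Callable[[], List[str]],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[List[str]]:
    """Call fetch_page, retrying with exponential backoff on failure.

    Returns None (after warning logs) when every attempt failed, so the
    caller can carry on with the remaining pages.
    """
    for attempt in range(max_attempts):
        try:
            return fetch_page()
        except Exception as exc:  # assumption: stand-in for the API-specific error
            logger.warning("Attempt %d failed: %s", attempt + 1, exc)
            if attempt + 1 < max_attempts:
                # Delay doubles after each failed attempt: 1s, 2s, 4s, ...
                sleep(base_delay * 2**attempt)
    return None


def get_pages(
    fetchers: Iterable[Callable[[], List[str]]], **retry_options
) -> Iterator[List[str]]:
    """Yield non-empty pages only; failed or empty pages are skipped."""
    for fetch_page in fetchers:
        page = fetch_with_retry(fetch_page, **retry_options)
        if page:
            yield page
```

Passing `sleep` as a parameter is a design choice for the sketch: it lets tests record the delays instead of actually waiting.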
-
- Feb 16, 2022
-
-
Antoine R. Dumont authored
Related to T3945
-
- Feb 14, 2022
-
-
Raphaël Gomès authored
Bazaar support was removed a long time ago and predates a lot of the new mechanisms in place in the API. Unfortunately, it looks like a lot of the URLs are offline now, but a few projects can still be listed, and this is pretty low-effort.
-
- Feb 10, 2022
-
-
Antoine Lambert authored
-
Antoine Lambert authored
To install the new hook: $ pre-commit install -t commit-msg
-
- Feb 09, 2022
-
-
Antoine R. Dumont authored
We need to avoid using naive datetimes, as they fail during conversion. Related to T3746 Related to P1280
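As an illustration of the naive versus timezone-aware distinction, here is a minimal sketch. The helper name `ensure_aware` is hypothetical, and treating naive values as UTC is an assumption made for the example, not something the commit states.

```python
from datetime import datetime, timezone


def ensure_aware(dt: datetime) -> datetime:
    """Return a timezone-aware datetime.

    Assumption for this sketch: naive inputs are interpreted as UTC.
    """
    if dt.tzinfo is None or dt.tzinfo.utcoffset(dt) is None:
        return dt.replace(tzinfo=timezone.utc)
    return dt


naive = datetime(2022, 2, 9, 12, 0, 0)  # no tzinfo attached
aware = ensure_aware(naive)  # same wall-clock time, explicitly UTC
```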
-
- Feb 08, 2022
-
-
Boris Baldassari authored
-
Boris Baldassari authored
-
- Feb 07, 2022
-
-
Antoine R. Dumont authored
Related to T3916
-
- Jan 25, 2022
-
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
And log the metadata dictionary.
-
- Jan 24, 2022
-
-
vlorentz authored
-
- Jan 21, 2022
-
-
vlorentz authored
-
- Jan 19, 2022
-
-
Antoine R. Dumont authored
Related to D6967
-
- Dec 16, 2021
-
-
Antoine R. Dumont authored
This also drops spurious copyright headers from those files, if present. Related to T3812
-
- Dec 08, 2021
-
-
Antoine Lambert authored
Now that we have packaged tenacity 6.2 for debian buster and use it in production, we can remove the workarounds to support tenacity < 5.
-
- Dec 07, 2021
-
-
vlorentz authored
I would like to use it as the metadata authority URI in the loader, instead of '{p_url.scheme}://{p_url.netloc}/', which I do not think is accurate, as it is possible to have multiple Maven instances at the same netloc.
-
- Dec 06, 2021
-
-
Antoine Lambert authored
A Debian package can have sources coming from multiple suites, so we need to update the last_update field in the ListedOrigin model if the suite currently being processed has a greater modification time for its sources index. Related to T2400
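The update rule above amounts to keeping the maximum modification time seen across suites. A minimal sketch follows; the `Origin` dataclass is a hypothetical stand-in for the ListedOrigin model, and `record_suite` is an illustrative helper, not the lister's code.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, Optional


@dataclass
class Origin:
    """Hypothetical stand-in for the ListedOrigin model."""

    url: str
    last_update: Optional[datetime] = None


def record_suite(origins: Dict[str, Origin], url: str, index_mtime: datetime) -> None:
    """Register a package seen in one suite, keeping the newest index time."""
    origin = origins.setdefault(url, Origin(url=url))
    if origin.last_update is None or index_mtime > origin.last_update:
        origin.last_update = index_mtime


origins: Dict[str, Origin] = {}
older = datetime(2021, 11, 1, tzinfo=timezone.utc)
newer = datetime(2021, 12, 1, tzinfo=timezone.utc)
# The same package appears in two suites whose indexes have different mtimes.
record_suite(origins, "deb://example/pkg", newer)
record_suite(origins, "deb://example/pkg", older)
```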
-
- Dec 03, 2021
-
-
Antoine Lambert authored
Use the value of the "Last-Modified" header from the HTTP response to the Debian sources index request. This prevents creating loading tasks for Debian packages with no changes since the last listing. Related to T2400
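Parsing such a header can be sketched with the standard library. The function name `last_modified` and the sample header value are illustrative assumptions; a real HTTP client usually exposes case-insensitive headers, whereas this sketch uses a plain mapping with the exact key.

```python
from datetime import datetime
from email.utils import parsedate_to_datetime
from typing import Mapping, Optional


def last_modified(headers: Mapping[str, str]) -> Optional[datetime]:
    """Return the Last-Modified header as a datetime, or None if absent."""
    value = headers.get("Last-Modified")
    if value is None:
        return None
    # HTTP dates use the RFC 5322 style format handled by email.utils.
    return parsedate_to_datetime(value)


# Hypothetical response headers for a sources index request.
sample_headers = {"Last-Modified": "Fri, 03 Dec 2021 08:30:00 GMT"}
index_mtime = last_modified(sample_headers)
```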
-
Antoine Lambert authored
Not all Debian suites have the same set of components, so prefer logging that a component is missing for a suite instead of raising an exception that would stop the listing.
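The log-and-continue approach can be sketched as follows. The function `iter_indexes` and the `available` mapping (suite name to the components it actually provides) are hypothetical, and the log level is an arbitrary choice for the example.

```python
import logging
from typing import Dict, Iterator, List, Set, Tuple

logger = logging.getLogger(__name__)


def iter_indexes(
    available: Dict[str, Set[str]], components: List[str]
) -> Iterator[Tuple[str, str]]:
    """Yield (suite, component) pairs, skipping components a suite lacks."""
    for suite, present in available.items():
        for component in components:
            if component not in present:
                # Log and move on rather than raising and aborting the listing.
                logger.debug("Suite %s has no component %s", suite, component)
                continue
            yield suite, component
```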
-
Antoine Lambert authored
Remove no longer used date parameter in extra_loader_arguments. Related to T2400
-
- Dec 02, 2021
-
-
Antoine Lambert authored
For a given package, the debian lister generates a dictionary mapping distribution and version to a list of files to be processed by the debian loader. For each file to process, the debian loader expects to find a URI in order to download it and then use its content to ingest package source code into the archive. However, it turns out these URIs were not computed by the lister in its current implementation, making any debian loading task fail due to the missing information. So add the computation of these URIs and ensure they are provided in the debian loader input parameters. Related to T2400
-
- Dec 01, 2021
-
-
Nicolas Dandrimont authored
In some circumstances, GitHub will return two separate repos with the same html_url in the same page. This makes the lister fail with a cardinality error.
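One way to guard against such duplicates is to deduplicate each page by `html_url` before recording it. This is a minimal sketch under that assumption; the commit text does not say how the fix is implemented, and `dedupe_by_html_url` is a hypothetical helper.

```python
from typing import Any, Dict, Iterable, List


def dedupe_by_html_url(repos: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep the first repository seen for each html_url, preserving order."""
    seen: Dict[str, Dict[str, Any]] = {}
    for repo in repos:
        # setdefault leaves an existing entry untouched, so the first one wins.
        seen.setdefault(repo["html_url"], repo)
    return list(seen.values())
```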
-
- Nov 29, 2021
-
-
Boris Baldassari authored
The Maven lister retrieves the Maven Central indexes, exports them in a convenient text format, and parses them to identify all src archives and pom files in the Maven repository. Then the pom files are downloaded and analysed to find and yield any scm reference. Note: This is a new version of the maven lister diff D6133 which takes into account the initial round of reviews. Related to T1724
-
- Nov 23, 2021
-
-
Antoine R. Dumont authored
This fixes the master build [1] [1] https://jenkins.softwareheritage.org/view/swh-draft/job/DLS/job/tests/1625/console
-
- Nov 10, 2021
-
-
Antoine R. Dumont authored
-
- Nov 09, 2021
-
- Oct 22, 2021
-
-
Antoine Lambert authored
Related to T3645
-
Antoine Lambert authored
CRAN origins must be loaded with the cran visit type and not the tar one. Related to T3675
-
- Oct 11, 2021
-
-
Antoine R. Dumont authored
Related to T3470
-
- Oct 08, 2021
-
-
Antoine R. Dumont authored
Related to T3629
-
Antoine R. Dumont authored
This does not yet enter into the registration of a new lister. Related to T3629
-
- Sep 24, 2021
-
-
Antoine R. Dumont authored
That avoids having multiple distinct opam root directories per opam lister instance. The current opam commands used by the lister are actually listing specifically per instance. Related to P1171
-
- Sep 21, 2021
-
-
Antoine R. Dumont authored
Any extra state initialization (outside the scheduler scope) is to happen in the get_pages method.
-
Antoine R. Dumont authored
Related to T3590
-
Antoine R. Dumont authored
This matches how it's done for all other multi-instance listers. Related to T3590
-
Antoine R. Dumont authored
We should avoid side-effects in the constructor as much as possible. That avoids surprising behavior at object instantiation time. The state, if needed, must be initialized in the `swh.lister.pattern.Lister.get_pages` method, as recommended in the class docstring. This also fixes the current test, which actually bootstraps a real opam local "clone" in /tmp. Related to T3590
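The principle can be illustrated with a small sketch. `SketchLister` and its `bootstrap` callable are hypothetical and do not reproduce the `swh.lister.pattern.Lister` API; the point is only that the constructor stores parameters while expensive state is built when listing starts.

```python
from typing import Callable, Iterator, List, Optional


class SketchLister:
    """Hypothetical lister; not the swh.lister.pattern.Lister API."""

    def __init__(self, instance: str, bootstrap: Callable[[str], List[str]]):
        # Only store parameters here: no I/O, subprocess or filesystem access.
        self.instance = instance
        self._bootstrap = bootstrap
        self._packages: Optional[List[str]] = None

    def get_pages(self) -> Iterator[str]:
        # Expensive state is built on first use, when listing actually starts.
        if self._packages is None:
            self._packages = self._bootstrap(self.instance)
        yield from self._packages
```

With this shape, a test can instantiate the lister with a fake `bootstrap` and never touch the real filesystem.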
-