- Sep 16, 2021
-
-
Antoine R. Dumont authored
Related to T3578
-
Jenkins for Software Heritage authored
-
Jenkins for Software Heritage authored
Update to upstream version '0.37.0' with Debian dir 304ad56e032c6f722cf702178ece7d62e3dcef12
-
Jenkins for Software Heritage authored
-
- Sep 15, 2021
-
-
Antoine R. Dumont authored
This impacts both the `extid_get_from_extid` and `extid_get_from_target` endpoints. Whe extid_version/extid_type are not provided, this keeps the existing behavior of returning all extids matching. Related to T3567
-
- Sep 14, 2021
-
-
vlorentz authored
-
- Sep 10, 2021
- Sep 09, 2021
-
-
vlorentz authored
This should make it run up to 100 times faster, even on average directories.
-
vlorentz authored
Instead of fetching them one-by-one, with the very high latency this entails. This is preliminary work to make `directory_ls` less painfully slow.
-
vlorentz authored
And fall back to concurrent insertion.
-
- Sep 08, 2021
-
-
vlorentz authored
By reusing the 'steady state' main statement (which is quite large) across calls.
-
vlorentz authored
This adds a new config option for the cassandra backend, 'directory_entries_insert_algo', with three possible values: * 'one-per-one' is the default, and preserves the current naive behavior * 'concurrent' and 'batch' are attempts at being more efficient
-
- Sep 06, 2021
-
-
vlorentz authored
This will be used as a second pass on objects that failed with older versions of the script.
-
- Sep 03, 2021
-
-
vlorentz authored
-
- Aug 31, 2021
-
-
vlorentz authored
They were inaccurate and a performance bottleneck. We can/should use swh-counters instead, now.
-
- Aug 30, 2021
-
-
Vincent Sellier authored
resulting in OriginVisitStatus trying to put a snapshot id in the metadata field Related to T3539
-
Vincent Sellier authored
Related to T3517
-
- Aug 27, 2021
-
-
vlorentz authored
It will be used in the Cassandra experiment. Currently we use the built-in counters of the Cassandra backend; but in addition to being inaccurate, they seem to be a bottleneck. This proxy will be a lightweight solution for counting object insertion, without needing to run Kafka on the test cluster.
-
- Aug 24, 2021
-
-
Jenkins for Software Heritage authored
-
Jenkins for Software Heritage authored
Update to upstream version '0.36.0' with Debian dir 179c1ad6d3ce02e0f64d5944d38e3e3d48e86d89
-
Jenkins for Software Heritage authored
-
Nicolas Dandrimont authored
-
vlorentz authored
It was unclear this actually worked; I had to write this test to realize the code wasn't buggy. Also replaced a conditional that is always False (because Cassandra always returns results in the order of the clustering key) with an assertion, so the code is less confusing.
-
vlorentz authored
When called by a replayer, the visit.visit field is set; but origin.next_visit_id was never incremented, so on the next loader run, the visit id would be 1 even if there is already a visit with that id.
-
vlorentz authored
Instead of calling content_find() for each object, which needs to make two queries for each. Given the latency of Cassandra queries, this should be a significant speed-up (possibly up to 100 times faster, as this is the value of PARTITION_KEY_RESTRICTION_MAX_SIZE). This also changes the schema, because CQL does not allow doing `IN` queries on compound partition keys.
-
Vincent Sellier authored
Related to T3485
-
- Aug 20, 2021
-
-
Jenkins for Software Heritage authored
-
Jenkins for Software Heritage authored
Update to upstream version '0.35.1' with Debian dir 313295a88c0c4c4f7c924c7c36bea4c310f2cbca
-
Jenkins for Software Heritage authored
-
- Aug 06, 2021
-
- Jul 28, 2021
-
-
Jenkins for Software Heritage authored
-
Jenkins for Software Heritage authored
Update to upstream version '0.35.0' with Debian dir 6ad84ab1c5fb66c4ead922e0b750291db31def2f
-
Jenkins for Software Heritage authored
-
- Jul 27, 2021
-
-
Antoine R. Dumont authored
-
- Jul 23, 2021
-
-
Nicolas Dandrimont authored
This fields allows having multiple version of the ExtID -> SWHID mapping, for instance when the implementation of a loader changes in a backwards-incompatible way. For now, we don't change the API used to query or store ExtIDs. When querying for the SWHIDs corresponding to a given external objects, all versions are returned, and the client is expected to do the filtering.
-
- Jul 07, 2021
-
-
Jenkins for Software Heritage authored
-
Jenkins for Software Heritage authored
Update to upstream version '0.34.0' with Debian dir d1489fbcf94a7c6a63233ae05f74d2de68f382bc
-
Jenkins for Software Heritage authored
-