- Jan 06, 2023
-
-
Satvik authored
-
- Sep 02, 2022
-
-
vlorentz authored
They cause noisy logs
-
- Sep 01, 2022
- Aug 31, 2022
-
-
Antoine R. Dumont authored
Related to T4477
-
- Aug 30, 2022
-
-
vlorentz authored
Hash dicts are now preferred by swh-objstorage, in order to support individual hash collisions.
-
Antoine R. Dumont authored
Fix issue [1], which prevents the indexer journal client from consuming the journal. [1] https://sentry.softwareheritage.org/share/issue/1d3de3b47c234408889bff5c4f4b0d20/ Related to T4459#90001
-
- Aug 25, 2022
-
- Aug 23, 2022
-
-
vlorentz authored
Atom documents from old deposits usually use these instead of codemeta's
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
Motivation:
1. It makes it easier to visualize what is actually happening when modifying the graph, by working explicitly on triples instead of a JSON-LD document (a tree serialization of the graph).
2. It removes the need for the hacky `merge_values()` function (and possibly `merge_documents()` in a future commit).
3. It also catches malformed data exactly where it is added to the document (the call to rdflib.Graph.add()) instead of at the end of the mapping when running compaction/expansion.
Downsides:
1. Tests are clunkier, because they relied on a deterministic order of unordered lists, which rdflib does not guarantee.
2. Code is longer.
3. Extra dependency (which we will need at some point anyway if we want to import from RDF datasets).
-
vlorentz authored
Their order is nondeterministic; it just happens to work with the way we use PyLD.
- Aug 22, 2022
-
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
This parameter was only used to execute extra code before `normalize_translation` is called. This caused some duplication, and will not work when switching to a non-JSON-LD internal representation. Removing it also makes the code of mappings more consistent, by removing specific field handling from their implementation of the `translate` method itself.
-
vlorentz authored
-
vlorentz authored
-
- Aug 20, 2022
-
-
Satvik authored
-
- Aug 10, 2022
-
- Aug 08, 2022
-
-
vlorentz authored
When capture_exceptions=False, the indexer's caller reports the exception to Sentry itself. However, tags were added by indexers within a scope internal to the indexers, and that scope was closed before returning to the caller, so these tags were never actually sent to Sentry.
-
vlorentz authored
Without these tags, it is often impossible to find what object caused a given crash without guesswork based on the object's content and swh-graph.
-
vlorentz authored
-
vlorentz authored
-
- Aug 04, 2022
-
-
vlorentz authored
Invalid URLs are a common source of crashes
-
- Aug 03, 2022
-
-
vlorentz authored
-
- Jul 29, 2022
-
-
Antoine R. Dumont authored
Detected through T4406. Related to T4412
-
Antoine R. Dumont authored
So future services to be deployed can match on that name. Related to T4406
-
- Jul 22, 2022
-
-
Antoine R. Dumont authored
The language indexer has not been running in production for years (and the related indexer code was pruned years ago as well). The related endpoints are not consumed by anyone, so we can drop them. That will ease code maintenance and save CI time when running the full test suite. Related to T4273
-
Antoine R. Dumont authored
It has not been running in production for years. Removing it will ease code maintenance. Related to T4273
-
Antoine R. Dumont authored
Related to T4273
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
Related to T4273
-
- Jul 21, 2022