- Aug 23, 2022
-
-
vlorentz authored
Atom documents from old deposits usually use these instead of codemeta's
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
Motivation: 1. It makes it easier to visualize what is actually happening when modifying the graph, by working explicitly on triples instead of a JSON-LD (a tree serialization of the graph). 2. Remove the need for the hacky `merge_values()` function (and possibly `merge_documents()` in a future commit) 3. It also catches malformed data exactly where it is added in the document (the call to rdflib.Graph.add()) instead of at the end of the mapping when running compaction/expansion. Downsides: 1. Tests are clunkier, because they relied on deterministic order of unordered lists; but rdflib does not guarantee it 2. Code is longer 3. Extra dependency (which we will need at some point if we want to import from RDF datasets, anyway)
-
vlorentz authored
Their order is nondeterministic, it just happens to work with the way we use PyLD.
- Aug 22, 2022
-
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
This parameter was only used to execute extra code before `normalize_translation` is called. This caused some duplication, and will not work when switching to a non-JSON-LD internal representation. Removing it also makes the code of mappings more consistent, by removing specific field handling from their implementation of the `translate` method itself.
-
vlorentz authored
-
vlorentz authored
-
- Aug 20, 2022
-
-
Satvik authored
-
- Aug 10, 2022
-
- Aug 08, 2022
-
-
vlorentz authored
When capture_exceptions=False, the indexer's caller reports the exception to Sentry itself. However, because tags were added by indexers within a scope internal to the indexers; the scope was closed before returning to the caller, so these tags were actually not sent to Sentry.
-
vlorentz authored
Without these tags, it is often impossible to find what object caused a given crash without guesswork based on the object's content and swh-graph.
-
vlorentz authored
-
vlorentz authored
-
- Aug 04, 2022
-
-
vlorentz authored
Invalid URLs are a common source of crashes
-
- Aug 03, 2022
-
-
vlorentz authored
-
- Jul 29, 2022
-
-
Antoine R. Dumont authored
Detected through T4406 Related to T4412
-
Antoine R. Dumont authored
So future services to be deployed can match on that name. Related to T4406
-
- Jul 22, 2022
-
-
Antoine R. Dumont authored
The indexer language has been no longer running in production for years (and the related indexer code has been pruned years ago as well). Those related endpoints are not consumed by anyone. So we can drop those. That will ease code maintenance and make the ci gain some time when running the overall tests. Related to T4273
-
Antoine R. Dumont authored
It's been no longer running in production for years. That will ease code maintenance. Related to T4273
-
Antoine R. Dumont authored
Related to T4273
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
Related to T4273
-
- Jul 21, 2022
- Jul 12, 2022
-
- Jul 11, 2022
-
-
vlorentz authored
-
- Jul 06, 2022
-
-
vlorentz authored
1. indexers call themselves directly instead of going through the scheduler 2. metadata is attached to directories instead of revisions
-
- Jul 05, 2022
-
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
It makes resulting documents (usually) shorter, and tests more readable.
-
vlorentz authored
detect_metadata_files and extrinsic_metadata_formats (respectively) are somewhat mutually exclusive, so it does not make much sense to have them in the same class and MAPPINGS dict
-
vlorentz authored
It is already set by _translate_dict itself.
-