- Jan 21, 2025
-
-
Antoine Lambert authored
This reverts commit 2a1c8db8. The new rdflib 7.1.3 release reverted the typing changes they introduced in 7.1.2 to keep compatibility with Python 3.8, so revert the commit that adapted typings to rdflib 7.1.2.
-
- Jan 13, 2025
-
-
Antoine Lambert authored
-
- Jan 10, 2025
-
-
Antoine Lambert authored
Previously, a generated BibTeX citation used REPLACEME as a placeholder for the identifier. Prefer generating a default citation identifier from the SWHID if one is provided.
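As a rough illustration of the idea, the sketch below derives a citation key from a SWHID and falls back to the placeholder when none is given. The function name `default_citation_key` and the `swh-<type>-<short id>` key format are assumptions made for this example, not necessarily what the indexer emits.

```python
from typing import Optional


def default_citation_key(swhid: Optional[str]) -> str:
    """Build a BibTeX citation key from a SWHID (hypothetical key format)."""
    if not swhid:
        # No SWHID available: keep the historical placeholder.
        return "REPLACEME"
    # Drop qualifiers such as ";origin=..." and keep the core identifier.
    core = swhid.split(";", 1)[0]
    parts = core.split(":")
    if len(parts) != 4 or parts[0] != "swh":
        # Not a well-formed core SWHID: fall back to the placeholder.
        return "REPLACEME"
    _scheme, _version, object_type, object_id = parts
    # Use the object type and a short prefix of the hash as the key.
    return f"swh-{object_type}-{object_id[:7]}"
```

With this sketch, a directory SWHID yields a key such as `swh-dir-94a9ed0`, while a missing SWHID still produces `REPLACEME`.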
-
- Dec 18, 2024
-
-
vlorentz authored
This should be enough to fully support CodeMeta v3, to the extent of the properties we can actually export to BibTeX.
-
- Dec 16, 2024
-
-
https://w3id.org/codemeta/3.0
Antoine Lambert authored
This does not mean we fully support CodeMeta 3.0 at the time of writing, but it should fix most BibTeX citation generation failures when that URL is used as the context in a codemeta.json file. Fixes #4822.
-
- Dec 13, 2024
-
-
Antoine Lambert authored
Previously, content indexers used only the content sha1 as object identifier, but this could lead to errors in production when fetching content data from an object storage whose key is the sha256 checksum of the content (winery, for instance). Use a CompositeObjId dictionary as content object identifier instead, to avoid this kind of issue when indexing content objects.
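The problem can be sketched with a toy in-memory store keyed by sha256. Everything below is a stand-in written for illustration: `ObjId` only mimics the idea of a composite identifier (a dictionary carrying several checksums) and `Sha256KeyedStore` is hypothetical, not the swh.objstorage API.

```python
import hashlib
from typing import Dict, TypedDict


class ObjId(TypedDict, total=False):
    """Local stand-in for a composite object identifier (assumed shape)."""

    sha1: bytes
    sha256: bytes


def composite_obj_id(data: bytes) -> ObjId:
    """Compute every checksum a backend might use as its lookup key."""
    return {
        "sha1": hashlib.sha1(data).digest(),
        "sha256": hashlib.sha256(data).digest(),
    }


class Sha256KeyedStore:
    """Toy object storage that indexes content by sha256 only."""

    def __init__(self) -> None:
        self._objects: Dict[bytes, bytes] = {}

    def add(self, data: bytes) -> ObjId:
        obj_id = composite_obj_id(data)
        self._objects[obj_id["sha256"]] = data
        return obj_id

    def get(self, obj_id: ObjId) -> bytes:
        # Raises KeyError if the identifier carries no sha256 entry.
        return self._objects[obj_id["sha256"]]
```

An identifier that carries only `sha1` cannot be resolved by such a store, whereas a composite identifier lets each backend pick the checksum it needs.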
-
- Dec 09, 2024
-
-
vlorentz authored
-
- Dec 05, 2024
-
-
David Douard authored
- Remove the `self.result` attribute and make the result an actual output of the `run()` method.
- Refactor `process_journal_objects` to use `run()` to prevent code duplication.
-
David Douard authored
Scheduling indexer tasks has not been supported for a while now.
-
- Nov 26, 2024
-
-
vlorentz authored
-
- Nov 11, 2024
-
-
This works around citation.cff files with extra spaces in quoted strings.
-
- Nov 08, 2024
-
-
Antoine Lambert authored
The previous implementation could lead to errors when an invalid date was present in the source CodeMeta document.
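One defensive way to handle such input is sketched below: parse the date and return `None` rather than raising when the value is malformed. The helper name `parse_iso_date` is hypothetical, and the actual fix may treat invalid dates differently.

```python
from datetime import date
from typing import Optional


def parse_iso_date(value: object) -> Optional[date]:
    """Return a date for a valid ISO 8601 date string, None otherwise."""
    if not isinstance(value, str):
        # Metadata fields may hold lists, numbers or nulls instead of text.
        return None
    try:
        return date.fromisoformat(value.strip())
    except ValueError:
        # Out-of-range or malformed dates are skipped instead of failing.
        return None
```

A caller can then omit the date field from the generated citation when the helper returns `None`.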
-
- Nov 05, 2024
-
-
David Douard authored
The new 'backend' handling mechanism in swh.core 3.6 expects the registered package to be a simple name (without '.'). So this partially reverts the changes from b681d147, going back to 'indexer_storage' as the config section name.
-
- Nov 04, 2024
-
-
vlorentz authored
This avoids accidentally interpreting user-provided data as a macro.
-
- Oct 31, 2024
-
-
Antoine Lambert authored
When a citation.cff file contains a preferred-citation field, ensure citation metadata is extracted from it. Resolves #4743.
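The selection logic can be sketched as follows, assuming the citation.cff file has already been parsed into a dictionary (for example with a YAML parser). The function name `citation_source` is made up for this example; only the `preferred-citation` key comes from the Citation File Format.

```python
from typing import Any, Dict


def citation_source(cff: Dict[str, Any]) -> Dict[str, Any]:
    """Pick the mapping that citation metadata should be read from."""
    preferred = cff.get("preferred-citation")
    if isinstance(preferred, dict) and preferred:
        # The authors asked to be cited through this reference instead.
        return preferred
    # No usable preferred citation: use the top-level metadata.
    return cff


cff_document = {
    "title": "Example Tool",
    "preferred-citation": {"type": "article", "title": "The Example Tool Paper"},
}
selected = citation_source(cff_document)
```

Here `selected` is the nested article entry, so the generated citation would reference the paper rather than the software record.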
-
- Oct 30, 2024
-
-
- Oct 25, 2024
-
-
David Douard authored
The sql/ directory is now expected to live in the module implementing the swh.core.db-based backend.
-
David Douard authored
But do not fail by default if no config file is given; this is required to be able to run 'swh indexer mapping list-terms', for example (which is used to build the documentation).
-
David Douard authored
Normalize the indexer storage db for swh.core 3.6 with improved `swh db` handling capabilities. Remove storage/test_init.py, which is now outdated. Warning: this renames the config entry for indexer storages from 'indexer_storage' to 'indexer.storage'. Backward-compatibility code handles this when loading the configuration for the indexer cli tools and the api server only; the generic loading mechanism will not handle the old name properly (i.e. when managing the storage db using `swh db` commands).
-
- Oct 24, 2024
-
-
vlorentz authored
rdflib implicitly sets xsd:integer as the type of Literal(42), while the ActivityStreams specification requires an xsd:nonNegativeInteger here. This was highlighted by a bugfix in rdflib 7.1.0, as previous versions omitted serializing the "default" datatype corresponding to the Python type of the literal value when writing JSON-LD.
-
David Douard authored
This is only a workaround until these tests are properly fixed. They should probably be converted to actual (py)tests; that would make it easier to cope with this kind of breakage caused by formatting changes in rdflib.
-
- Sep 12, 2024
-
-
vlorentz authored
-
- Aug 30, 2024
-
-
Antoine Lambert authored
-
Antoine Lambert authored
-
- Aug 27, 2024
-
-
David Douard authored
-
- Aug 02, 2024
-
-
- Jul 15, 2024
-
-
- Jun 13, 2024
-
-
Antoine Lambert authored
Catch parsing errors when attempting to parse a CodeMeta document in XML or JSON, then log the error and return None.
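The pattern can be sketched with the standard library parsers. The function name `parse_or_none` and the `fmt` parameter are assumptions for this example, and the indexer may rely on different parsers.

```python
import json
import logging
import xml.etree.ElementTree as ET
from typing import Any, Optional

logger = logging.getLogger(__name__)


def parse_or_none(raw: str, fmt: str) -> Optional[Any]:
    """Parse a JSON or XML document, returning None if it is malformed."""
    try:
        if fmt == "json":
            return json.loads(raw)
        if fmt == "xml":
            return ET.fromstring(raw)
    except (json.JSONDecodeError, ET.ParseError) as exc:
        # Log the failure and let the caller skip this document.
        logger.error("Failed to parse %s document: %s", fmt, exc)
        return None
    # Reached only when fmt is neither "json" nor "xml".
    raise ValueError(f"unsupported format: {fmt}")
```

Returning `None` lets a batch of documents continue past a single malformed entry, while the log entry keeps the failure visible.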
-
vlorentz authored
Fixes swh/devel/swh-indexer#4738 for existing metadata objects
-
- May 19, 2024
-
- May 15, 2024
-
-
Pierre-Yves David authored
-
Pierre-Yves David authored
-
- May 13, 2024
-
-
vlorentz authored
-
- Apr 22, 2024
-
-
Nicolas Dandrimont authored
-
- Mar 29, 2024
-
-
David Douard authored
-
- Mar 27, 2024
-
-
Antoine Lambert authored
Since the release of pytest 8.1, some pytest options are no longer needed and an editable install can be used when running tests with tox.
-
- Feb 09, 2024
-
-
Antoine Lambert authored
-
- Feb 05, 2024
-
-
Antoine Lambert authored
Related to swh/meta#5075.
-
- Feb 02, 2024
-
-
Nicolas Dandrimont authored
-