- Jul 10, 2024
-
-
Antoine R. Dumont authored
This aligns the behavior of the CLI with that of the other swh modules' CLIs (e.g. scheduler, deposit, ...). Refs. swh/infra/sysadm-environment#5344
-
- May 30, 2024
-
-
David Douard authored
Some Kafka messages may lack the 'type' attribute. In that case, query the storage for the related OriginVisit to get the visit type. If the visit type is not available there either, keep visit_types empty.
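The fallback described above can be sketched as follows. This is a minimal illustration, not the code from the commit: `resolve_visit_types` and `lookup_visit_type` are hypothetical names, the message keys `origin` and `visit` are assumed, and the callable stands in for the storage query that fetches the related OriginVisit.

```python
from typing import Callable, List, Optional


def resolve_visit_types(
    message: dict,
    lookup_visit_type: Callable[[str, int], Optional[str]],
) -> List[str]:
    """Return the visit types to index for one journal message.

    `lookup_visit_type` is a hypothetical stand-in for the storage query;
    it returns None when the visit is unknown.
    """
    visit_type = message.get("type")
    if visit_type is None:
        # The message lacks 'type': ask the storage for the visit instead.
        visit_type = lookup_visit_type(message["origin"], message["visit"])
    # Keep the list empty when no visit type could be found at all.
    return [visit_type] if visit_type else []
```

The storage is only queried when the attribute is missing, so messages that already carry a type cost no extra lookup.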
-
- May 15, 2024
-
-
Pierre-Yves David authored
-
- Mar 29, 2024
-
-
David Douard authored
-
- Mar 06, 2024
-
-
Jérémy Bobbio (Lunar) authored
To enable `swh alter removal` to fully remove origins from the archive, we need to be able to remove origins from ElasticSearch. As ElasticSearch indexes a document per origin, referenced by the origin URL, removing references to an origin is as simple as removing the associated document. Closes #4652
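Because each origin maps to exactly one document keyed by its URL, removal reduces to a single keyed delete. The toy index below only illustrates that property; `InMemoryIndex` and its methods are hypothetical and do not reproduce the ElasticSearch client calls used by swh-search.

```python
from typing import Dict, List


class InMemoryIndex:
    """Toy stand-in for an index holding one document per origin URL."""

    def __init__(self) -> None:
        self._documents: Dict[str, dict] = {}

    def index(self, document: dict) -> None:
        # The origin URL is the document key, so re-indexing overwrites.
        self._documents[document["url"]] = document

    def delete(self, url: str) -> bool:
        # Removing the document removes every reference to the origin.
        return self._documents.pop(url, None) is not None

    def urls(self) -> List[str]:
        return sorted(self._documents)
```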
-
- Feb 20, 2024
-
-
David Douard authored
Fix a "typo" in paths returned by get_output_mapping()...
-
David Douard authored
Add missing links in editable install to parser source files generated by tree-sitter. Since the Translator can build the extension on the fly, this should work ok for strict editable install. Note that this also fixes the building of the sdist package, including generated files to build the tree-sitter parser extension. Thanks to lunar for this solution.
-
- Feb 19, 2024
-
-
vlorentz authored
They are usually less interesting than original projects.
-
Antoine Lambert authored
Set objects cannot be serialized by the SWH RPC layer, so store visit types in a list instead, as the elasticsearch backend does. This allows the search memory backend to be used without errors behind a remote proxy. Closes #4653.
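The problem and the fix can be illustrated with the standard `json` module, used here only as a stand-in for the SWH RPC serializer (an assumption: the real layer uses its own encoders). `store_visit_types` is a hypothetical helper that merges visit types with set semantics but stores them as a list.

```python
import json


def store_visit_types(document: dict, new_types) -> dict:
    """Merge visit types without duplicates and store them as a sorted list."""
    merged = set(document.get("visit_types", [])) | set(new_types)
    # A list survives serialization, whereas a set would be rejected.
    return {**document, "visit_types": sorted(merged)}


doc = store_visit_types({"url": "https://example.org/a", "visit_types": ["git"]}, ["svn", "git"])
payload = json.dumps(doc)  # works because visit_types is a list

try:
    json.dumps({"visit_types": {"git"}})  # a set cannot be encoded
except TypeError as exc:
    set_error = str(exc)
```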
-
Antoine Lambert authored
It seems a regression was introduced in the latest pytest release: the xunit-style teardown_function is no longer called. Turn it into an autouse fixture to work around that issue.
-
- Feb 15, 2024
-
-
Antoine Lambert authored
Previously, origin_search only returned a list of dicts, each containing a single url field. From now on, each returned origin dict also includes the list of visit types and a boolean indicating whether the origin was visited by SWH. The purpose of this change is to let the webapp use the list of visit types associated with an origin when searching for origins. Related to swh-web#4786.
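A sketch of the enriched result shape described above. The key names `visit_types` and `has_visits` are assumptions made for illustration, as is the helper `to_search_result`; check the swh-search interface for the actual field names.

```python
def to_search_result(document: dict) -> dict:
    """Build one search result from an indexed origin document.

    Key names other than "url" are assumed for illustration.
    """
    return {
        "url": document["url"],
        # Always a list, even when the origin has no known visit type.
        "visit_types": list(document.get("visit_types", [])),
        # Whether the origin was visited by SWH.
        "has_visits": bool(document.get("has_visits", False)),
    }


result = to_search_result(
    {"url": "https://example.org/a", "visit_types": ["git"], "has_visits": True}
)
```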
-
- Feb 05, 2024
-
-
Antoine Lambert authored
Related to swh/meta#5075.
-
- Dec 04, 2023
-
- Dec 03, 2023
-
-
David Douard authored
-
- Nov 29, 2023
-
-
David Douard authored
-
- Nov 16, 2023
-
-
David Douard authored
Convert README from markdown to ReST to make it embeddable in docs/index.rst
-
- Oct 19, 2023
-
-
Antoine Lambert authored
There is no easy way to get that list dynamically, as the C code of the query language parser is generated when building the swh-search Python package and not at runtime. So remove the hardcoded list of visit types: it must be updated each time a new visit type is added to the archive, or searching for such a visit type will fail.
-
- Sep 06, 2023
-
-
Antoine Lambert authored
Stop inheriting test cases from unittest.TestCase in order to benefit from all pytest features such as fixtures and parametrize. Migrate test setup and teardown to pytest style. Remove the hypothesis usage that is no longer needed.
-
- Jul 21, 2023
-
-
Antoine Lambert authored
The way setuptools handles editable installs has changed since release 64, and the custom develop command in setup.py is no longer called. As a consequence, the tree-sitter parser code generation was no longer executed when performing an editable install, and a runtime error was raised when executing the swh-search backend. So handle editable installs the proper way for recent setuptools while ensuring backward compatibility with older versions.
-
- Jul 07, 2023
-
-
David Douard authored
It now needs types-click which is indeed a dependency of swh.core[testing].
-
- Jun 28, 2023
-
-
Antoine Lambert authored
Flask 2.3 removed the deprecated before_first_request handler. So initialize the search backend once, right after it is instantiated, instead.
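The pattern can be sketched without Flask: create the backend lazily and call its initialization exactly once at creation time, rather than relying on a first-request hook. `FakeBackend` and `get_backend` are hypothetical names used for illustration only, not the swh-search server code.

```python
from typing import Optional


class FakeBackend:
    """Hypothetical backend that counts how often it is initialized."""

    def __init__(self) -> None:
        self.initialize_calls = 0

    def initialize(self) -> None:
        self.initialize_calls += 1


_backend: Optional[FakeBackend] = None


def get_backend() -> FakeBackend:
    """Return the shared backend, initializing it right after instantiation."""
    global _backend
    if _backend is None:
        _backend = FakeBackend()
        # One-time setup happens here instead of in before_first_request.
        _backend.initialize()
    return _backend
```

Every request handler can then call `get_backend()`; only the first call pays the initialization cost.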
-
- Feb 26, 2023
-
-
Kumar Shivendu authored
-
- Feb 17, 2023
-
-
Antoine Lambert authored
Related to swh/meta#4960
-
- Feb 16, 2023
-
-
Jérémy Bobbio (Lunar) authored
Related to swh/meta#4959
-
- Feb 02, 2023
-
-
Antoine Lambert authored
This restores Python 3.7 support: poetry, a dependency of isort, removed support for that Python version in a recent release.
-
- Dec 19, 2022
-
-
Antoine Lambert authored
In order to remove warnings about /apidoc/*.rst files being included multiple times in toc when building full swh documentation, prefer to include module indices only when building standalone package documentation. Also include them the proper sphinx way. Related to T4496
-
- Oct 18, 2022
-
-
David Douard authored
  - pre-commit from 4.1.0 to 4.3.0,
  - codespell from 2.2.1 to 2.2.2,
  - black from 22.3.0 to 22.10.0 and
  - flake8 from 4.0.1 to 5.0.4.
Also freeze flake8 dependencies. Also change flake8's repo config to github (the gitlab mirror being outdated).
-
- Oct 10, 2022
-
-
Antoine Lambert authored
Add visit types for new listers and loaders plus those in development.
-
- Oct 07, 2022
-
-
vlorentz authored
This will allow getting the content of documents in ElasticSearch for debugging purposes.
-
- Sep 15, 2022
-
-
Antoine Lambert authored
Apply fix as described in the package build log on unstable: UserWarning: Distutils was imported before Setuptools, but importing Setuptools also replaces the `distutils` module in `sys.modules`. This may lead to undesirable behaviors or errors. To avoid these issues, avoid using distutils directly, ensure that setuptools is installed in the traditional way (e.g. not an editable install), and/or make sure that setuptools is always imported before distutils.
-
Antoine Lambert authored
It can be useful to run a remote search server with the memory backend, for instance in the docker environment to avoid spawning a costly elasticsearch instance. As this was never tested, starting such a remote search server failed; fix the issue and add tests.
-
- Sep 12, 2022
-
-
Antoine Lambert authored
Origin visit types must be explicitly declared in order to search for them, so add new ones from loaders in development.
-
- Jul 19, 2022
- Jul 18, 2022
- May 05, 2022
-
-
Antoine Lambert authored
In that case, the search will match all origins and return them in a paginated way. It also makes it possible to consider all origins when filtering them according to different criteria (visit type for instance). These small changes will help improve the search features available in swh-web.
-
- Apr 26, 2022
-
- Apr 25, 2022
-
-
Antoine R. Dumont authored
Without this, the following query fails in the archive [1], while the same query without a visit type [2] works. This commit should fix the issue with the [1] link. [1] https://webapp.staging.swh.network/browse/search/?q=repo1.maven.org&with_visit=true&with_content=true&visit_type=maven [2] https://webapp.staging.swh.network/browse/search/?q=repo1.maven.org&with_visit=true&with_content=true
-