- Apr 26, 2021
-
-
Antoine Lambert authored
Enable checking that the package documentation can be built without producing Sphinx warnings. The sphinx environment is designed to be used in continuous integration, to prevent breaking the documentation build when committing changes. The sphinx-dev environment is designed to be used inside a full swh development environment. Related to T3258
-
- Apr 15, 2021
-
-
vlorentz authored
-
- Mar 27, 2021
-
-
Aastha Asthana authored
-
- Mar 26, 2021
-
-
Daniele Serafini authored
-
- Mar 10, 2021
-
-
Kumar Shivendu authored
Clean scan outputs for ndjson, json, and text formats
-
- Mar 08, 2021
-
-
Antoine Lambert authored
-
vlorentz authored
-
- Mar 06, 2021
-
-
vlorentz authored
Reviewers: #reviewers Differential Revision: https://forge.softwareheritage.org/D5211
-
- Mar 01, 2021
-
-
Antoine Lambert authored
-
Antoine Lambert authored
pytest-flask removed the url method of its LiveServer class, which is wrapped by the live_server fixture. Use the recommended way to get a URL when using Flask instead; this requires adding a dummy root endpoint to the API in order to get its base URL.
-
vlorentz authored
-
vlorentz authored
-
- Dec 19, 2020
-
-
Stefano Zacchiroli authored
This makes DB import much more robust, at the price of a ~15% performance impact (which is still very much bearable, in light of recent DB import improvements). If needed, this cost can be mitigated in the future by pipelining SWHID validation with SQLite insertion, using either a thread or asyncio. Closes T2813
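A minimal sketch of the kind of per-line validation described above, written as a generator so that it could later be pipelined with insertion. The names `is_valid_swhid` and `validated` are hypothetical, and the regular expression only covers core SWHIDs without qualifiers; the actual commit may well rely on a different validator.

```python
import re

# Core SWHID syntax only: swh:1:<object type>:<40 lowercase hex digits>.
# Qualifiers (";origin=...", ";lines=...") are deliberately not handled in this sketch.
SWHID_RE = re.compile(r"swh:1:(cnt|dir|rev|rel|snp):[0-9a-f]{40}")


def is_valid_swhid(text: str) -> bool:
    """Return True if text (ignoring surrounding whitespace) is a core SWHID."""
    return SWHID_RE.fullmatch(text.strip()) is not None


def validated(lines):
    """Yield stripped SWHIDs lazily, raising ValueError on the first invalid line."""
    for lineno, line in enumerate(lines, start=1):
        swhid = line.strip()
        if not is_valid_swhid(swhid):
            raise ValueError(f"line {lineno}: invalid SWHID: {swhid!r}")
        yield swhid
```

Because `validated` is lazy, it can feed an insertion loop directly without holding the whole input in memory.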
-
Stefano Zacchiroli authored
  - avoid loading the entire input file into memory, and rely on DB unicity constraints to spot duplicates
  - tune sqlite by disabling sync writes and journal; this is unsafe, but insertion is all or nothing anyway
  - minor: improve exception handling, propagating SQLite errors up the stack
With this change, import time for 30M SWHIDs went down from ~6m30s to ~55s, and memory usage went down from 5 GiB to a few tens of MiB. Closes T2836 Closes T2812
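The approach in this commit message can be sketched with the standard `sqlite3` module. This is an illustration under assumptions, not the scanner's actual code: the function name `import_swhids`, the table name `swhids` and its single-column schema are all hypothetical.

```python
import sqlite3


def import_swhids(conn: sqlite3.Connection, lines) -> int:
    """Stream SWHIDs from an iterable of lines into SQLite; return the row count."""
    # Unsafe on purpose: no fsync and no rollback journal. A failed import is
    # expected to be discarded as a whole rather than recovered.
    conn.execute("PRAGMA synchronous = OFF")
    conn.execute("PRAGMA journal_mode = OFF")
    # The primary key provides the unicity constraint that spots duplicates.
    conn.execute("CREATE TABLE IF NOT EXISTS swhids (swhid TEXT PRIMARY KEY)")
    # A generator keeps memory usage flat: the input is never loaded entirely.
    rows = ((line.strip(),) for line in lines if line.strip())
    # A duplicate raises sqlite3.IntegrityError, which propagates to the caller.
    conn.executemany("INSERT INTO swhids (swhid) VALUES (?)", rows)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM swhids").fetchone()[0]
```

A caller would typically pass an open file object as `lines`, for example `import_swhids(sqlite3.connect("swhids.sqlite"), open("swhids.txt"))`, where both file names are placeholders.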
-
Stefano Zacchiroli authored
  - preserve error message from DBError exception
  - do *not* remove DB if it exists, as it might help in debugging
  - fail using click's fail() instead of bypassing it with sys.exit
-
- Nov 25, 2020
-
-
Stefano Zacchiroli authored
Thanks to Alexios Zavras for the bug report and fix suggestion. Closes T2680
-
Stefano Zacchiroli authored
-
- Nov 24, 2020
-
-
Daniele Serafini authored
Serve the local sqlite database generated with the "db import" option. The API endpoint is compatible with the one used in the Web API [1]. [1] https://archive.softwareheritage.org/api/1/known/doc/
-
- Nov 23, 2020
-
-
Stefano Zacchiroli authored
in particular:
  - make both -i and -o required, as they have no sane defaults ("-" can still be passed to -i to read from stdin)
  - improve help message for that command
-
- Nov 21, 2020
-
-
Daniele Serafini authored
  - import SWHIDs from an input file
  - new database class to store local SWHIDs (extracted from the input file)
-
- Oct 22, 2020
-
-
Antoine R. Dumont authored
This fixes the build [1] [1] https://jenkins.softwareheritage.org/job/DTSCN/job/tests/297/console
-
- Oct 13, 2020
-
-
Tenma authored
  - add smoke and unit tests
  - mock scanner to not run actual scan
  - spy on config loading to verify how config is loaded
  - use a sample test config file and add it to MANIFEST
-
Tenma authored
-
Tenma authored
Conflicts arose between multiple users of the same session-scoped fixtures based on tmp_path_factory. Fix both the fixture itself, to avoid side effects, and the tests that relied on its inner workings.
-
Tenma authored
It makes cli more testable and helps avoid ambiguity between uses of 'scan' and 'scanner' for the reader.
-
Daniele Serafini authored
Since the model already provides a function to iterate over the child nodes, _iter_nodes_attr is not useful. The to_dict function is updated to reflect the changes. Closes T2690
-
- Oct 08, 2020
-
-
Tenma authored
Also rename private methods from leading 2 underscores to 1 to match project coding style
-
Tenma authored
Remove reliance on default arg child_nodes which is a dict. It is unused in client code, breaks tests, and is not needed to build the dict representation of the tree. This also refines types on related impacted methods which helps reasoning about them.
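One plausible reason a dict default argument breaks tests is that Python evaluates defaults once, so every call shares the same dict. The sketch below illustrates that pitfall and the usual fix; the function names are hypothetical and do not reproduce the scanner's real methods.

```python
from typing import Dict, Optional


def add_child_buggy(name: str, child_nodes: Dict[str, bool] = {}) -> Dict[str, bool]:
    # The default dict is created once at definition time and shared by all calls,
    # so state leaks from one call (or one test) to the next.
    child_nodes[name] = True
    return child_nodes


def add_child(name: str, child_nodes: Optional[Dict[str, bool]] = None) -> Dict[str, bool]:
    # Using None as the sentinel gives each call its own fresh dict.
    if child_nodes is None:
        child_nodes = {}
    child_nodes[name] = True
    return child_nodes
```

With the second form, two independent calls no longer observe each other's entries.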
-
- Oct 05, 2020
-
-
Tenma authored
The docstring was moved out of scanner docstring spot to work around a bug in our current Python version. See https://bugs.python.org/issue28739 Its formatting was also improved.
-
- Oct 02, 2020
-
-
Tenma authored
-
- Sep 25, 2020
-
-
Tenma authored
  - make config_file optional
  - fail if file does not exist instead of using empty config dict
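The behaviour described above can be sketched as follows. This is only an illustration: `load_config`, `DEFAULT_CONFIG` and the use of JSON are assumptions made for the example, not the scanner's actual loader or file format.

```python
import json
from pathlib import Path
from typing import Any, Dict, Optional

# Hypothetical defaults, used only when no configuration file is given.
DEFAULT_CONFIG: Dict[str, Any] = {}


def load_config(config_file: Optional[str] = None) -> Dict[str, Any]:
    """Load the configuration, failing loudly if a given file is missing."""
    if config_file is None:
        # The argument is optional: fall back to a copy of the defaults.
        return dict(DEFAULT_CONFIG)
    path = Path(config_file)
    if not path.is_file():
        # An explicit but missing file is an error, not an empty configuration.
        raise FileNotFoundError(f"configuration file not found: {path}")
    with path.open() as f:
        return json.load(f)
```

Failing early here avoids silently running a scan with an empty configuration when the path was simply mistyped.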
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
-
- Sep 23, 2020
-
-
David Douard authored
-
- Sep 17, 2020
-
-
Antoine Lambert authored
Related to T2610
-
Antoine Lambert authored
Related to T2610
-
Antoine Lambert authored
flake8 hook has been removed from https://github.com/pre-commit/pre-commit-hooks so now use the one from https://gitlab.com/pycqa/flake8
-
- Sep 14, 2020
-
-
Tenma authored
Replace PosixPath with Path, which abstracts away environment details, in the whole package
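A small sketch of why `Path` is the more portable choice: it instantiates the concrete class matching the running platform, whereas instantiating `PosixPath` directly fails on Windows. The helper name `source_root` is hypothetical.

```python
from pathlib import Path, PurePath


def source_root(raw: str) -> Path:
    """Build a filesystem path without hard-coding the platform flavour."""
    # Path() returns a PosixPath on POSIX systems and a WindowsPath on Windows.
    return Path(raw)


root = source_root("project/src")
# Code that only needs path semantics can type-check against PurePath or Path.
assert isinstance(root, PurePath)
```

Annotating parameters and return values with `Path` keeps the package importable and testable on any platform.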
-