- Feb 17, 2023
-
-
Antoine Lambert authored
Related to swh/meta#4960
-
- Feb 16, 2023
-
-
Related to swh/meta#4959
-
Antoine Lambert authored
-
- Feb 15, 2023
-
- Feb 14, 2023
-
-
vlorentz authored
Each thread handled one 96th of the node id range. But nodes are not homogeneously randomized across that range, so some threads had a lot more work to do than others, causing them to end weeks after, while most CPU cores idled. By splitting the range this way, threads should have more homogeneous workloads.
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
This is much more performant on the dir layer: it only takes 27 hours on swh1.enst.fr instead of an ETA of one or two years, most of it spent on this particular line: ``` LazyLongIterator successorAncestors = graph.successors(successorNodeId); ``` even when replacing all the code that used `successorAncestors`, it was still the major cause of the huge expected runtime.
-
protobuf earlier 4.12.* versions crashed when `swhgraph_pb2.py` was discovered by pytest. This has been fixed in the 4.12.11 release. See: https://github.com/protocolbuffers/protobuf/issues/10151 Preventing protobuf 4.12.* to be used makes `pip` install grpcio-tools version 1.49.0 which fails to build on Debian bookworm. In order to allow more fixed versions of grpcio-tools to be used, we bump the dependency on protobuf to version 4.12.11 or later.
-
vlorentz authored
This will be used as a metric for 'popularity' of directories, which will be used to weigh results of PopularContents (which counts the most popular names used to refer to each content)
-
vlorentz authored
-
vlorentz authored
-
- Feb 13, 2023
-
- Feb 02, 2023
-
-
Antoine Lambert authored
This fixes python 3.7 support due to poetry, a dependency of isort, that removed support for that Python version in a recent release.
-
- Jan 26, 2023
-
- Jan 17, 2023
-
-
vlorentz authored
-
- Jan 16, 2023
-
-
vlorentz authored
Because the main table already contains (origin_id, person_id) pairs, it is redundant to repeat every pair in this file.
-
- Jan 13, 2023
- Jan 12, 2023
-
-
vlorentz authored
-
- Jan 11, 2023
-
-
vlorentz authored
They tend to be forgotten; this will make sure that running 'tox' will run Java tests as well as Python tests. It will also make it easier to run Java tests on the CI
-
- Jan 06, 2023