- Jan 06, 2023
-
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
two paths + reimport tasks at the package level so they are automatically picked up by luigi without passing all module names on the CLI
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
whitelist_externals is deprecated since v3.18, and removed since v4.0
-
- Jan 03, 2023
-
-
vlorentz authored
RPCServerApp inherits from aiohttp.web.Application, and GraphServerApp does not use any of the features added by RPCServerApp
-
vlorentz authored
It's slightly more accurate
-
vlorentz authored
-
vlorentz authored
Origin pseudo-SWHIDs are kind of useless on their own, as there is no public API to resolve them yet; so we might as well produce integers instead. A future commit will add a mapping table to URLs, which are actually useful.
-
- Dec 19, 2022
-
-
Antoine Lambert authored
In order to remove warnings about /apidoc/*.rst files being included multiple times in toc when building full swh documentation, prefer to include module indices only when building standalone package documentation. Also include them the proper sphinx way. Related to T4496
-
- Dec 07, 2022
-
-
vlorentz authored
RunAll is not practical, because we need to mount the graph on a tmpfs at some point. Let's keep that step manual for now, which means RunExportCompressUpload and DeanonymizeOriginContributors will be called separately.
-
vlorentz authored
I cannot find how to make WebGraph log to stderr instead of stdout, so it seems to be the only way.
-
vlorentz authored
-
Antoine Lambert authored
It ensures luigi will be available after having installed swh development environment.
-
Antoine Lambert authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
It can be cumbersome to set paths for all (recursives) dependencies of the task we want to run; this CLI endpoint takes care of most of them.
-
vlorentz authored
It is going to get large, with the future addition of tasks to generate the license dataset and the citation dataset.
-
vlorentz authored
-
vlorentz authored
This triggers a bug in ListOriginContributors, causing it to include "null" as a contributor. A future commit will fix this.
-
vlorentz authored
This Java script (and related Luigi tasks) traverse the graph in topological order, building up the set of all contributors to a node and its ancestors, then dump the value of this set for every origin node they encounter.
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
This allows readers to efficiently get ancestors of nodes with low indegree (ie. most revisions), as it avoids a random access / API call.
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
- Dec 06, 2022
-
-
Roberto Di Cosmo authored
-
- Nov 29, 2022