- Dec 07, 2022
-
-
vlorentz authored
It is easier to keep track of it in Grafana.
-
vlorentz authored
RunAll is not practical, because we need to mount the graph on a tmpfs at some point. Let's keep that step manual for now, which means RunExportCompressUpload and DeanonymizeOriginContributors will be called separately.
-
vlorentz authored
I cannot find how to make WebGraph log to stderr instead of stdout, so it seems to be the only way.
-
vlorentz authored
-
Antoine Lambert authored
It ensures luigi will be available after having installed swh development environment.
-
Antoine Lambert authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
It can be cumbersome to set paths for all (recursives) dependencies of the task we want to run; this CLI endpoint takes care of most of them.
-
vlorentz authored
It is going to get large, with the future addition of tasks to generate the license dataset and the citation dataset.
-
vlorentz authored
-
vlorentz authored
This triggers a bug in ListOriginContributors, causing it to include "null" as a contributor. A future commit will fix this.
-
vlorentz authored
This Java script (and related Luigi tasks) traverse the graph in topological order, building up the set of all contributors to a node and its ancestors, then dump the value of this set for every origin node they encounter.
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
This allows readers to efficiently get ancestors of nodes with low indegree (ie. most revisions), as it avoids a random access / API call.
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
vlorentz authored
-
- Dec 06, 2022
-
-
Roberto Di Cosmo authored
-
- Nov 29, 2022
- Nov 28, 2022
-
-
vlorentz authored
-
- Nov 25, 2022
-
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-
Roberto Di Cosmo authored
-