use a compact, binary format for node ids mapping files
We need a more compact way of storing the PID <-> long id mappings (i.e., a binary file format).
Should also help with D1802.
Migrated from T1944 (view on Phabricator)
Activity
- Thibault Allançon added Compressed graph service priority:Normal labels
- Stefano Zacchiroli changed title from More compact format for node ids mapping files to use a compact, binary format for node ids mapping files
- Phabricator Migration user marked this issue as related to #1950 (closed)
- Maintainer
Status update: we now have binary serialization formats for the two maps; see the docstrings of PidToIntMap and IntToPidMap in swh.graph.pid. That means that Python code can *read* the compact maps (and also write them, but at a speed that is impractical for generation). Conversion of the textual maps generated for the most recent compressed graph is ongoing and almost complete.
The generation of those maps will need to happen on the Java side, and it's still pending.
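To illustrate the kind of layout discussed above, here is a minimal Python sketch of two fixed-size-record maps. It is not the code in swh.graph.pid: the record layout (22-byte binary PID plus 8-byte big-endian integer), the object type codes, and all function names are assumptions made for this example. The authoritative format is the one described in the PidToIntMap and IntToPidMap docstrings.

```python
import struct
from typing import List, Optional

# Assumed record layout (hypothetical, for illustration only):
#   PID: 1 byte version, 1 byte object type code, 20 bytes SHA1 (22 bytes)
#   int: 8-byte big-endian signed integer
PID_FMT = ">BB20s"
INT_FMT = ">q"
PID_SIZE = struct.calcsize(PID_FMT)
INT_SIZE = struct.calcsize(INT_FMT)
REC_SIZE = PID_SIZE + INT_SIZE

# Hypothetical numeric codes for the object types found in textual PIDs.
TYPE_CODES = {"cnt": 0, "dir": 1, "rev": 2, "rel": 3, "snp": 4}
TYPE_NAMES = {code: name for name, code in TYPE_CODES.items()}


def pid_to_bytes(pid: str) -> bytes:
    """Pack a textual PID such as 'swh:1:cnt:<40 hex digits>' into 22 bytes."""
    _scheme, version, obj_type, hex_id = pid.split(":")
    return struct.pack(PID_FMT, int(version), TYPE_CODES[obj_type],
                       bytes.fromhex(hex_id))


def bytes_to_pid(raw: bytes) -> str:
    """Inverse of pid_to_bytes."""
    version, code, sha1 = struct.unpack(PID_FMT, raw)
    return f"swh:{version}:{TYPE_NAMES[code]}:{sha1.hex()}"


def dump_pid_to_int(pids: List[str]) -> bytes:
    """Serialize PID -> int records, sorted by binary PID for binary search.

    The integer assigned to each PID is its position in the input list.
    """
    records = sorted((pid_to_bytes(p), i) for i, p in enumerate(pids))
    return b"".join(raw + struct.pack(INT_FMT, i) for raw, i in records)


def dump_int_to_pid(pids: List[str]) -> bytes:
    """Serialize int -> PID: the i-th 22-byte record is the PID of node i."""
    return b"".join(pid_to_bytes(p) for p in pids)


def lookup_int(buf: bytes, pid: str) -> Optional[int]:
    """Binary search over the sorted fixed-size records; None if absent."""
    target = pid_to_bytes(pid)
    lo, hi = 0, len(buf) // REC_SIZE
    while lo < hi:
        mid = (lo + hi) // 2
        off = mid * REC_SIZE
        key = buf[off:off + PID_SIZE]
        if key < target:
            lo = mid + 1
        elif key > target:
            hi = mid
        else:
            return struct.unpack_from(INT_FMT, buf, off + PID_SIZE)[0]
    return None


def lookup_pid(buf: bytes, node_id: int) -> str:
    """Direct offset computation: no search is needed for int -> PID."""
    off = node_id * PID_SIZE
    return bytes_to_pid(buf[off:off + PID_SIZE])
```

With this layout, a PID -> int record takes 30 bytes, whereas the textual PID alone is 50 characters. Because records have a fixed size, both maps can be queried in place (for example through `mmap`) without loading them entirely into memory.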
- Stefano Zacchiroli added state:wip label
- Stefano Zacchiroli added priority:High label and removed priority:Normal label
- Phabricator Migration user mentioned in commit 998a4435
- Stefano Zacchiroli assigned to @zack
- Stefano Zacchiroli removed state:wip label
- Stefano Zacchiroli closed
- Maintainer
Reopening, as it is not fixed in master yet.
- Stefano Zacchiroli reopened
- Maintainer
Closed in rDGRPH998a44353612
- Stefano Zacchiroli closed
- Phabricator Migration user mentioned in issue #1950 (closed)