Skip to content

ExtractNodes: read ORC files in parallel

Spawn many sort(1) in parallel to avoid locking, then a sort -m to merge all the batches

Benchmarks on popular-3k-python:

Before: 51:52.18 total After: 5:53.77 total


Migrated from D7733 (view on Phabricator)

Merge request reports