ORC file export: Adastra copy of dataset

The deduplicator exports graph data in ORC format, based on the messages that loaders write to Kafka. It needs to create two datasets:

  • one that will remain on Adastra
  • one that will be back-exported to the SWH main archive (#52)

This issue covers the former: the copy of the dataset that remains on Adastra.

Edited by Simeon Carstens