From cac32f213799eff93768c94b000755db08174868 Mon Sep 17 00:00:00 2001 From: Valentin Lorentz <vlorentz@softwareheritage.org> Date: Fri, 3 Mar 2023 14:24:17 +0100 Subject: [PATCH] Advertize 2022-12-07 dataset --- docs/graph/dataset.rst | 22 ++++++++++++++++++++++ 1 file changed, 22 insertions(+) diff --git a/docs/graph/dataset.rst b/docs/graph/dataset.rst index f4b41b1..8f4efa5 100644 --- a/docs/graph/dataset.rst +++ b/docs/graph/dataset.rst @@ -33,6 +33,12 @@ Summary of dataset versions - Columnar - Compressed + * - `2022-12-07`_ + - 27,397,574,122 + - 416,565,871,870 + - ✔ + - ✔ + * - `2022-04-25`_ - 25,340,003,875 - 375,867,687,011 @@ -111,6 +117,21 @@ Full graph datasets Because of their size, some of the latest datasets are only available for downside from Amazon S3. +2022-12-07 +~~~~~~~~~~ + +A full export of the graph dated from December 2022 + +- **Columnar tables (Apache ORC)**: + + - **Total size**: 13 TiB + - **S3**: ``s3://softwareheritage/graph/2022-12-07/orc`` + +- **Compressed graph**: + + - **Total size**: 7.1 TiB + - **S3**: ``s3://softwareheritage/graph/2022-12-07/compressed`` + 2022-04-25 ~~~~~~~~~~ @@ -123,6 +144,7 @@ A full export of the graph dated from April 2022 - **Compressed graph**: + - **Total size**: 6.5 TiB - **S3**: ``s3://softwareheritage/graph/2022-04-25/compressed`` -- GitLab