diff --git a/docs/graph/dataset.rst b/docs/graph/dataset.rst index 820c937ed6c4ccd5e077579b9df1a12729925328..45b9d6d8fd3c6576e000ef7a87b990ed9d3a430e 100644 --- a/docs/graph/dataset.rst +++ b/docs/graph/dataset.rst @@ -122,12 +122,16 @@ Summary of dataset versions - ✔ - ✔ + Full graph datasets ------------------- Because of their size, some of the latest datasets are only available for downside from Amazon S3. + +.. _graph-dataset-2023-09-06: + 2023-09-06 ~~~~~~~~~~ @@ -143,6 +147,9 @@ A full export of the graph dated from September 2023 - **Total size**: 8.8 TiB - **S3**: ``s3://softwareheritage/graph/2023-09-06/compressed`` + +.. _graph-dataset-2022-12-07: + 2022-12-07 ~~~~~~~~~~ @@ -166,6 +173,13 @@ A full export of the graph dated from December 2022 - **Total size**: 1 TiB - **S3**: ``s3://softwareheritage/graph/2022-12-07-history-hosting/compressed`` +- **Erratum**: + + - `author and committer timestamps were shifted back 1 or 2 hours, based on the Europe/Paris timezone <https://gitlab.softwareheritage.org/swh/devel/swh-graph/-/issues/4788>`_ + + +.. _graph-dataset-2022-04-25: + 2022-04-25 ~~~~~~~~~~ @@ -182,6 +196,8 @@ A full export of the graph dated from April 2022 - **S3**: ``s3://softwareheritage/graph/2022-04-25/compressed`` +.. _graph-dataset-2021-03-23: + 2021-03-23 ~~~~~~~~~~ @@ -199,6 +215,8 @@ A full export of the graph dated from March 2021. - **S3**: ``s3://softwareheritage/graph/2021-03-23/compressed`` +.. _graph-dataset-2020-12-15: + 2020-12-15 ~~~~~~~~~~ @@ -211,6 +229,8 @@ compressed representation. <https://annex.softwareheritage.org/public/dataset/graph/2020-12-15/compressed/>`_ +.. _graph-dataset-2020-05-20: + 2020-05-20 ~~~~~~~~~~ @@ -225,6 +245,8 @@ compressed representation. <https://annex.softwareheritage.org/public/dataset/graph/2020-05-20/compressed/>`_ +.. _graph-dataset-2019-01-28: + 2019-01-28 ~~~~~~~~~~ @@ -253,6 +275,9 @@ Teaser datasets If the above datasets are too big, we also provide "teaser" datasets that can get you started and have a smaller size fingerprint. + +.. _graph-dataset-2021-03-23-popular-3k-python: + 2021-03-23-popular-3k-python ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ @@ -281,6 +306,8 @@ was the following: - **S3**: ``s3://softwareheritage/graph/2021-03-23-popular-3k-python/compressed/`` +.. _graph-dataset-2020-12-15-gitlab-all: + 2020-12-15-gitlab-all ~~~~~~~~~~~~~~~~~~~~~ @@ -292,6 +319,9 @@ Available in compressed graph format. - **URL**: `/graph/2020-12-15-gitlab-all/compressed/ <https://annex.softwareheritage.org/public/dataset/graph/2020-12-15-gitlab-all/compressed/>`_ + +.. _graph-dataset-2020-12-15-gitlab-100k: + 2020-12-15-gitlab-100k ~~~~~~~~~~~~~~~~~~~~~~ @@ -304,6 +334,8 @@ exported in December 2020. Available in compressed graph format. <https://annex.softwareheritage.org/public/dataset/graph/2020-12-15-gitlab-100k/compressed/>`_ +.. _graph-dataset-2019-01-28-popular-4k: + 2019-01-28-popular-4k ~~~~~~~~~~~~~~~~~~~~~ @@ -325,6 +357,8 @@ was the following: <https://annex.softwareheritage.org/public/dataset/graph/2019-01-28-popular-4k/parquet/>`_ - **S3**: ``s3://softwareheritage/graph/2019-01-28-popular-4k/parquet/`` +.. _graph-dataset-2019-01-28-popular-3k-python: + 2019-01-28-popular-3k-python ~~~~~~~~~~~~~~~~~~~~~~~~~~~~