v1.9.0 (f96210ce)

- Rename command from `swh dataset` to `swh export`
- Switch to swh-journal 2.0
- Switch to model objects in order to .anonymize() revisions and releases
- Export new ORC table containing author fullnames and fullnames hashes
- Advertise 2023-09-06-history-hosting and 2024-12-06-history-hosting
- Document the 2024-05-16-history-hosting dataset

Fixes:

- Fix hanging in exports with no persons
- Make sensitive export path consistent

v1.7.1 (73de7917)

Doc:

* Advertise 2024-08-23-popular-500-python teaser dataset
* Advertise 2024-08-23 graph export
* The 2019-01-28-popular-3k-python dataset did not have a compressed graph
* Fix typo

Fixes:

* generate_subdataset: Fix crash on excessively large release.message values

Internal:

* journalprocessor: Fix typo spotted after codespell upgrade
* journalprocessor: Fix black formatting
* Apply swh-py-template v0.2.3

v1.3.3 (8dd81df3)

* luigi: Work around absence of metadata files in old exports
* luigi: Add support for downloading exports in parallel
* luigi: Delete all files in the root dir instead of the root dir itself
* docs: Add links to Terms of Use and 'How to use SWH data'
* Fix Sphinx role (:cls: -> :class:)

v1.0.3 (a01a82fc)

* luigi.UploadExportToS3: Skip upload of already-uploaded files
* luigi: Dynamically list directories instead of using object_types
* luigi: Read meta/export.json instead of relying on stamp files
* docs/index.rst: Add missing new line at end of file
* docs/index.rst: Fix sphinx tag name
* docs: Include module indices only when building standalone package doc

v1.0.2 (c717f60f)

* exporters/orc: Fix crash on visit status with no type
* luigi.CreateAthena: Fix validation of DB name
* luigi.RunExportAll: Default to exporting all formats

v1.0.1 (a1cf9b87)

* luigi: Rename classes to be globally unambiguous
* luigi: Send progress reports to the scheduler