-
v1.7.1 · 73de7917
v1.7.1
Doc:
* Advertise 2024-08-23-popular-500-python teaser dataset
* Advertise 2024-08-23 graph export
* The 2019-01-28-popular-3k-python dataset did not have a compressed graph
* Fix typo
Fixes:
* generate_subdataset: Fix crash on excessively large release.message values
Internal:
* journalprocessor: Fix typo spotted after codespell upgrade
* journalprocessor: Fix black formatting
* Apply swh-py-template v0.2.3
-
v1.3.3 · 8dd81df3
v1.3.3
* luigi: Work around absence of metadata files in old exports
* luigi: Add support for downloading exports in parallel
* luigi: Delete all files in the root dir instead of the root dir itself
* docs: Add links to Terms of Use and 'How to use SWH data'
* Fix Sphinx role (:cls: -> :class:)
-
v1.0.3 · a01a82fc
v1.0.3
* luigi.UploadExportToS3: Skip upload of already-uploaded files
* luigi: Dynamically list directories instead of using object_types
* luigi: Read meta/export.json instead of relying on stamp files
* docs/index.rst: Add missing new line at end of file
* docs/index.rst: Fix sphinx tag name
* docs: Include module indices only when building standalone package doc
-
v1.0.2 · c717f60f
v1.0.2
* exporters/orc: Fix crash on visit status with no type
* luigi.CreateAthena: Fix validation of DB name
* luigi.RunExportAll: Default to exporting all formats
-
v1.0.1 · a1cf9b87
v1.0.1
* luigi: Rename classes to be globally unambiguous
* luigi: Send progress reports to the scheduler
-
v1.0.0 · f8a13718
v1.0.0
* Update for swh-objstorage >= 2.0.0
* docs/athena: Fix value of --location-prefix
* Fix link to the 2021-03-23 compressed dataset
* cli: Sort object types to be processed in the right order
* cli: Increase open file descriptor limit to support 256 open LevelDBs
* athena: Fix create_table to work with restricted permissions
* Add luigi tasks