Skip to content

Tags

Tags give the ability to mark specific points in history as being important
  • v8.0.8

    ac0e0030 · Publish v2.0.8 ·
    v8.0.8
    
    Fixes (Rust):
    
    * Transposed graph doesn't respect subgraph filter
    * Add support for hashes starting with [a-f] in swhid!() macro
  • v8.0.7

    317bfdc3 · Publish v8.0.7 ·
    v8.0.7
    
    Bug fixes (Python):
    
    * Rename to a more explicit name the tool that maps author IDs to full names
    * Rename end-to-end tests to end-to-end checks
    * Rework end-to-end checks so that they can work on different graphs
    * Add checks for the mapping between author IDs and full names
    * Fix luigi pipeline when compressing a graph with sensitive data
    
    Improvements (Python):
    
    * CLI: Improve the rpc-serve and grpc-serve help
  • v8.0.6

    331824f8 · Publish v8.0.6 ·
    v8.0.6
    
    Bug fixes (Python):
    
    * Add support for sensitive files as input and output for luigi tasks
  • v8.0.5

    56a11f30 · Publish v8.0.5 ·
    v8.0.5
    
    Bug fixes (Python):
    
    * Add missing propagation of local_sensitive_graph_path
    * Add missing docstring that makes Sphinx error
    * Change default LLP gammas to match production
    
    Soundness fixes (Rust):
    
    * Fix unaligned array in in-memory Node2Type
    * Fix unaligned array reads in GOV and GOV
    * Add workarounds to support Miri
  • v8.0.4

    eb9e42b9 · Publish v8.0.4 ·
    v8.0.4
    
    Internal (Rust):
    
    * Cargo.lock: Bump arrow to v55
    * Cargo.lock Fix missing version bump in Cargo.lock
    
    Fixes (Rust):
    
    * Add support for arrow up to 56, to fix build on docs.rs
    
    Misc:
    
    * Synchronize swh-graph-stdlib version with swh-graph
  • v8.0.3

    v8.0.3
    
    Update lockfile
  • v8.0.2

    da0b32f1 · Publish v8.0.2 ·
    v8.0.2
    
    * Include a merge request that was mistakenly excluded from v8.0.1,
      which removes a new feature that wasn't meant to be introduced.
  • v8.0.1

    d4ac3d6d · Publish v8.0.1 ·
    v8.0.1
    
    Fix v8.0.0 release, which did not bump versions in Cargo.toml
  • v8.0.0

    v8.0.0
    
    Breaking changes (Python):
    
    * Remove SENSITIVE_DIR from example dataset
    
    Breaking changes (Rust):
    
    * Split NodeSet trait into ReadNodeSet and NodeSet
    * Move swh_graph::collections to swh_graph_stdlib
    
    New features (Python):
    
    * Add SubdatasetOriginsFromFile
    * Add CLI to symlink a graph (with option to copy some files)
    * Add default values to config checking inner function
    * Generalize SWHID::from_origin_url to support borrowed strings
    * http_client: Enable requests retry on transient errors
    
    New features (Rust):
    
    * Make id2fullname an integral part of swh-graph
    * stdlib: Introduce tree diff utility
    * Add SmallNodeSet
    * Add ContiguousSubgraph
    * Add swh-subgrapher script to build the list of SWHIDs for a subdataset
    * Inline DataFilesAvailability methods
    * Make iter_nodes support arbitrary IntoIterators instead of just slice.
    * Relax unnecessary SwhLabeledForwardGraph constraint to SwhForwardGraph
    
    Fixes (Python):
    
    * download: Execute zstdmt in quiet mode to avoid polluting stdout
    * Don't ignore configured batch size
    * Fix mismatch of object_types names between swh-graph and swh-export
    
    Fixes (Rust):
    
    * Make 'arrow' an optional dependency of swh-graph-topology
    
    Internal:
    
    * Refactor properties backends to provide indexing themselves
    * Use `shell.py` to run the compression pipeline
    * stdlib: Add missing crate metadata
  • v7.0.0

    v7.0.0
    
    Breaking changes (gRPC server):
    
    * Fix inconsistent extension for stats of .json graphs
    
    Breaking changes (Rust):
    
    * Refactor SwhidMphf to embed a permutation
    * Move stdlib to a separate `swh-graph-stdlib` crate
    
    Breaking changes (Python):
    
    * shell.py: Add support for capturing stderr
    
    New features (gRPC server):
    
    * Add num_nodes_by_type and num_arcs_by_type to StatsResponse
    
    New features (Rust):
    
    * Add accessors num_nodes_by_type/actual_num_nodes/num_arcs_by_type
    * Rename FilenameId to LabelNameId (there is an alias for
      backward-compatibility)
    * Add num_label_names() accessor, and iterators on LabelNameId
    * Switch AdaptiveNodeSet from SipHash to RapidHash
    * Extract full names and provide mapping from their corresponding author ID
    
    Fixes (gRPC server):
    
    * grpc-server: Fix deadlock when processing as many requests as there are CPU threads in Traverse
    
    Fixes (Python):
    
    * Fail luigi task when end-to-end tests fail
    * Don't expect -labelled.* files to be produced for graphs with no label
    * Pass missing `test_flavor` argument
    * Pass YAML config parameters to luigi
    * Turn end to end tests into a luigi task
    * Fix handling of local_sensitive_export_path with luigi
    
    Internal (Rust):
    
    * with NodeConstraint to parse CLI args
    * Fixes for Clippy 1.88.0
    * Update tonic
    * pyproject.toml: Use extend-exclude flake8 option instead of exclude
    * Apply swh-py-template v0.3.4 with copier
    * cli: Remove no longer needed type ignore
    * requirements-test: Remove use of deprecated types-click stubs
    * Fix MPH tests
    * Improve debug profile detection
    * Improve typing
    * Rename 'target' to 'profile' in places it had been forgotten
    
    Internal (Python):
    
    * Remove redundant OFFSETS and TRANSPOSE_OFFSETS steps
  • v6.7.6

    pytest_plugin: Workaround regression in aiohttp v3.12.6
    
    tools/dir2graph: Fix compatibility with swh-model v8.1.0
    
  • v6.7.5

    v6.7.5
    
    Fix (Rust):
    
    * Bump common_traits to fix compilation on Rust nightly
  • v6.7.4

    5331845a · Publish v6.7.4 ·
    v6.7.4
    
    Fixes (Rust):
    
    * Switch to released webgraph version
    
    Fixes (Python):
    
    * rust/tests/compress_label_names: Prevent possible infinite loop
    * Unmark ExtractNodes and ExtractLabels as using all CPU threads
    * Remove incorrect dependency of NODE_PROPERTIES on MAPS
    
    Internal (Rust):
    
    * Update dsi-progress-logger to 0.8.0
    
  • v6.7.3

    example_dataset: Update some edge labels and regenerate dataset
    
    e2e_tests: Fix comparison of graph elements with successors
    
    example_dataset: Fix generate_dataset script
    
    Stop publishing graph.{labels,persons}.csv.zst
    
    pytest_plugin: Use safer multiprocessing start method for graph server
    
    pytest_plugin: Ensure to kill swh-graph-grpc-serve process on teardown
    
    Fix compiler errors and Clippy warnings on Rust 1.87.0
    
    Update webgraph to fix 'swh-graph-index labels-ef'
    
    With webgraph >=0.2.0, labels-ef now expects num_nodes instead of num_nodes+1
    
    Add missing dependency of swh-graph-grpc-server on swh-graph's serde feature
    
    Update Webgraph to fix incorrect LLP output
    
    Add exit code to the log file
    
    Document how to generate custom graphs to run tests on
    
    grpc-server: Add support for --graph-format json
    
    grpc-server: Add support for using empty lists of labels/arcs if not available
    
    Add WebgraphAdapter
    
  • v6.7.2

    v6.7.2
    
    Evolutions:
    - Add support for making properties optional at runtime
    - Add methods SwhGraph::{iter_nodes, par_iter_nodes}
    - Set 'profile' in config.py instead of cli.py
    - Rename 'target' to 'profile' in e2e tests
    
    Fixes:
    - Fix label width computation
    - Fix SUCCESSOR mask in node_builder_bitmasks to cover SUCCESSOR_LABEL
    - Fix missing renames
    - Remove Java-specific memory estimation
    - Simplify type of deserialized VecGraphs
    - stdlib/visit: Relax trait constraints on the graph
    - ordered_swhids: Fix unsound concurrent writes to a shared array
    
    Dependency updates:
    - Regenerate Cargo.lock
    - Update to webgraph 0.2.x
    - Adapt to new version of swh-export
    - pyo3: update to 0.24.0
    - Switch to released version of orc-rust
    - Remove unused dependency on swh-storage
    
    Documentation:
    - Add compression_steps.dot to .gitignore
    - Automatically generate compression tasks diagram
    - Update hardware and recommended system config for graph compression
    - Use anonymous references to avoid duplicate ids
    - Fix markdown syntax
    - README: add reference to the WWW 24 paper about swh-graph-rs
    - Document all features on docs.rs
    
  • v6.7.1

    v6.7.1
    
    * Un-revert orc-rust dependency so Python tests pass
  • v6.7.0

    9596534c · Publish v6.7.0 ·
    v6.7.0
    
    Fixes (Rust):
    
    * Fix SwhGraphProperties::label_name_id to correctly bisect on 2023-09-06 and newer graphs
    
      Positive results of label_name_id ("the name exists and has this id"), fs_resolve_name, and fs_resolve_path were correct, negative results may be incorrect.
      Positive results of find_head_rev may be incorrect (in the case of repositories with both `refs/heads/main` and `refs/heads/master`) as well as its negative results.
    
      The gRPC server is not affected.
    
    Improvements (Rust):
    
    * Replace jemalloc with mimalloc as swh-graph-extract's allocator
    
    Fixes (Python):
    
    * Make swh-export an optional dependency, with a new 'export' extra
    * Fix compression for example dataset generation
    * Migrate from deprecated pkg_resources package to importlib.metadata
    
    New features (Python):
    
    * Expose a base Luigi Task class to upload to S3 without creating an Athena table
    
    Internal:
    
    * Fix flakiness of statsd tests on Jenkins
    * Fix Clippy warnings on Rust 1.85.
    * new tool: counters. Counts nodes by type, and total/max content length
  • v6.6.0

    v6.6.0
    
    New features (Python):
    
    * Make swh.graph.shell.Rust usable by other packages
    * Add end-to-end tests for newly compressed graphs
    
    New features (Rust):
    
    * Relax constraint that UnderlyingGraph labels must impl Copy to simplify code
    
    Internal:
    
    * luigi/compressed_graph: Fix flake8 warnings
    * webgraph: Fix black formatting
    * Apply swh-py-template v0.3.3 with copier
    * requirements-test: Remove no longer needed pytest-asyncio dependency
    * Move dataset generation to other repositories
    
    Documentation:
    
    * Document that 'cargo install' is needed to reindex
  • v6.5.0

    v6.5.0
    
    Changes:
    
    * Rename swh-dataset dependency to swh-export
    
    New features:
    
    * Test new compressed graphs
    
    Fixes:
    
    * Add missing dependency of CompressGraph on Stats step
    * Fix pytest detection
    * docker: Update dockerfile to use swh-graph rust binaries
    * doc: Update CLI commands to compress a graph
  • v6.4.0

    v6.4.0
    
    The Rust crate is not released in this version of swh-graph, as it depends
    on an un-released version of orc-rust, which orc-rust would reject
    
    Python minor breaking changes:
    
    * Drop support for python 3.7
    
    Python improvements:
    
    * Add some defaults values to simplify automation
    * Add RunNewGraph meta-task
    
    Python bug fixes:
    
    * Add support for blobs.csv.zst referencing unknown or skipped contents
    * Add a warning notice about the output of 'swh graph list-datasets'
    * docs/compression: Add missing --no-sign-request option to aws command
    * shell: Fix hangs in pytest
    * Create provenance directory when it does not exist
    * luigi: Fix crash in derived datasets declaration when compressed graph is not available
    * Add version restriction to avoid a regression in pyarrow
    * Add grpcio-tools version requirement
    * Update protobuf to 5.29
    * docs/quickstart: Add pv in required debian package to install
    * aggregate_datasets: Use precomputed revisions-in-origins table from provenance
    
    Rust bug fixes:
    
    * Update dataset-writer to avoid exceeding Parquet's max row-group count
    * Update orc-rust to v0.5.0+three patches
    
    Internal:
    
    * grpc_server: Fix clippy::needless_lifetimes warnings
    
    Rust new features:
    
    * provenance: Add revisions-in-origins index generation