- Apr 13, 2021
-
-
vlorentz authored
-
- Apr 06, 2021
-
-
vlorentz authored
We want ro remove Revision.metadata. This aligns HgBundle20Loader's behavior on HgLoaderFromDisk's.
-
vlorentz authored
It already writes it with raw_extrinsic_metadata_add/extid_add, and read it with extid_get_*. This code was only kept for compatibility while we were migrating the extids. This is now done, so this code is useless.
-
- Mar 30, 2021
-
-
vlorentz authored
SWHID has a specific meaning defined in https://docs.softwareheritage.org/devel/swh-model/persistent-identifiers.html (in short, the sha1_git is only part of a SWHID), but this variables store only the hash part of the SWHIDs.
-
vlorentz authored
-
vlorentz authored
For now, ExtIDs are used in addition to revision metadata. But in the near future, we want to migrate nodeids from revision metadata to the ExtID storage, and drop all revision metadata.
-
vlorentz authored
-
vlorentz authored
This is a minor performance optimization, removing items from the call to revision_get when we know their result will be None. Motivation: A future commit will refactor this function, and dealing only with revision ids makes it simpler.
-
- Mar 29, 2021
-
-
vlorentz authored
Instead of using hashutil.MultiHash directly and converting to model.Content.
-
- Mar 05, 2021
- Feb 26, 2021
-
-
Summary: When a repository has corrupted revision, the revision and its descendants are not loaded. This commit only deals with missing `filelogs` and configures the exclusion system. Missing `filelogs` are recoverable errors that should be skipped or saved as `SkippedContent` but being missing the `SkippedContent` cannot be calculated. This point is left for future commits. Reviewers: #reviewers Subscribers: ardumont, Alphare, vlorentz Differential Revision: https://forge.softwareheritage.org/D4688
-
Raphaël Gomès authored
This generalizes the work done in ef3a2ba7 to (supposedly) all places invoking Mercurial. In short, this limits the environment to the smallest subset needed (i.e. `$PATH`) and uses the Mercurial-specific variables to disable user customizations and configs.
-
Antoine R. Dumont authored
-
Summary: By looking at the previous snapshot heads, loading of an unchanged repository will be uneventful. Reviewers: #reviewers, douardda Reviewed By: #reviewers, douardda Subscribers: douardda Differential Revision: https://forge.softwareheritage.org/D4643
-
Raphaël Gomès authored
By default, Mercurial updates to the default revision after cloning, which - while probably good UX - is wasteful in the context of automated archival and server-side operations.
-
- Feb 25, 2021
-
-
Antoine R. Dumont authored
This module is: - not referenced in the setup.py (so no simple cli call) - not tested - using an in-memory storage - no longer the canonical way of triggering a mercurial ingestion Note that this also fixes the visit date cli input. When user provided, this date should be parsed as a datetime prior to being passed to the loader constructor.
-
- Feb 23, 2021
-
-
Raphaël Gomès authored
There is currently no end-to-end test to catch this regression. I'm not certain whether there is something in place to write such tests, but this is already better to have a working cli.
-
Raphaël Gomès authored
This change adds two environment variables that have been supported for 10+ years by Mercurial to make its output predictable for use in scripts. This was already done by a previous patch in tests, but it is also (even more?) useful here.
-
Raphaël Gomès authored
We don't want the user's environment to affect `hg`'s behavior. The bare minimum is the `PATH`. In the next patch, we add the Mercurial-specific variables to ensure a "vanilla" behavior.
-
Raphaël Gomès authored
The next patch will add an import and `isort` was complaining. It appears that this file hasn't been changed since the `isort` change.
-
Raphaël Gomès authored
Tests can break when ran in user environments if the output is customized, either by config options (like aliases) or extensions.
-
- Feb 17, 2021
-
-
Antoine R. Dumont authored
Note that this also updated some docstrings and types along the way. Related to T1410
-
- Feb 15, 2021
-
-
Antoine R. Dumont authored
This avoids failing visits for the wrong comparison check [1] [1] https://sentry.softwareheritage.org/share/issue/27017710a5ec49f991910a780d38d4ab/
-
Vincent Sellier authored
Related to T3030
-
- Feb 09, 2021
-
-
Vincent Sellier authored
Related to T3030
-
- Feb 03, 2021
-
-
Antoine R. Dumont authored
This should unstuck the debian build which complains about those not being registered.
-
- Feb 01, 2021
-
-
Antoine R. Dumont authored
-
- Dec 01, 2020
-
-
Antoine Cezar authored
By looking at differences between revisions, the repository tree is updated rather that fully rebuild for each one. Observed load time improvement on https://www.mercurial-scm.org/repo/hg/ 1:11:02 -> 47:58
-
- Nov 30, 2020
-
-
Antoine Cezar authored
Avoid recalculation of unchanged content hash between revisions
-
Antoine Cezar authored
Rather than relying on mercurial bundles this loader expect a local repository.
-
- Nov 12, 2020
-
-
Antoine Cezar authored
First updatable example repository documented by its generation script.
-
Antoine Cezar authored
Generated json files with `swh/loader/mercurial/tests/data/build.py` for existing repositories and added them to `Hg20BundleLoader` tests. Introduce `LoaderChecker` as a standardized way to test repositories against json files.
-
Antoine Cezar authored
This build script purpose is to create example repositories from bash scripts and extract assertion data from them into json files. Advantages: - the bash script documents the repository creation - automating creation allow easy repository update - automation extraction allow easier update of assertion data
-
Antoine Cezar authored
-
- Oct 02, 2020
-
-
Stefano Zacchiroli authored
-
Antoine R. Dumont authored
Related to T1532 T1410 D3965
-
- Sep 25, 2020
-
-
Nicolas Dandrimont authored
vcversioner was already replaced with setuptools-scm months ago.
-
- Sep 18, 2020
-
-
Antoine R. Dumont authored
Fixes build [1] [1] https://jenkins.softwareheritage.org/job/DLDHG/job/tests/793
-
- Sep 17, 2020
-
-
Antoine Lambert authored
Related to T2610
-