Skip to content
Snippets Groups Projects
20240701_faq.md 25.45 KiB
tags: documentation, ambassadors

Why this new version? Slight changes have to be made. See questions in: https://hedgedoc.softwareheritage.org/6hFGAHdUTY2wWKr5OqwzkQ?both# And: https://matrix.to/#/!ibcUTtDWIkKlNHODcp:matrix.org/$KGh0Z7J59gDRKj_M1xoA9XNCV1sMpm6JO9ON4WffGBs?via=matrix.org&via=sdfa3.org https://matrix.to/#/!ibcUTtDWIkKlNHODcp:matrix.org/$akqQRW0J6FvZsgyoqD0oZ5Xitusq7DPRj82V2oxBlL8?via=matrix.org&via=sdfa3.org https://matrix.to/#/!ibcUTtDWIkKlNHODcp:matrix.org/$vMENrLIoVuZ-ZF-OSBXWycWhwBzwO8ZR1eElCcgbZ5w?via=matrix.org&via=sdfa3.org https://matrix.to/#/!ibcUTtDWIkKlNHODcp:matrix.org/$Nspro-CpYaVoeQ-EDGZKJYWMMeVtgUpxbiD_vQLm8Oo?via=matrix.org&via=sdfa3.org

FAQ Software Heritage (website)

1. General

1.1 What is Software Heritage (SWH)?

Software Heritage is an open, non-profit infrastructure unveiled in 2016 by Inria. It is supported by a broad panel of institutional and industry partners, in collaboration with UNESCO.

Expand for details

The long term goal is to collect all publicly available software in source code form together with its development history, replicate it massively to ensure its preservation, and share it with everyone who needs it.

For more information about the Software Heritage mission.

1.2 What is the Software Heritage archive?

The Software Heritage archive is the largest public collection of source code in existence. Visit the archive on https://archive.softwareheritage.org.

1.3 What is the size of the archive?

The archive is growing over time as we crawl new source code from software projects and development forges. You can see live counters of the archive contents, as well as a breakdown by crawled origins, on https://archive.softwareheritage.org.

1.4 What are the services provided by Software Heritage?

Software Heritage is a mutualised platform that offers a growing number of services to a large spectrum of users.

The features page provides an overview of the features currently available. This includes, for example, archiving software repositories, browsing the archived source code and providing persistent identification.

2. Archiving software

2.1 Which software platforms (forges, package managers, etc.) are archived?

The software origins that are currently regularly archived are listed on the main archive page.

Expand for details

Here is an excerpt of this list:

  • Git repositories from multiple forges (GitHub, Bitbucket, GitLab instances, cgit instances, Gitea instances, Phabricator instances, etc.)
  • SVN repositories...
  • Mercurial repositories...
  • Debian packages in apt
  • Python packages in PyPI
  • R packages in CRAN
  • NPM packages in npm.org
  • zip or archives tarballs in gnu.org

2.2 If my code is on GitHub/GitLab/Bitbucket, is it already archived in Software Heritage?

It might be, as we crawl these and other popular forges regularly. Search for your code repository on https://archive.softwareheritage.org/browse/search/.