Skip to content
Snippets Groups Projects
Commit baa20001 authored by vlorentz's avatar vlorentz
Browse files

Add CRAN and Crates

parent 739caaef
No related tags found
No related merge requests found
......@@ -17,6 +17,7 @@ repos:
hooks:
- id: codespell
name: Check source code spelling
args: [-L crate]
stages: [commit]
- repo: local
......
......@@ -94,6 +94,7 @@ forges:
lister:
status: dev
issue: https://gitlab.softwareheritage.org/swh/devel/swh-lister/-/issues/4547
supports_last_update: true
loader:
status: dev
issue: https://gitlab.softwareheritage.org/swh/devel/swh-loader-core/-/issues/4579
......@@ -108,6 +109,7 @@ forges:
lister:
status: dev
issue: https://gitlab.softwareheritage.org/swh/devel/swh-lister/-/issues/4520
supports_last_update: true
loader:
status: dev
issue: https://gitlab.softwareheritage.org/swh/devel/swh-lister/-/issues/2833
......@@ -122,17 +124,25 @@ forges:
lister:
status: prod
id_in_swh_web: CRAN
supports_last_update: true
loader:
status: prod
metadata:
intrinsic: collected
extrinsic: none
crates:
name: Crates
lister:
status: dev
issue: https://gitlab.softwareheritage.org/swh/devel/swh-lister/-/issues/1424
supports_last_update: true
loader:
status: dev
issue: https://gitlab.softwareheritage.org/swh/meta/-/issues/4104
metadata:
intrinsic: collected
extrinsic: not collected
grant: sloan-hashbang-2022
developer: hashbang
......
......@@ -3,17 +3,19 @@
CRAN
====
.. todo::
This page is a work in progress.
.. include:: dynamic/cran_status.inc
TODO:
The `Comprehensive R Archive Network <https://cran.r-project.org/>`_ is the package
management system of the R language.
CRAN does not expose a language-agnostic API with the information we need, so for
simplicity/efficiency, |swh|'s CRAN lister calls a small R script in order to list
all R packages.
Then for each package, it creates an origin with
:file:`https://cran.r-project.org/package={package_name}` as URL.
* description of the software origin
* summary of the lister's algorithm
* summary of the loader's algorithm
* URL pattern
* collect extrinsic metadata?
* index extrinsic metadata?
* index intrinsic metadata?
R packages have intrinsic metadata, mostly the :file:`DESCRIPTION` file in their root
directory, in the `deb822 <https://manpages.debian.org/bookworm/dpkg-dev/deb822.5.en.html>`_
format.
|swh|'s R loader parses it to extract authorship information, but this file is otherwise
not parsed yet.
......@@ -3,17 +3,12 @@
Crates
======
.. todo::
This page is a work in progress.
.. include:: dynamic/crates_status.inc
TODO:
`crates.io <https://crates.io/>`_ is the package manager of the `Rust programming language
<https://www.rust-lang.org/>`_.
* description of the software origin
* summary of the lister's algorithm
* summary of the loader's algorithm
* URL pattern
* collect extrinsic metadata?
* index extrinsic metadata?
* index intrinsic metadata?
It relies on `an index hosted on GitHub <https://github.com/rust-lang/crates.io-index>`_,
and provides `database dumps <https://crates.io/data-access>`_, which |swh| uses to
list packages, and create origins using this pattern:
:file:`https://crates.io/crates/{crate}`.
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment