Skip to content
Snippets Groups Projects

Introduce bioconductor lister

Merged Kumar Shivendu requested to merge KShivendu/swh-lister:bioconductor-lister into master

The way this lister works is that it scrapes origins from pages like:

Note the pattern https://bioconductor.org/packages/json/<release-version>/<category>/packages.json

Here, I've coined the term "category" to represent the 4 possible values: bioc, workflows, data/annotation, and data/experiment.

This packages.json file is a dictionary of all the packages. We iterate through them and send them to the scheduler.

Related to #4699

Refs. swh/meta#5051

Edited by Antoine R. Dumont

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
9
10 import iso8601
11
12 from swh.scheduler.interface import SchedulerInterface
13 from swh.scheduler.model import ListedOrigin
14
15 from ..pattern import StatelessLister
16
17 logger = logging.getLogger(__name__)
18
19 Release = str
20 Category = str
21 BioconductorListerPage = Tuple[Release, Category, Dict[str, Any]]
22
23
24 DEFAULT_BIOCONDUCTOR_RELEASES = [
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Please register or sign in to reply
    Loading