  1. Aug 03, 2022
  2. Jun 15, 2022
    • Add arch lister module (origins from archives). · 1bf11aa2
      Franck Bret authored
      After a first attempt with D7812, this one uses a different strategy to
      retrieve origins.
      
      Fetch and extract "core.files.tar.gz", "extra.files.tar.gz" and "community.files.tar.gz" from archives.archlinux.org. That step ensures that we have a list of "official" packages.
      Parse metadata from the 'desc' file to build origin URLs.
      Scrape the origin URL to get artifact metadata listing all versions of a package.
      
      It also fetches and extracts unofficial 'arm' packages from archlinuxarm.org, but in this case we cannot get all versions of an arm package.
      
      Related T4233
      1bf11aa2
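The 'desc' parsing step described in this commit can be sketched as follows. This is a minimal illustration, not the lister's actual code: it assumes the usual pacman 'desc' layout (a `%FIELD%` header line followed by one or more value lines), and the names `parse_desc` and `origin_url`, as well as the URL template, are hypothetical.

```python
from typing import Dict, List


def parse_desc(text: str) -> Dict[str, List[str]]:
    """Parse a pacman-style 'desc' file into {FIELD: [value, ...]}."""
    fields: Dict[str, List[str]] = {}
    current = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if len(line) > 2 and line.startswith("%") and line.endswith("%"):
            # A "%FIELD%" line opens a new field.
            current = line[1:-1]
            fields[current] = []
        elif line and current is not None:
            # Non-empty lines belong to the most recent field.
            fields[current].append(line)
    return fields


def origin_url(
    fields: Dict[str, List[str]],
    repo: str,
    # Assumption: this URL pattern is illustrative only.
    template: str = "https://archlinux.org/packages/{repo}/{arch}/{name}",
) -> str:
    """Build an origin URL from the NAME and ARCH fields of a parsed 'desc'."""
    return template.format(
        repo=repo, arch=fields["ARCH"][0], name=fields["NAME"][0]
    )
```

Multi-valued fields such as `%DEPENDS%` come back as lists, which is why every field maps to a list of strings.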
  3. May 23, 2022
  4. May 20, 2022
  5. May 09, 2022
  6. May 02, 2022
    • maven: Prevent UnicodeDecodeError when processing pom file · 3f6c7edc
      Antoine Lambert authored
      Pass the raw bytes of the pom file content to xmltodict.parse and let
      it do the string decoding based on the encoding declared in the pom file.
      
      If the string decoding fails due to an invalid declared encoding,
      xml.parsers.expat.ExpatError will be raised and will be caught by
      the lister, ignoring the pom file and continuing listing.
      
      Related to T3874
      3f6c7edc
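The approach in this commit can be illustrated with a small sketch. To stay self-contained it uses the standard-library `xml.parsers.expat` module directly instead of xmltodict (which is built on top of expat), so it is not the lister's code. The function names and the choice of reading `<scm><connection>` are assumptions made for the example.

```python
from typing import Iterable, List, Optional
from xml.parsers import expat


def scm_connection(pom_bytes: bytes) -> Optional[str]:
    """Return the text of <scm><connection>, or None if it is absent.

    The parser receives raw bytes, so decoding follows the encoding
    declared in the XML prolog. Raises expat.ExpatError on malformed input.
    """
    path: List[str] = []
    chunks: List[str] = []

    def start(name, attrs):
        path.append(name)

    def end(name):
        path.pop()

    def chars(data):
        # Text may arrive in several chunks, so collect and join later.
        if path[-2:] == ["scm", "connection"]:
            chunks.append(data)

    parser = expat.ParserCreate()
    parser.StartElementHandler = start
    parser.EndElementHandler = end
    parser.CharacterDataHandler = chars
    parser.Parse(pom_bytes, True)
    return "".join(chunks).strip() or None


def list_scm_connections(poms: Iterable[bytes]) -> List[str]:
    """Collect SCM connections, skipping pom files that fail to parse."""
    found = []
    for pom in poms:
        try:
            connection = scm_connection(pom)
        except expat.ExpatError:
            continue  # ignore the broken pom file and keep listing
        if connection:
            found.append(connection)
    return found
```

Catching the error in the loop rather than in the parser keeps one bad pom file from stopping the whole listing.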
  7. Apr 29, 2022
    • maven: Handle null mtime value in index for jar archive · 0222a8f5
      Antoine Lambert authored
      There are cases where the modification time for a jar archive in
      a maven index is null, which was leading to a processing error
      in the lister.
      
      So handle that case to avoid premature exit of the listing process.
      
      Related to T3874
      v2.9.1
      0222a8f5
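A defensive parse of this kind might look like the sketch below. The function name, the accepted spellings of a missing value, and the millisecond-epoch representation are all assumptions for illustration; the commit does not describe the index format.

```python
from datetime import datetime, timezone
from typing import Optional, Union


def parse_mtime(raw: Union[str, int, None]) -> Optional[datetime]:
    """Convert an index mtime to a UTC datetime, or None when it is missing.

    Assumption: the value is a Unix timestamp in milliseconds, and a
    missing value shows up as None, an empty string, or the text "null".
    """
    if raw is None or str(raw).strip().lower() in ("", "null"):
        return None
    return datetime.fromtimestamp(int(raw) / 1000, tz=timezone.utc)
```

Returning `None` lets the caller decide how to treat an archive without a modification time instead of aborting the listing.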
    • maven: Remove extraction of groupId and artifactId from pom files · 378613ad
      Antoine Lambert authored
      When parsing pom files, we are only interested in extracting a VCS URL
      (git, hg, svn) in order to create associated loading tasks.
      
      In that case, the groupId and artifactId are not used by the lister,
      so remove their extraction. This also prevents errors when
      that information is missing from pom files.
      378613ad
    • maven: Create one origin per package instead of one per package version · 22bcd9de
      Antoine Lambert authored
      Previously the maven lister was creating an origin for each source
      archive (jar, zip) it discovered during the listing process.
      
      This is not the way Software Heritage decided to archive sources
      coming from package managers. Instead one origin should be created
      per package and all its versions should be found as releases in the
      snapshot produced by the package loader.
      
      So modify the maven lister in order to create one origin per package
      grouping all its versions.
      
      This change also modifies the way incremental listing is handled:
      ListedOrigin instances will be yielded only if we discovered new
      versions of a package since the last listing.
      
      Tests have been updated to reflect these changes.
      
      Related to T3874
      22bcd9de
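The grouping and incremental behaviour described in this commit can be sketched as follows. This is an illustration under stated assumptions, not the lister's implementation: the tuple layout, the function names, and the `known_versions` mapping standing in for the lister state are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Set, Tuple

# Hypothetical record for one discovered archive: (package, version, archive_url)
Artifact = Tuple[str, str, str]


def group_by_package(artifacts: Iterable[Artifact]) -> Dict[str, Dict[str, str]]:
    """Group archives so that each package maps to {version: archive_url}."""
    grouped: Dict[str, Dict[str, str]] = defaultdict(dict)
    for package, version, url in artifacts:
        grouped[package][version] = url
    return dict(grouped)


def origins_to_yield(
    grouped: Dict[str, Dict[str, str]],
    known_versions: Dict[str, Set[str]],
) -> Iterator[Tuple[str, Dict[str, str]]]:
    """Yield only the packages with versions not seen in the last listing."""
    for package, versions in grouped.items():
        if set(versions) - known_versions.get(package, set()):
            yield package, versions
```

A package is yielded with all of its versions, not only the new ones, so a single origin always describes the whole package.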
  8. Apr 28, 2022
    • crates: Create one origin per package instead of per version · 985b71e8
      Franck Bret authored
      Previously we had as many origins as versions for a crate package; each URL
      was a link to a specific version of the crate.
      
      Refactor to have one origin per package name and add an 'artifacts' entry to
      extra_loader_arguments that lists all versions, package URLs and checksums.
      The origin URL is now a link to the related HTTP API endpoint for a package name.
      
      Related to T4104
      985b71e8
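The resulting origin shape can be sketched as a plain dictionary. The key names inside each artifact, the function name, and both URL patterns are assumptions for illustration; only the 'artifacts' entry under extra_loader_arguments comes from the commit message.

```python
from typing import Any, Dict, Iterable, Tuple


def crate_origin(name: str, versions: Iterable[Tuple[str, str]]) -> Dict[str, Any]:
    """Build one origin for a crate from (version, checksum) pairs."""
    artifacts = [
        {
            "version": version,
            # Assumption: illustrative download URL pattern.
            "url": f"https://static.crates.io/crates/{name}/{name}-{version}.crate",
            "checksum": checksum,
        }
        for version, checksum in versions
    ]
    return {
        # Assumption: illustrative API endpoint for the package name.
        "url": f"https://crates.io/api/v1/crates/{name}",
        "extra_loader_arguments": {"artifacts": artifacts},
    }
```

For example, `crate_origin("example", [("0.1.0", "abc"), ("0.2.0", "def")])` returns a single origin whose 'artifacts' list has two entries, one per version.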
  9. Apr 26, 2022
  10. Apr 25, 2022
  11. Apr 21, 2022
  12. Apr 14, 2022
  13. Apr 13, 2022
  14. Apr 08, 2022
  15. Apr 06, 2022
  16. Mar 28, 2022
  17. Mar 22, 2022
    • pytest: Exclude build directory for tests discovery · ff0035a6
      Antoine Lambert authored
      Because setuptools copies test modules into subdirectories of the
      build directory, pytest fails with ImportPathMismatchError
      exceptions when invoked from the root directory of the module.
      
      So ignore the build folder when discovering tests.
      ff0035a6
  18. Mar 11, 2022
  19. Feb 18, 2022
  20. Feb 17, 2022
  21. Feb 16, 2022
  22. Feb 14, 2022
    • sourceforge: fix support for listing bzr origins · 31b4429c
      Raphaël Gomès authored
      Bazaar support was removed a long time ago and predates a lot of the new
      mechanisms in place in the API. Unfortunately, it looks like a lot of
      the URLs are offline now, but there are still a few projects that can be
      listed, and this is pretty low-effort.
      v2.6.4
      31b4429c
  23. Feb 10, 2022
  24. Feb 09, 2022
  25. Feb 08, 2022