Skip to content
Snippets Groups Projects
  1. Apr 26, 2022
  2. Feb 07, 2022
  3. Jan 11, 2022
  4. Dec 16, 2021
  5. Oct 07, 2021
  6. Jun 09, 2021
  7. Apr 26, 2021
    • Antoine Lambert's avatar
      tox: Add sphinx environments to check sane doc build · 3f73d972
      Antoine Lambert authored
      Enable to check package documentation can be built without producing
      sphinx warnings.
      
      The sphinx environment is designed to be used in continuous integration
      in order to prevent breaking documentation build when committing changes.
      
      The sphinx-dev environment is designed to be used inside a full swh
      development environment.
      
      Related to T3258
      3f73d972
  8. Mar 08, 2021
  9. Mar 05, 2021
  10. Mar 02, 2021
  11. Feb 23, 2021
    • Tushar Goel's avatar
      Add Orchestrator · 89092343
      Tushar Goel authored
      
      This is to build a mechanism to write to write the data from clearcode database which has been mapped with swh storage into swh RawExtrensicMetadata, and the data that has not been mapped to
      a table unmapped_data. This process of orchestration will run periodically and will only try to map new data that has been entered after the last orchestration process and the data that was
      not mapped in last orchestration.
      
      Initialize tables if they don't exist in database. Initialize swh storage and add MetadataAuthority, MetadataFetcher, then map previously unmapped data and get last run date of orchestration
      then read data from clearcode and orchestor rows from clearcode DB (if whole row is mapped then in metadataStorage, if partial or no data is matched then store that row in unmapped data table
      (for future mapping purpose), if tool of row is fossoloy then skip that row.
      
      Add tests and docstrings
      
      Signed-off-by: default avatarTushar Goel <tushar.goel.dav@gmail.com>
      89092343
  12. Feb 19, 2021
  13. Feb 16, 2021
  14. Feb 02, 2021
    • Tushar Goel's avatar
      Map a row from clearcode toolkit with software heritage archive · 99402872
      Tushar Goel authored
      
      This is to build a mechanism to map a row [path(Primary Key), content(binary data), last_modified_date(timestamp with timezone), map_error(error message while mapping), uuid] from clearcode toolkit database, with software heritage archive using content table for sha1 and revision table for sha1_git and extract required information from that row. Then return list of data that has been mapped and mapping status(if able to map every hash of that row, will return True, else return False) so the row that is not being able to map for now can be stored in a state, and can be mapped in future.
      
      Add various exception classes in error.py that can be raised while mapping a row. Check if that row is a definition or harvest and also check if that row does not has invalid path, raise exception if path is invalid. If row is a definiton then map the data using map_definition and if it is a harvest then map it using map_harvest. Use storage instead of sql queries while mapping with the data inside archive. Add tests to cover all the cases and add docstrings to explain how every function works.
      
      Signed-off-by: default avatarTushar Goel <tushar.goel.dav@gmail.com>
      99402872
  15. Jan 20, 2021
  16. Dec 18, 2020
  17. Dec 17, 2020
  18. Dec 15, 2020
Loading