Investigate CRAN revisions without an origin
19:58 <vlorentz> we have a revision 00a867beb2ad8e203f242e9843d2e88de0856cda
19:58 <vlorentz> which was obviously created by the CRAN loader, from its content
19:59 <vlorentz> and the origin should be https://cran.r-project.org/package=mljar
19:59 <vlorentz> but that origin doesn't exist
19:59 <vlorentz> additionally, the revision is referenced by a snapshot (f7ce27eb87dc82e02b6178db3c9d9dbc9fa055d6) which isn't reference by any visit status
19:59 <vlorentz> I could understand an orphan snapshot, but not the missing origin
20:01 <vlorentz> any idea what happened?
20:01 <vlorentz> the revision was created on 2020-01-09T13:14:43.438615+00:00, btw
20:02 <+olasd> one could fish out the cran_loader worker logs for that date, see what happened
20:03 <+olasd> (using kibana)
20:03 <+olasd> but for such old logs you'd need to reopen the elasticsearch indexes first
but I can't access the logs.
I opened the index, but Kibana doesn't see it. (and it's half unusable anyway, swh/infra/sysadm-environment#2534 (closed)), and ES doesn't return any document when searching in the index:
$ curl "esnode1.internal.softwareheritage.org:9200/swh_workers-2020.01.09/_search" -X GET -H "Content-Type: application/json" -d '{"query": {"match_all": {}}}'
{"took":0,"timed_out":false,"_shards":{"total":0,"successful":0,"skipped":0,"failed":0},"hits":{"total":{"value":0,"relation":"eq"},"max_score":0.0,"hits":[]}}
List of "grand-orphan" revisions I found so far:
- 00a867beb2ad8e203f242e9843d2e88de0856cda (origin: https://cran.r-project.org/package=mljar )
- 024bee9f941d1cc9f11daebc72b42332a7af9f31 (origin: https://cran.r-project.org/package=ROI.plugin.ipop )
- 028e9890a9287b35851c48ca351641743542d030 (origin: https://cran.r-project.org/package=PhaseType )
- 08dd80bdaf5cf36f940865e8d7e0556cad6d881d (origin: https://cran.r-project.org/package=SchemaOnRead )
- 09e34df7727658fd5db298bb3d4dd862a68768e7 (origin: https://cran.r-project.org/package=multilevelMatching )
- 0b17032b3021e24424f8d5d2694b3cf207df6854 ( https://cran.r-project.org/package=MetamapsDB )
- 0d9502792997d32416f9b2e0582d6eb80185723b https://cran.r-project.org/package=spartan
- 243056d1bfb23e22844b477f8dbbd09e5bf5ebf6 https://cran.r-project.org/package=ActivityIndex
- 2431636ecb54fb0d0e340eb82e0abaf6ef3897d7 https://cran.r-project.org/package=DZEXPM
- 27254839c516a21d72de8ebdfe80ee559d2affe7 https://cran.r-project.org/package=rinat
- 2c478e2d48e70b2bd5dbf53c271f9727511ab8b3 https://cran.r-project.org/package=ReliabilityTheory
- 2db27bd5b2907bd8a71c437a972016e1480dc2d9 https://cran.r-project.org/package=IPtoCountry
- 2e8bbe5ab4f49c6e9f73c995d5685d9b78d36b1e https://cran.r-project.org/package=refGenome
- 3662f0d1e781eaaa6796c4a639111d7b9a787163 https://cran.r-project.org/package=JuniperKernel
- 38b3070edfc018308a0bc3d9604d19d90c430c7c https://cran.r-project.org/package=BayHap
- 38e4aacb06cb218ee7c2fa38ddb16611e426d7bb https://cran.r-project.org/package=aws.sns
- 3a54793134512d29a9ce7c55834ddf1513d7f5ef https://cran.r-project.org/package=bclust
- 3f3d73c171248c7a618c7fd1e7e00f27aa6dde56 https://cran.r-project.org/package=cec2005benchmark
Migrated from T2536 (view on Phabricator)
Edited by Phabricator Migration user