Ingest git-arr
Summary:
A lister is needed for git-arr, a simple web frontend for git repositories.
Unfortunately, git-arr sites carry no easily searchable markers.
The front page has an h1 with content "git", then a table with class "projects". The first tr has two th tags with "project" and "description" content. The following tr tags have td tags with classes "name" and "desc", and both contain a tags with repo URLs in the href attribute. The last td has a span with class "age" whose text content is an integer representing the Unix time_t of the last update to the repository.
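The front-page layout described above could be scraped roughly as follows. This is only a sketch using the Python standard library, not the eventual lister: the names ProjectsPageParser and list_repos are hypothetical, and it assumes the href values may be relative to the front page URL.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ProjectsPageParser(HTMLParser):
    """Collect repo URLs and last-update times from a git-arr front page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.in_table = False      # inside <table class="projects">
        self.in_name_cell = False  # inside <td class="name">
        self.in_age = False        # inside <span class="age">
        self.current = None        # row being built
        self.repos = []

    def _flush(self):
        # Keep only rows that yielded a URL (this skips the th header row).
        if self.current and self.current["url"]:
            self.repos.append(self.current)
        self.current = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "table" and "projects" in classes:
            self.in_table = True
        elif not self.in_table:
            return
        elif tag == "tr":
            self._flush()
            self.current = {"url": None, "last_update": None}
        elif tag == "td":
            self.in_name_cell = "name" in classes
        elif tag == "a" and self.in_name_cell and self.current is not None:
            href = attrs.get("href")
            if href:
                # Assumption: hrefs may be relative to the front page.
                self.current["url"] = urljoin(self.base_url, href)
        elif tag == "span" and "age" in classes:
            self.in_age = True

    def handle_data(self, data):
        text = data.strip()
        if self.in_age and self.current is not None and text.isdigit():
            self.current["last_update"] = int(text)

    def handle_endtag(self, tag):
        if tag == "span":
            self.in_age = False
        elif tag == "td":
            self.in_name_cell = False
        elif tag == "tr":
            self._flush()
        elif tag == "table":
            self._flush()
            self.in_table = False


def list_repos(html, base_url):
    """Return a list of {"url": ..., "last_update": ...} dicts."""
    parser = ProjectsPageParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.repos
```

The last_update integer could then be converted to a datetime to support incremental listing, if that turns out to be useful.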
The repo pages have body class "summary", with a table of class "repo_info". Each tr in the table has different info. The first td has class "category" and can be used to differentiate web links (content "website") from clone URLs (content "git clone"). The second td has the website URL, or one or more clone URLs separated by br tags.
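Extracting the website and clone URLs from a repo page might look like the sketch below. It uses only the Python standard library. The names RepoInfoParser and parse_repo_info are hypothetical, and it assumes each URL appears as text in the second td (whether or not it is also wrapped in an a tag).

```python
from html.parser import HTMLParser


class RepoInfoParser(HTMLParser):
    """Map each category in the repo_info table to its list of values."""

    def __init__(self):
        super().__init__()
        self.in_table = False  # inside <table class="repo_info">
        self.cell = None       # "category", "value", or None
        self.category = ""     # category text of the current row
        self.info = {}

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "table" and "repo_info" in classes:
            self.in_table = True
        elif self.in_table and tag == "tr":
            self.category = ""
        elif self.in_table and tag == "td":
            self.cell = "category" if "category" in classes else "value"

    def handle_data(self, data):
        text = data.strip()
        if not text or self.cell is None:
            return
        if self.cell == "category":
            self.category = (self.category + " " + text).strip()
        elif self.category:
            # br tags split the cell text into separate chunks,
            # so each clone URL arrives as its own piece of data.
            self.info.setdefault(self.category, []).append(text)

    def handle_endtag(self, tag):
        if tag == "td":
            self.cell = None
        elif tag == "table":
            self.in_table = False


def parse_repo_info(html):
    """Return e.g. {"website": [...], "git clone": [...]}."""
    parser = RepoInfoParser()
    parser.feed(html)
    parser.close()
    return parser.info
```

A loader would then pick one of the "git clone" entries, for example preferring an https URL when several are listed.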
I was only able to find one instance so far:
https://blitiri.com.ar/git/
Plan:
- Implement lister
- Implement loader
- docker: Run lister
- docker: Run loader
- Document lister
- Document loader
- Deploy on staging
- Call for public review
- Deploy on production