Detect and index intrinsic licence information at the project level
Well maintained software projects include proper metadata information about the licence they use, either through a LICENCE or COPYING file at the toplevel or (non exclusive) following the REUSE specification (see https://reuse.software/spec/#license-files).
We want to detect and index this metadata information, and associate it to the proper level (TBD: directory, revision, release, origin, let's discuss the best approach).
In the end, we want to have the ability, for example, to show something like what is found on https://git.sr.ht/~gregkh/usbutils (sourcehut knows about REUSE) or https://github.com/rdicosmo/updateswh (but better than that: a project may have multiple licenses).