List interesting origins for the content provenance information prototype
List GitHub repositories sorted by stars, until we fill up disk spa^W^W^W^W have a reasonable number of them (10k repos sounds like a good start).
Relevant API: https://api.github.com/search/repositories?q=stars:%3E=1000&sort=stars&order=desc
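A minimal sketch of how the listing could work, assuming plain unauthenticated access through the Python standard library. The search API serves at most 1000 results per query, so reaching 10k repos means re-querying with a narrower star range each time a window is exhausted. The sketch passes `sort=stars` and `order=desc`, since `order` only selects the direction. The names `list_top_repos`, `search_url`, `fetch_json` and the `pause` parameter are hypothetical, and the pause value is a guess at staying under the search rate limit, not a documented figure.

```python
import json
import time
import urllib.parse
import urllib.request

API = "https://api.github.com/search/repositories"
PER_PAGE = 100        # largest page size the search API accepts
MAX_PER_QUERY = 1000  # the search API serves at most 1000 results per query


def search_url(query, page):
    """Build one search URL, most-starred repositories first."""
    params = {"q": query, "sort": "stars", "order": "desc",
              "per_page": PER_PAGE, "page": page}
    return API + "?" + urllib.parse.urlencode(params)


def fetch_json(url):
    """Fetch a URL and decode its JSON body (unauthenticated, so rate-limited)."""
    req = urllib.request.Request(
        url, headers={"Accept": "application/vnd.github+json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def list_top_repos(limit=10_000, min_stars=1000, fetch=fetch_json, pause=6.0):
    """Return up to `limit` repository dicts, most-starred first.

    `pause` (seconds between requests) is an assumed value, not a documented one.
    """
    repos, seen = [], set()
    query = f"stars:>={min_stars}"
    while len(repos) < limit:
        new_in_window, lowest, exhausted = 0, None, False
        for page in range(1, MAX_PER_QUERY // PER_PAGE + 1):
            items = fetch(search_url(query, page)).get("items", [])
            for item in items:
                lowest = item["stargazers_count"]
                if item["id"] not in seen:  # windows overlap at the boundary
                    seen.add(item["id"])
                    repos.append(item)
                    new_in_window += 1
            if len(items) < PER_PAGE:
                exhausted = True  # a short page means no more matches
                break
            if len(repos) >= limit:
                break
            time.sleep(pause)
        if exhausted or new_in_window == 0:
            break
        # Next window: everything at or below the lowest star count seen so far.
        query = f"stars:{min_stars}..{lowest}"
    return repos[:limit]
```

Calling `list_top_repos(limit=10_000)` would then return the repository records in descending star order; each record carries fields such as `full_name`, `clone_url` and `stargazers_count`. If more than 1000 repositories ever shared a single star count, the windowing would stop early rather than loop forever.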
Migrated from T551 (view on Phabricator)