- Nov 26, 2018
-
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
Prior to this commit, this was passing along real instance. This did not bode well with the __init__ call which expects configuration dict.
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
Possibly there to satisfy tests so to remove.
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
Indexer storage expects identifier as bytes
-
Antoine R. Dumont authored
-
vlorentz authored
Reviewers: #reviewers, ardumont Reviewed By: #reviewers, ardumont Subscribers: ardumont, swh-public-ci Differential Revision: https://forge.softwareheritage.org/D707
-
- Nov 23, 2018
-
-
vlorentz authored
-
vlorentz authored
Make the OriginMetadataIndexer fetch rev metadata from the storage instead of getting them via the scheduler. Summary: Depends on D704 Reviewers: #reviewers, ardumont Reviewed By: #reviewers, ardumont Subscribers: swh-public-ci Differential Revision: https://forge.softwareheritage.org/D705
-
vlorentz authored
Reviewers: #reviewers, ardumont Reviewed By: #reviewers, ardumont Subscribers: swh-public-ci Differential Revision: https://forge.softwareheritage.org/D704
-
vlorentz authored
-
David Douard authored
related to T1330
-
vlorentz authored
-
vlorentz authored
Summary: Also fixes docstrings. Reviewers: #reviewers, ardumont Reviewed By: #reviewers, ardumont Subscribers: ardumont, swh-public-ci Differential Revision: https://forge.softwareheritage.org/D702
-
vlorentz authored
Summary: I made several choices while writing that Diff that are open to discussion: - Using GIN instead of GiST. That seems the most appropriate choice when reading https://www.postgresql.org/docs/9.1/textsearch-indexes.html ; but these part got removed from the doc of pgsql 10: https://www.postgresql.org/docs/10/textsearch-indexes.html - Using `pg_catalog.simple` as dictionary. Since we're dealing with any language and proper names, it seemed best to use a dictionary with no stop word. Though, arguably, most of the data will be English, and stop words usually don't appear in names. - It only supports conjunctions of search terms. I could easily add support for arbitrary levels of nestings and disjunctions/negations. That can be done later if we deem it worth it. - It indexes JSON keys too. It is probably possible to fix this, at the expanse of complicated SQL code, or some postprocessing in Python. Resolves T1334 and T1335. Test Plan: tox Reviewers: #reviewers, ardumont, olasd Reviewed By: #reviewers, ardumont, olasd Subscribers: olasd, ardumont, swh-public-ci Maniphest Tasks: T1335, T1334 Differential Revision: https://forge.softwareheritage.org/D658
-
- Nov 21, 2018
-
-
Antoine R. Dumont authored
Related T1326
-
Antoine R. Dumont authored
Related T1326
-
- Nov 20, 2018
-
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
And also be consistent with docstring.
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
Move the decoding steps in test where it belongs
-
Antoine R. Dumont authored
Related T991
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
So that tests that requires tests fixture without hypothesis does not break. Related T1354
-
- Nov 19, 2018
-
-
Antoine R. Dumont authored
- mimetype: no filtering is done. - fossology_license: Filtering to only compute on textual contents. Related T1312
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
The debian stable packaged version hypothesis 3.6.1-1 holds a bug that prevents the indexer tests from running [1]. That's fixed in earlier version. [1] https://github.com/HypothesisWorks/hypothesis/issues/290 Related P334
-
Antoine R. Dumont authored
-
- Nov 17, 2018
-
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
Antoine R. Dumont authored
-
- Nov 16, 2018
-
-
Antoine R. Dumont authored
-