Snippets Groups Projects

Filter provenance pipeline to reduce the index volume

Define the heuristics of input filters for provenance index.

Proposed options are:

Process only tags/releases
Exclude epoch +/- a given range
Apply contents sizes filter ranges
Exclude "too popular" contents (number of occurrences)
Mime types filtering
File names

For each of these options:

Identify the data source to query to get the metric
Define the values to apply for a first iteration
Implement the filter handling in provenance

Edited 2 years ago

To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information

Child items 0

No child items are currently assigned. Use child items to break down this issue into smaller parts.

Activity

Benoit Chauvet changed milestone to %Provenance in production [Roadmap - Tooling and infrastructure] 2 years ago

changed milestone to %Provenance in production [Roadmap - Tooling and infrastructure]
Benoit Chauvet added activity::Deployment roadmap_import labels 2 years ago

added activity::Deployment roadmap_import labels
Benoit Chauvet removed roadmap_import label 2 years ago

removed roadmap_import label
Benoit Chauvet removed activity::Deployment label 2 years ago

removed activity::Deployment label
Benoit Chauvet added activity::Epic label 2 years ago

added activity::Epic label
Benoit Chauvet changed title from Filter provenance pipeline to process only tags and releases to Filter provenance pipeline to reduce the index volume 2 years ago

changed title from Filter provenance pipeline to process only tags and releases to Filter provenance pipeline to reduce the index volume
Benoit Chauvet changed the description 2 years ago

changed the description
Benoit Chauvet added priority:High label 2 years ago

added priority:High label
Benoit Chauvet added #4984 as child task 2 years ago

added #4984 as child task
Benoit Chauvet added #4985 as child task 2 years ago

added #4985 as child task
Benoit Chauvet added #4986 as child task 2 years ago

added #4986 as child task
Benoit Chauvet marked this issue as related to #4924 2 years ago

marked this issue as related to #4924
Benoit Chauvet mentioned in issue #4924 2 years ago

mentioned in issue #4924
Benoit Chauvet assigned to @douardda 2 years ago

assigned to @douardda
Benoit Chauvet added #4987 as child task 2 years ago

added #4987 as child task
Benoit Chauvet removed priority:High label 1 year ago

removed priority:High label

Please register or sign in to reply