Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • S sysadm-environment
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 167
    • Issues 167
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Platform
  • Infrastructure
  • sysadm-environment
  • Issues
  • #4612
Closed
Open
Issue created Oct 07, 2022 by vlorentz@vlorentzMaintainer

Most indexers are consuming journal topics slower than messages are produced

Grafana dashboards show that most indexer consumers are working, but they are lagging behind, and the gap is increasing.

  • https://grafana.softwareheritage.org/goto/RJkpEXVVz?orgId=1 origin-intrinsic-metadata (on origin_visit_status topic)
  • https://grafana.softwareheritage.org/goto/c_iYPX4Vz?orgId=1 content-fossology-license (on content topic)
  • https://grafana.softwareheritage.org/goto/g1VPPX4Vz?orgId=1 content-mimetype (on content) topic
  • however, the extrinsic-metadata indexer is fine: https://grafana.softwareheritage.org/goto/3YQsPX44z?orgId=1

The plot of lag derivative shows a progressive slowdown, so it's probably not due to a specific configuration change.

I do not know what is causing this, though. Two possible suspects:

  1. rdkafka frequently disconnecting from the brokers (or generally, having connection isues): https://sentry.softwareheritage.org/share/issue/76ed328b2ae6465face2ea4bb5f32187/
  2. slow storage and/or objstorage (which would make sense, as the extrinsic-metadata indexer is super-fast, and is also the only one not to use the storage and objstorage)
  3. simply not having enough workers

Migrated from T4612 (view on Phabricator)

Edited Oct 18, 2022 by vlorentz
Assignee
Assign to
Time tracking