  • #3357
Closed
Open
Issue created Jun 02, 2021 by Vincent Sellier (@vsellier, Owner). 10 of 12 checklist items completed.

Perform some tests of the Cassandra storage on Grid5000

To test the behavior of a Cassandra cluster during normal operations (overall performance on bare-metal servers, impact of node maintenance, rebalancing, ...), we should run some tests on the Grid5000 infrastructure.

The POC will be split into several phases:

  • Prepare scripts to build the environment and run small iterations to validate that the tests can be run with interruptions
    • Validate how the data will be kept between 2 cluster restarts
    • Have generic scripts that can configure the cluster according to different hardware (memory / CPU / SSD, SATA or mixed / number of nodes / ...)
    • Add a monitoring stack to measure the cluster behavior
  • Import a dataset big enough to be representative of reality (probably during the night or a weekend)
    • Define the minimal target to reach to consider the dataset representative
  • Perform some benchmarks, check the behavior and the performance impact during normal operations
  • Compare ScyllaDB and vanilla Cassandra performance
  • Authentication to allow read-only (r/o) access only
  • [option] Test backfilling an empty journal from Cassandra. The backfill is based on SQL queries and highly coupled with PostgreSQL.

The final goals of the experiment are to:

  • Define the minimal cluster size needed to maintain correct performance during maintenance operations / node failures => 5 nodes are recommended: with only 3 nodes, an incident puts too much pressure on the remaining nodes (run + recovery)
  • Possibly test the performance on the different hardware configurations provided by Grid5000
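
As a rough back-of-the-envelope illustration of the cluster-size recommendation, the sketch below estimates how much extra "run" load each remaining node takes when one node is down. It assumes traffic is spread evenly across nodes and ignores replication factor, consistency level and the cost of recovery streaming, none of which are specified in this issue. The function name is hypothetical.

```python
def load_increase_per_survivor(total_nodes: int, failed_nodes: int = 1) -> float:
    """Relative extra load on each surviving node.

    Assumption (not from the issue): load is spread evenly across nodes
    and is redistributed evenly over the survivors after a failure.
    """
    if not 0 <= failed_nodes < total_nodes:
        raise ValueError("failed_nodes must be in [0, total_nodes)")
    survivors = total_nodes - failed_nodes
    return total_nodes / survivors - 1.0


if __name__ == "__main__":
    for n in (3, 5):
        extra = load_increase_per_survivor(n)
        print(f"{n} nodes, 1 down: +{extra:.0%} load per remaining node")
```

Under these assumptions, losing one node out of 3 adds 50% load on each survivor, against 25% with 5 nodes, and that is before counting the recovery traffic itself. The benchmarks on Grid5000 remain the actual source of truth for sizing.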

Migrated from T3357 (view on Phabricator)

Edited Oct 18, 2022 by Vincent Sellier