Commits · 3892a9c954f2ad6daee88257e91ecada50119a68 · Platform / Infrastructure / CI CD / Helm charts for swh packages · GitLab

Snippets Groups Projects

Jan 26, 2024
- production/storage: Migrate objstorage multiplexer to use banco rpc · 3892a9c9
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5226
  3892a9c9
- production/storage: Deploy banco objstorage pathslicing · 14f6ab7a
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5226
  14f6ab7a
- v237: Release swh.loader.mercurial v3.5.1 · 6bbe7caf
  Jenkins for Software Heritage authored 1 year ago
  
  6bbe7caf
- production/storage: Migrate remaining writing workload to dynamic rpc · f527ab55
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  f527ab55
- production: Tweak autoscaling of vault cookers · 9f9d74e7
  Nicolas Dandrimont authored 1 year ago
  
  As the intrinsic parallelism has increased, decrease the extrinsic parallelism. ackLate allows us to disable "stopWhenNoActivity", and to let autoscaling do its work.
  9f9d74e7
- swh/production: Use AWS as main read-only objstorage backend · 4e346585
  Vincent Sellier authored 1 year ago and Nicolas Dandrimont committed 1 year ago
  
  and remove unnecessary indirections
  4e346585
- swh/production: Remove retry from read-only storage rpc backend · aa0ebd90
  Vincent Sellier authored 1 year ago
  
  aa0ebd90
- swh/storage: Support empty pipeline · 4dc63060
  Vincent Sellier authored 1 year ago
  
  4dc63060
- production/web-archive: Use tcp liveness probe · f3c61232
  Antoine R. Dumont authored 1 year ago and Vincent Sellier committed 1 year ago
  
  It seems to fail the same way the current storage rpc does.
  f3c61232
- production/storage-rpc: Use tcp liveness probe · 9001d7f0
  Antoine R. Dumont authored 1 year ago and Vincent Sellier committed 1 year ago
  
  That should avoid having cascading effect. When workers are too busy to handle that probe, the http liveness probe fails, this ends up restarting the pod, in effect, killing the ongoing requests. https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-startup-probes/#define-a-tcp-liveness-probe Refs. swh/infra/sysadm-environment#5215
  9001d7f0
- swh/web: Use utils image to generate the config and add pod ip to allowed_hosts · 7446cf0d
  Vincent Sellier authored 1 year ago
  
  It's mandatory to allow prometheus to scrape the metrics directly from the pod. Related to swh/infra/sysadm-environment#5227
  7446cf0d
- swh/webapp: Add webapp metrics scraping configuration · a96340dd
  Vincent Sellier authored 1 year ago
  
  Limit the scraping to 1 pod as the metrics are the same for all pods Related to swh/infra/sysadm-environment#5227
  a96340dd
Jan 25, 2024
- production storage: reduce autoscaler CPU percentage · 6b72e826
  Nicolas Dandrimont authored 1 year ago
  
  CPU isn't a great proxy for what we actually need (busy gunicorn workers), but that's a start...
  6b72e826
- cookers: use envsubst instead of shell expansion · 04473a37
  Nicolas Dandrimont authored 1 year ago
  
  04473a37
- vault: fix the graph configuration · 4c6307b2
  Nicolas Dandrimont authored 1 year ago
  
  4c6307b2
- storage: ensure autoscaling is properly configured for all storages · a745dab5
  Nicolas Dandrimont authored 1 year ago
  
  The space chopping was concatenating the yaml files together...
  a745dab5
- v236: Release swh.vault v1.12.1 · 092b020a
  Jenkins for Software Heritage authored 1 year ago
  
  092b020a
- v235: Release swh.loader.cvs v0.8.2 · 04ebc3ba
  Jenkins for Software Heritage authored 1 year ago
  
  04ebc3ba
- v234: Release swh.vault v1.12.0 · 1f1dd394
  Jenkins for Software Heritage authored 1 year ago
  
  1f1dd394
- objstorage: ensure client_max_size is set at the proper level · ff4da73d
  Nicolas Dandrimont authored 1 year ago
  
  client_max_size is actually set at the toplevel of the swh configfile, not within the objstorage.
  ff4da73d
- staging: keep a small thread pool size for cookers · 97fcb0dd
  Nicolas Dandrimont authored 1 year ago
  
  The staging objstorage isn't especially fast
  97fcb0dd
- cookers: allow configuring thread pool size · 1ee490e0
  Nicolas Dandrimont authored 1 year ago
  
  1ee490e0
- cookers: allow configuring max bundle size · 788b7055
  Nicolas Dandrimont authored 1 year ago
  
  788b7055
- cookers: fix extra whitespaces · c66c6267
  Nicolas Dandrimont authored 1 year ago
  
  c66c6267
- cookers: add direct objstorage configuration · 3e97b88f
  Nicolas Dandrimont authored 1 year ago
  
  3e97b88f
- v233: Release swh.vault v1.12.0 · 0803d62d
  Jenkins for Software Heritage authored 1 year ago and Nicolas Dandrimont committed 1 year ago
  
  0803d62d
- swh/production: Reduce winery batch size · 098c9742
  Vincent Sellier authored 1 year ago
  
  The network looks very slow this days. The consumers fall in timeout and rafke ebalance all the consumers each time it appends. Related to swh/infra/sysadm-environment#5187
  098c9742
- production/storage: Use similar config as before (per replica) · dbd9dfd4
  Antoine R. Dumont authored 1 year ago
  
  Use 32 workers with 1 thread. Refs. swh/infra/sysadm-environment#5215
  dbd9dfd4
- production/storage: Use autoscaling · e0773da9
  Antoine R. Dumont authored 1 year ago
  
  For now, we don't know yet where it will settle and current 4 replicas does not follow with half our workers. Refs. swh/infra/sysadm-environment#5215
  e0773da9
- production: Migrate loader-cvs to dynamic rpc storage · c2fd13cb
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  c2fd13cb
- production/loader-cvs: Bump requests resources according to use · 5f3b4b27
  Antoine R. Dumont authored 1 year ago
  
  5f3b4b27
- prod: Migrate loader-metadata to dynamic rpc storage · 620c6c8c
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  620c6c8c
- production: Decommission deprecated nixguix loader · 3f18e27f
  Antoine R. Dumont authored 1 year ago
  
  The last runs have finished so we can stop and decommission them. Refs. swh/infra/sysadm-environment#5223
  3f18e27f
- production/storage: Bump requests usage to saam rpc · dd474931
  Antoine R. Dumont authored 1 year ago
  
  We migrated only around half the writers and the requests usage for both cpu and memory is already reached. Refs. swh/infra/sysadm-environment#5215
  dd474931
- prod: Migrate loader-{git,add-forge-now-slow} to dynamic storage · 7dddd062
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  7dddd062
- prod: Migrate loader-{save-code,add-forge}-now to dynamic storage · 762f4c67
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  762f4c67
- prod: Fix storage read-write rpc ingress name · ba622545
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  ba622545
- prod: Migrate loader-directory to use dynamic storage rw instance · 1e0d0ec2
  Antoine R. Dumont authored 1 year ago
  
  The new one running on saam. Test it on one loader to incrementally check everything is fine. Refs. swh/infra/sysadm-environment#5215
  1e0d0ec2
- production: Rename saam storage configuration as legacy · 7f9e0944
  Antoine R. Dumont authored 1 year ago
  
  This will soon be migrated. Refs. swh/infra/sysadm-environment#5215
  7f9e0944
- loaders: Allow specific storage configuration per instance · 7d16b40a
  Antoine R. Dumont authored 1 year ago
  
  Refs. swh/infra/sysadm-environment#5215
  7d16b40a