dbreplica1 2018-06-30 event postmortem

dbreplica1.euwest.azure.internal.softwareheritage.org stopped responding around 04:00 UTC on 2018-06-30.

Monitoring data show an increase in running threads, processes, and slab cache usage, as well as an I/O wait peak, immediately before that time.

Most likely, the VM was not appropriately sized and could not handle a load spike.


Migrated from T1127 (view on Phabricator)