db-host (4 vCPU) is paging alerts: load average ~8 and the database is crawling. The on-call's first instinct is "the CPU is melting — find the hog and kill it." But load is not the same as CPU. Work out what's actually saturated, identify the real source, and name the fix — without killing the wrong thing.
db-host, Debian 12, 4 vCPU.node API, a Postgres instance, and an hourly backup job.You've solved it when:
top's header: load ~8 on 4 cores, BUT %Cpu(s) shows lowus and high wa (I/O wait) with the CPU mostly idle — so this
is NOT CPU saturation.
top, or ps -eo pid,%cpu,stat,comm)rsync backup plus the Postgres backends blocked behind it.
You've rejected the node process (STAT R, ~12% CPU) as the decoy it is.
ionice the runaway