Between 17:30-20:00 PST on the 28th, and again 07:00-10:00 PST 29th (today), One of my postgres machines crawled up to 100% volume and CPU usage then crashed.
This was fixed once I restarted the machine, but I’m freaked out that it will happen again and I’ll end up with another production crash.