Fly Postgres v1 replication errors after OOM

juliancarrivick · May 7, 2023, 3:04am

For completeness, I didn’t ever figure exactly what went wrong, but I ended up fixing it by recreating the volume:

Figured out which volume was the one in use: fly volumes list
Created a new one to be used fly volumes create pg_data --region syd --size 10
Scaled back up (and luckily the new VM used the new volume): fly scale count 2
Deleted the dud volume: fly volumes destroy $vol_id

I was then still left with the no keeper info available logs, but according to this post, this is fixed in a newer postgres image. So I updated using fly image update and hopefully they will stop after ~48 hours.

Topic		Replies	Views
Postgres (PG) Database (DB) issue: "checking stolon status" and "Error opening connection to database" Questions / Help postgres	4	428	May 4, 2023
No Keeper Available, runaway WAL Questions / Help postgres	12	1149	July 31, 2023
WARN cmd/sentinel.go:276 no keeper info available	6	703	February 19, 2024
Postgres health checks perpetually failing Questions / Help postgres	3	1062	March 2, 2023
Postgres crashed - cannot restart, restore or fork Questions / Help postgres	5	413	November 10, 2023

Fly Postgres v1 replication errors after OOM

Related topics