I got an email from my monitoring solution that my site is down. Cloudflare confirmed the same.
I logged in and saw the following errors in the app:
2022-02-21T11:27:54.929 app[b0a5866f] ord [info] 2022/02/21 11:27:54 ping db: failed to connect to `host=redacted.internal user=redacted database=redacted`: failed to receive message (unexpected EOF)
2022-02-21T11:27:55.829 app[b0a5866f] ord [info] Main child exited normally with code: 1
2022-02-21T11:27:55.830 app[b0a5866f] ord [info] Starting clean up.
So I checked the database, and Fly is reporting it as healthy and normal.
The database logs show nothing unusual either:
2022-02-21T11:37:45.992 app[9db2cae4] ord [info] keeper | 2022-02-21T11:37:45.992Z INFO cmd/keeper.go:1557 our db requested role is standby {"followedDB": "909c9634"}
2022-02-21T11:37:45.993 app[9db2cae4] ord [info] keeper | 2022-02-21T11:37:45.992Z INFO cmd/keeper.go:1576 already standby
2022-02-21T11:37:46.015 app[9db2cae4] ord [info] keeper | 2022-02-21T11:37:46.014Z INFO cmd/keeper.go:1676 postgres parameters not changed
2022-02-21T11:37:46.015 app[9db2cae4] ord [info] keeper | 2022-02-21T11:37:46.015Z INFO cmd/keeper.go:1703 postgres hba entries not changed
2022-02-21T11:37:49.214 app[3bd4ed4d] ord [info] keeper | 2022-02-21T11:37:49.214Z INFO cmd/keeper.go:1505 our db requested role is master
2022-02-21T11:37:49.215 app[3bd4ed4d] ord [info] keeper | 2022-02-21T11:37:49.215Z INFO cmd/keeper.go:1543 already master
2022-02-21T11:37:49.243 app[3bd4ed4d] ord [info] keeper | 2022-02-21T11:37:49.243Z INFO cmd/keeper.go:1676 postgres parameters not changed
2022-02-21T11:37:49.244 app[3bd4ed4d] ord [info] keeper | 2022-02-21T11:37:49.243Z INFO cmd/keeper.go:1703 postgres hba entries not changed
I deployed the app just over a month ago and it was running fine until about 15 hours ago.
I see others are facing a similar issue. Is there anything else that can be done here?