Postgres DBs throwing alerts

bekit · September 20, 2021, 9:35pm

Our Postgres DBs in two different apps are going back and forth between alerting and okay repeatedly. It started in both apps at the same time. The errors are:

HTTP GET http://<IPv4-ADDRESS>:5500/flycheck/pg: 500 Internal Server Error Output: "[✗] proxy: context deadline exceeded"

Followed usually within a few seconds with a success:

HTTP GET http://<IPv4-ADDRESS>:5500/flycheck/pg: 200 OK Output: "[✓] replication: currently leader
[✓] proxy check: [<IPv6-ADDRESS>]:5432 connected
[✓] connections: 15 used, 3 reserved, 300 max"

It’s been happening every few minutes for about the last hour and a half.

Is there something I need to do to remedy this, or is it an issue on Fly’s end?

sanswork · September 20, 2021, 9:40pm

I’ve been seeing regular postgres errors for the past hour at least as well. tcp connections being force closed then connection refused for a few seconds then back like the server is cycling regularly.

kurt · September 20, 2021, 9:44pm

These both look like rate limiting issues connecting to the shared consul service. What regions is your DB running in? We’re investigating.

sanswork · September 20, 2021, 9:45pm

yyz here

bekit · September 20, 2021, 9:46pm

ord for me

kurt · September 20, 2021, 9:57pm

See if it’s better now? We increase the rate limits on Consul. It seems like they were near the cusp but consul is otherwise super happy.

bekit · September 20, 2021, 10:03pm

I saw a round of errors about 15 minutes ago. I’ll keep an eye on it and let you know.

kurt · September 20, 2021, 10:04pm

That fits, here’s what the response graph looks like for the consul your DBs are using:

The burst of errors there were when we updated the config, then traffic flattened back out (traffic to this should be really flat).

sanswork · September 20, 2021, 10:20pm

Seems to be all good here now. Thank you.

bekit · September 20, 2021, 10:48pm

Yep, been stable for the last hour here as well. Thanks.

bekit · October 2, 2021, 5:43pm

This seems to be back, though not as severe. I’ve seen 15+ instances of the error so far today on two different apps in ORD. @kurt is this same issue, or something new?

Topic		Replies	Views
Postgres database apps are crashing again	22	1208	October 25, 2022
Postgres "failed to connect to proxy: context deadline exceeded" Questions / Help postgres	19	2233	October 14, 2023
Postgres "server misbehaving" error messages postgres	8	2090	October 26, 2022
Fly Postgres proxy issues and dropped connections(started 2 hours ago) Questions / Help postgres	2	513	December 17, 2022
Deploys failing LHR, postgres proxy failures, intermittent db connection issues	5	468	October 20, 2022

Postgres DBs throwing alerts

Related topics