Fly Postgres down and no recover

markoskt · May 16, 2024, 6:17pm

Hi, I have a Postgres cluster with 3 nodes in the GRU region and today it seems that one of the nodes went down and wasn’t able to recover.

It seems to me that it was. Fly.io network error.

I thought the 2 replicas would recover automatically on these situations. But my application is down because of this

john-fly · May 16, 2024, 6:23pm

Hi Markost, can you give some more details about this? Name of the app? I can’t find a Fly.io user associated with your forum username in our system.

markoskt · May 16, 2024, 6:44pm

@john-fly the app name is spot-leiloes-db-prod.

john-fly · May 16, 2024, 6:52pm

I see. We were rebalancing Machines on our host servers in GRU; I’m not sure why you’re experiencing downtime because it looks like two of your cluster are running at any point. My first reaction is to scale up to 4 for the moment to try and regain quorum. Try that while I investigate more.

markoskt · May 16, 2024, 7:07pm

@john-fly the problem is that I am way from computer now. Why the app is not connecting on the health node? Is fly moved that from a replica to master?

Is there anything you can do for me here?

The app name is spot-leiloes-graphql-prod and it’s unable to connect to the database.

Shouldn’t fly advise for this rebalance work in advance so we could be prepared?

john-fly · May 16, 2024, 7:12pm

I am now totally focus on this app and am trying to fix it for you; I’ll answer your other questions once the DB is back up.

markoskt · May 16, 2024, 7:13pm

Thank you, appreciate that

john-fly · May 16, 2024, 8:00pm

Can you see if you’re back online?

markoskt · May 16, 2024, 8:04pm

@john-fly it’s not. Seeing the same erros in the live logs as well, Db is still dowb

john-fly · May 16, 2024, 8:08pm

No, I mean your frontend app; forget that DB app for now; is your fronted app working?

markoskt · May 16, 2024, 8:21pm

@john-fly No. it’s not. The app can’t connect to the database. It seems something worse.
I am at the computer now, and I can’t connect to the database either by using the fly proxy.

markoskt · May 16, 2024, 8:22pm

john-fly · May 16, 2024, 8:27pm

Can I email you at the email on file with your account?

markoskt · May 16, 2024, 8:28pm

Yes

markoskt · May 20, 2024, 11:22am

@john-fly Please let me know when you are available to help me downgrade the Database instance from 8gb to 2gb as we have discussed and also the credit regarding the incident as I think this new instance will increase the monthly bill considerably. Thank you.

john-fly · May 20, 2024, 9:41pm

Hi Marco, no I didn’t forget; I sorry I just had some other things I needed to do first. I’ve just sent you an email following up on all the points we talked about.

system · May 27, 2024, 9:41pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
GRU - Database error postgres	5	233	December 23, 2022
Postgres troubles. My app stopped :( Questions / Help postgres	1	130	June 14, 2024
Unable to connect to my postgres instance Questions / Help postgres	6	585	February 22, 2023
Postgres troubles postgres	13	472	June 18, 2024
DB Connection issues elixir , postgres	18	2470	October 10, 2022

Fly Postgres down and no recover

Related topics