Hi there,
since months we have on and off issues with deploying our app inside Fly. Sometimes it bugs for hours where it cannot connect from the release command to our database.
Our machines all connect to our Tailscale network to get access. In the past it worked perfectly, but then it started getting more more apparent. Today the issue persists since hours. But the thing is - all running machines work ok. There is no problem with their connection to the database server. I can even scale more machines.
I added a pg_isready
to our entrypoint now and it seems like the release command firecracker cannot get a connection at all (container is waiting for a connection since half an hour now).
I’d have loved to analyze this issue further now, but I was surprised to see that I cannot ssh into a release instance? I really don’t know what else to debug now. The machine itself is pingable inside my tailscale network.
I hope somebody can help me here…
P.S: If I clone the release machine, it has the exact same problem