Also, I’m using uptime services to check that my app responds from different regions, and there has not been any downtime, even though the error logs keep appearing.
Is there anything I can do to investigate the cause of these errors? Any insight is appreciated, thanks!
Ah, I meant checking the Grafana logs, not the graphs. There could be some informational or warning log lines there that explain the issue.
(I am not disagreeing with your thesis that the 5xx comes from the proxy, but 9 times out of 10 issues raised in this forum are from user error; I thus think the due diligence is worth it.)
No problem, I can show you an example of the logs too. Unfortunately, as I mentioned, the error appears without any warnings or related errors alongside it. Here’s an example:
As you can see, the other logs around it happen minutes before and after the [PU02] error appears.
The “Blocked” logs are not errors, by the way; they come from a simple middleware I have to reject suspicious requests.
Region is IAD for me. The https://fly.io/ website itself is also very unresponsive for me, taking seconds to load and sometimes failing with timeouts. This is an issue with Fly itself; other websites are fine. I am in Italy.
Also experiencing intermittent failures to establish a connection through the proxy to my healthy and idle backend running in FRA: the caller's request times out without ever reaching the machine. Let me know if any diagnostics/logs can help.
I don’t see any hyper errors for your app in the last hour. It looks like requests came through an edge in bom that was temporarily unable to connect to other servers, but that has resolved.
The app with the issue has always-on websocket connections (it is a Discord bot). The failing requests are intermittent REST HTTP requests. All fail with:
could not complete HTTP request to instance: legacy hyper error: client error (SendRequest) (source: error from user's Body stream)
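To check whether these failures cluster at particular times, the exported logs can be tallied per hour. The following is only a rough sketch: it assumes each log line starts with an ISO 8601 timestamp (the real export format may differ), and `errors_per_hour` is a hypothetical helper name, not part of any Fly tooling.

```python
import re
from collections import Counter

# Assumption: every exported log line begins with an ISO 8601 timestamp,
# e.g. "2024-05-01T12:34:56Z ...". Adjust TIMESTAMP to the real format.
TIMESTAMP = re.compile(r"^(\d{4}-\d{2}-\d{2}T\d{2}):\d{2}:\d{2}")

# Fixed prefix of the proxy error message quoted in this thread.
NEEDLE = "could not complete HTTP request to instance"


def errors_per_hour(lines):
    """Count lines containing NEEDLE, grouped by the hour they were logged."""
    counts = Counter()
    for line in lines:
        if NEEDLE not in line:
            continue
        match = TIMESTAMP.match(line)
        # Lines without a recognizable timestamp are grouped under "unknown".
        counts[match.group(1) if match else "unknown"] += 1
    return counts
```

For example, `errors_per_hour(open("app.log"))` (with `app.log` standing in for wherever the logs were saved) returns a counter keyed by strings such as `2024-05-01T12`, which makes short bursts easy to tell apart from a steady trickle.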
It looks like you have a bunch of users in the APAC region, and routing those requests to the EU can sometimes fail. The error message could be improved, and we’ll look into this.
Can you try deploying some apps in sin or bom, which are closer to those users, and see if this helps?
Hi @somt0xhi, currently my database is located in the FRA region, so using machines in distant regions could affect my app’s performance. Is there any alternative?
Also, what’s the cause behind the issues routing from the APAC region to the EU? Is this specifically a Fly proxy issue, or could it still be related to my app?
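Independent of where the machines run, callers sometimes work around intermittent failures like this by retrying idempotent requests with exponential backoff. The sketch below is a general client-side technique, not a fix suggested by Fly in this thread; `fetch_with_retry` and all of its parameters are hypothetical names, and it uses only the Python standard library.

```python
import time
import urllib.error
import urllib.request


def fetch_with_retry(url, attempts=4, base_delay=0.5, timeout=10,
                     opener=urllib.request.urlopen, sleep=time.sleep):
    """GET `url`, retrying on 5xx responses and connection errors.

    `opener` and `sleep` are injectable so the function can be exercised
    without real network access or real waiting.
    """
    for attempt in range(attempts):
        last_attempt = attempt == attempts - 1
        try:
            with opener(url, timeout=timeout) as response:
                return response.read()
        except urllib.error.HTTPError as exc:
            # 4xx means the request itself is wrong; retrying will not help.
            if exc.code < 500 or last_attempt:
                raise
        except (urllib.error.URLError, TimeoutError, ConnectionError):
            if last_attempt:
                raise
        # Exponential backoff: base_delay, then 2x, 4x, ... between attempts.
        sleep(base_delay * 2 ** attempt)
```

This only suits requests that are safe to repeat (typically GET). It hides the symptom from callers but does not address whatever is failing between the edge and the machine.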
I am facing the same issue, and it seems to have been ongoing for quite some time. Attaching a screenshot of the errors from the last 24 hours. Like @oteiza-a mentioned, there are no other error messages or warnings along with the main error log, and there is nothing in the graphs either.
Can you please help me too? This is likely the root cause of API call failures we have been experiencing and I would like to debug and fix it asap.