Increased HTTP response times after June 11 2025

warui · June 12, 2025, 7:52pm

I noticed that my machine on fly.io response times spiked from 200ms to 10 seconds

My machine is in iad and no modifications have been done to the code of late.

Increasing cpu does little to help.

Could anybody else be experiencing this and what could be the issue?

halfer · June 12, 2025, 10:40pm

What language/framework do you have in your machine? Just wondering whether there could be some issues in your app.

Have you tried shelling in and timing some local HTTP calls? What kinds of timings do you get if you spin up a blank machine in the same region and in the same app, and access your app over private networking?

warui · June 13, 2025, 5:04am

I’m using django.

No apparent issues on the app as logs show ms response times.

At the moment, it seems to be an issue with fly.io routing because this morning the response time seem to be dropping back to ms.

The image shows response times for the last 6 hours. Today morning they’ve suddenly* dropped again.

I have not done any deployment whatsoever.

But why for two days?

Simply jumping from 300ms to 10s response times is very serious

Or should I move to a dedicated ipv4 address?

halfer · June 13, 2025, 8:03am

Sure, but there’s external factors within your control that could cause this; a badly-tuned database is one. I do agree that it is probably Fly networking, but it’s worth doing as much investigation as you can.

warui · June 13, 2025, 9:49am

A badly configured DB would show in the logs.

I fetch data from the db (not hosted by fly.io) quite frequently and additionally use cache.

If it was the DB, I would notice it from the logs. That is, the time difference between fetching and receiving data from the db would show just from the logs themselves.

pavel · June 13, 2025, 10:47am

@warui

This graph shows the response time of your app as seen by the proxy running on the worker host. So communication between the proxy and the machine happens over local interface, no network involved. Full response time that includes the time it took for us to route the request from the edge to the worker is shown on the “Fly Edge” dashboard.

With that information, I don’t think it’s a routing issue, unless you send large requests with body that could’ve became slower if it was indeed a routing issue.

Does you app need to talk to external resources while serving a request?
What kind of requests does your app handle - do they have body or not?
Do you have any kind of tracing to check what takes the app so long to respond?

warui · June 13, 2025, 11:57am

The fly edge graph is below and looks just the same.

My app does not talk to external resources while serving requests.

Most of my requests are create and read(for most users).

My challenge is explaining the HTTP response time spike from 300ms to 8s on average when no deployments or increase of users have occured.

pavel · June 13, 2025, 12:23pm

@warui

I just tried curling one of your machines directly from the host where it’s located, and it sometimes takes several seconds to respond.

Could you try doing the same from inside the machine? I checked on 2871259f6e4138 using /api/workers/health/ endpoint.

If you can reproduce it from inside the machine, it might be something with the app itself. The host the machine is running on has plenty of available resources, so I don’t think it’s a noisy neighbor problem. You mentioned the app accesses a DB outside of fly. Does it need to do so when serving the endpoint that I tried? Does it need to talk to other machines in the app while serving the endpoint? Or is the request served based on the information available in the machine and so doesn’t involve any network calls at all?

Ideally, you’ll need to instrument the app to see which part of the app takes so long to respond, otherwise it’s kinda hard to guess where a problem might be.

halfer · June 13, 2025, 1:44pm

This remains a poor rule of thumb, as I’ve already mentioned. Something may have changed that is under your control. I’m not having a go at you, but I’d want to help by advocating the right investigation practices.

In light of Pavel’s remarks, I’d suggest baking into your build an extremely trivial listener that operates on a different port. They basically should reply with hello in HTTP plaintext, with as simple a stack as you can muster. Definitely no Django. I should think Python has a simple built-in web server, or has a library for the same.

system · June 20, 2025, 4:01pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
HTTP response times all over the place Questions / Help	27	1552	March 30, 2023
Fly response times are inconsistently slow Build debugging	7	681	February 6, 2024
Slow response times since yesterday Questions / Help	8	883	October 27, 2023
Very slow app response times Questions / Help machines	6	162	November 2, 2024
Experiencing massive slow responses on both Fly.io and app hosted on it	6	579	March 16, 2023

Increased HTTP response times after June 11 2025

Related topics