Scaling limits on fly.io

Biswas · December 18, 2021, 2:11pm

I’m trying to see if fly.io can be a good alternative to Google’s Cloud Run and I wanted to ask if these two features are available in fly.io:

Is it possible to have a concurrency of 1 per container? In other words, a single request would be handled per container; I need this because the task is not parallelizable in a single container.
Is it possible to scale to a large number of containers (e.g. 1000 instances)?

catflydotio · December 19, 2021, 9:24pm

Hi @Biswas,

The quick answer to each of your questions is “yes,” with a caveat on #2.

Yes. Instances on fly.io are Firecracker VMs built with your container image. You can set a hard limit to restrict VMs to a single in-flight request: App Configuration (fly.toml)
Yes, you can scale to a large number of VMs. However, our autoscaling probably won’t do what you want; it happens every 15s, which is very slow if you need to add an instance in response to each request.

We do have a demo open source proxy that could be adapted to this: GitHub - superfly/machine-proxy: PoC HTTP proxy for scale-to-zero apps via the Fly machines API. This requires you to implement logic to start and stop machines based on requests, though.

Biswas · December 20, 2021, 7:18am

Thanks for the response!

So if I understand correctly, I can work around the autoscaling delay by writing a proxy that takes requests and spawns containers on fly.io? When using the APIs, what would be the expected delay for a new container? I could probably work with a delay of ~5s.

Topic		Replies	Views
Using fly.io as an alternative to AWS Lambda Questions / Help	6	2058	June 21, 2023
autoscale max instances	8	703	October 12, 2021
Issue with Autoscaling Based on Request Count in Fly.io autoscaling , proxy	5	72	October 27, 2024
Is there a way to replicate this setup in Fly?	2	587	February 27, 2022
How long does it take for autoscaling to kick in.	4	448	September 24, 2022

Scaling limits on fly.io

Related topics