How does the scaling work in detail?

greg · April 2, 2022, 11:00pm

As regards hard_limit: yes, if that number is hit, it should trigger the creation of a new VM. If you look at your app metrics in the Fly dashboard, you should see a graph of concurrent requests, so a new VM would be created if that goes above the set limit.
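To make the threshold idea concrete, here is a toy sketch in Python. It is not Fly's actual algorithm, which I don't know in detail. The function name vms_needed and the min_vms/max_vms parameters are made up for illustration, and it assumes a new VM is only needed once concurrency goes above hard_limit per VM.

```python
import math


def vms_needed(concurrent: int, hard_limit: int,
               min_vms: int = 1, max_vms: int = 10) -> int:
    """Toy model (hypothetical, not Fly's real logic): the smallest VM count
    that keeps per-VM concurrency at or below hard_limit, clamped to the
    configured minimum and maximum."""
    if hard_limit <= 0:
        raise ValueError("hard_limit must be positive")
    # Spread the concurrent requests evenly and round up to whole VMs.
    needed = math.ceil(concurrent / hard_limit)
    # Never go below the minimum or above the maximum VM count.
    return max(min_vms, min(needed, max_vms))
```

With hard_limit = 25, this gives one VM for 25 concurrent requests and two for 26, which is roughly the behaviour you would expect to line up with the dashboard graph. If the real scaler reacts differently (for example with a delay), the numbers won't match exactly.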

What complicates autoscaling is that there are two modes: standard and balanced. The VM distribution then depends on another variable: how many regions your app is set to run in. You can see those with fly regions list. You can see more on this page, and if you scroll down there are various commands to set the options depending on how you want it to work.

Upscaling should happen quickly (as soon as Fly spots the increased load), but I’m not sure about downscaling. As far as I’m aware, you can’t specify a time or rule for when that happens. 5 hours sounds wrong, unless the load justifies it and/or the minimum is now two.

I see autoscaling is being reworked, but hopefully someone from Fly can assist.
