Request: Autoscale without killing all VMs to do so

This might already be possible, but we are using the Balanced scaling strategy and find it strange that when Fly decides to reallocate resources based on traffic, every VM needs to be re-deployed.

Is there a way to avoid having every VM in an app reboot and kill the Redis cache?

The way I see this working would be to only remove the least used VM and move it to the newly desired region without touching the other VMs in the pool.

This is actually a regression in Nomad (which we use to schedule these jobs). We can’t change balancing without restarting all the VMs. We throttled these changes down to once per hour to keep the disruption minimal, and have some workarounds planned for later.

Right now, the best bet if you don’t want to churn VMs like this is to turn balancing off, pick regions ahead of time, and let it scale.

Makes sense, I would love this to be a feature in the near future, because I also think limiting it to once per hour could mean the “edge” balancing reacts too late and loses some of its advantage.

So the recommended configuration would be to move to the Standard scaling strategy with a pool of regions where we always want to be available?

Our traffic is mostly from the US, so what would you suggest as the region pool and min/max for the Standard scaling settings?

In the US I’d probably do lax/ord/ewr with a min_count of 3. That should get you really low latency from most places.
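Roughly, that looks something like this (just a sketch; the exact flyctl subcommands can differ between versions, and `my-us-app` is a placeholder app name):

```
# Pin the region pool to the US regions above
flyctl regions set lax ord ewr -a my-us-app

# Switch from Balanced to Standard autoscaling with a floor of 3 VMs
# (some flyctl versions expose this as `flyctl autoscale standard` instead)
flyctl scale standard min=3 -a my-us-app
```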

The restarts aren’t much of a problem, usually, so it’s fine to leave auto balancing enabled too!

Sounds good, I will give that a try!

Thanks so much!

@kurt - How does this look? We are still on Balanced with Min=5, Max=25

Thanks!

I am still seeing hourly scale events with this setup, so we are still losing cache on an hourly basis.

What is the best way to prevent this?

PS: If you look at the screenshot above, you will notice there are 2 VMs in lax and no VM in iad.

Thanks,

Dan

That app is still “Balanced”. Did you run flyctl scale standard min=X?

If you’re trying to retain cache, you’re probably better off using Redis: https://fly.io/docs/reference/redis/

We have a hidden setting that will limit each region to one VM; if you switch to Standard I can enable that for you. It will occasionally cause deploy problems if your app needs more VMs than there are regions available, but it works really well if you define backup regions.
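If you do go that route, backup regions are defined with the regions command, something like this (again just a sketch; the backup region choices and app name are examples):

```
# Give the scheduler somewhere to fall back to if a primary region is
# unavailable or the app needs more VMs than there are primary regions
flyctl regions backup sea iad -a my-us-app

# Check the resulting region pool and backups
flyctl regions list -a my-us-app
```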

Gotcha, I thought you meant I could leave it on Balanced if we made the number of regions in the pool the same as the MIN.

I have now updated the scaling strategy to Standard.

We actually are using Redis, and it is one of the reasons we wanted to prevent the VMs from dropping so often, since we thought the Redis DB was local to each VM. Is this true for Database 0?

I just noticed there is a way to share the Redis DB across all the VMs by connecting to Database 1, which makes me think the Redis DB is not local to each VM and that there is one Redis DB per app?

Either way, it sounds like we need to start using Database 1 so that cache is updated for all VMs everywhere.

Thanks again!

The Redis cache is per region. It’ll survive between app instances. Database 1 is designed to push changes to other regions, mostly for things like purging keys though.
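So in practice you’d keep normal cache reads/writes on database 0 and only write to database 1 for things you want pushed to every region. A quick redis-cli sketch; note that FLY_REDIS_CACHE_URL is an assumption here, so check the Redis docs linked above for the exact connection details:

```
# Database 0: the regional cache; stays local to the region and
# survives app instances restarting there
redis-cli -u "$FLY_REDIS_CACHE_URL" -n 0 SET page:home "cached html"
redis-cli -u "$FLY_REDIS_CACHE_URL" -n 0 GET page:home

# Database 1: writes here are pushed to the other regions too,
# mostly useful for things like purging keys everywhere
redis-cli -u "$FLY_REDIS_CACHE_URL" -n 1 DEL page:home
```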
