CPU autoscaling

pier · June 1, 2021, 5:30pm

Correct me if I’m wrong, but AFAIK autoscaling in Fly only works based on concurrent connections.

Is CPU usage also taken into account?

Eg:

I’m working on a service for encoding audio files with ffmpeg which is pretty heavy CPU intensive. Ideally I’d want a new VM to spin up if the current VM is already encoding a file.

jerome · June 1, 2021, 5:40pm

We don’t currently take that into account no. We might in the future as we are improving our balancing algorithm.

Eventually we might even be able to let people define exactly which metrics are relevant for balancing and scaling their app.

jsierles · June 1, 2021, 5:45pm

Hooking up to a Fly-hosted prometheus metric would be great! This is how I have achieved scaling based on a Buildkite queue depth using the Kubernetes HPA, with much pain involved

pier · June 3, 2021, 4:57pm

What if I set up concurrency to 1?

  [services.concurrency]
    hard_limit = 1
    soft_limit = 1

Would that spin up a new VM on every new concurrent request?

kurt · June 3, 2021, 5:29pm

We don’t scale fast enough to do that effectively. Right now, scaling happens as a background process that uses metrics to pick the count. Metrics are only scraped every 15s, so there’s some lag before we even start a VM.

Topic		Replies	Views
Autoscaling on CPU utilization? Questions / Help	15	2081	May 18, 2023
Auto Scaling - The threshold of when to scale up. Questions / Help docs	7	1062	August 18, 2022
Issue with Autoscaling Based on Request Count in Fly.io autoscaling , proxy	5	82	October 27, 2024
Autoscale from metrics	6	444	November 9, 2023
autoscale max instances	8	727	October 12, 2021

CPU autoscaling

Related topics