But still, I would like to always keep at least one machine instance running. My use case: hosting a website that needs to load very fast, so no cold starts wanted.
Ideally I would like to have a setup like this:
One V2 app with one group containing two to four machines.
At least one machine always runs to handle spontaneous traffic.
The other machines can be auto started to handle traffic spikes.
Currently there is no way to set a minimum number of instances. We don’t have any immediate plans to support the feature. It’s something we’ve considered but weren’t sure there’d be demand. We’re happy to look into supporting it if there is.
I personally would like to have such a feature. It would bring the autoscaling capabilities of V1 apps (Scale V1 (Nomad) Apps · Fly Docs) to V2 apps.
“min” could be defined in the toml, and “max” would be implicitly defined by the number of machines.
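For illustration, such a setting might look like this in fly.toml. Note that `min_machines` is an invented key used only to sketch the idea, not a real Fly.io setting; `auto_stop_machines` and `auto_start_machines` are the existing keys:

```toml
# Hypothetical fly.toml excerpt: "min" is explicit, "max" is implied
# by the number of machines created for the app.
[http_service]
  auto_stop_machines = true
  auto_start_machines = true
  min_machines = 1   # invented key: never auto-stop below this many instances
```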
Not knowing the internal architecture and its potential challenges, I could imagine the following “solutions” to the “problem”, based on these docs:
Explicitly defining a min-machines number in the fly.toml.
Giving users the ability to alter the auto-stop “strategy”:
If there’s more than one Machine in the region:
the proxy determines how many Machines are over their soft_limit setting and then calculates excess capacity: excess capacity = num of machines - (num machines over soft limit + 1)
if excess capacity is 1 or greater, then the proxy stops one machine
If there’s only one Machine in the region:
the proxy checks if the Machine has any traffic
if the Machine has no traffic (a load of 0), then the proxy stops the Machine
Basically, a user would need to be able to change:
a) the value 1 in “if excess capacity is 1 or greater, then the proxy stops one machine”
b) the “one-machine-left-stop”-criterion: "no traffic/no load" | "never"
Marking individual machines as “static”, i.e. “non-auto-stoppable”. They would then be excluded from the auto-stop evaluation logic entirely.
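To make suggestions (a) and (b) above concrete, here is a rough sketch of the proxy's stop decision with the two proposed knobs. This is speculative pseudologic based only on the documented behavior; `min_machines` and `last_machine_policy` are hypothetical parameters, not real settings:

```python
def should_stop_one(num_machines, num_over_soft_limit, load,
                    min_machines=1, last_machine_policy="no_traffic"):
    """Decide whether the proxy should stop one machine in a region.

    `min_machines` and `last_machine_policy` are the hypothetical
    user-configurable knobs proposed above.
    """
    if num_machines > 1:
        # Docs: excess capacity = num machines - (num over soft limit + 1).
        # Proposal (a): replace the hard-coded 1 with min_machines.
        excess = num_machines - (num_over_soft_limit + min_machines)
        return excess >= 1
    # Only one machine left in the region.
    # Proposal (b): make the "one-machine-left-stop" criterion configurable.
    if last_machine_policy == "never":
        return False
    return load == 0  # current behavior: stop only when there is no traffic

# Defaults reproduce the documented behavior:
print(should_stop_one(4, 1, load=10))                               # True
print(should_stop_one(1, 0, load=0))                                # True
# The new knobs keep machines warm:
print(should_stop_one(4, 1, load=10, min_machines=3))               # False
print(should_stop_one(1, 0, load=0, last_machine_policy="never"))   # False
```

With the defaults this is exactly the logic quoted from the docs; setting `min_machines` or `last_machine_policy` just shifts the threshold without touching the rest of the evaluation.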
I’ve observed that when deploying a Dockerized static single executable written in a language like OCaml, Rust, or Go, the machine starts in double-digit milliseconds, so the cost of cold-starting all machines is almost zero. Cold-starting something like a large Node.js application, by contrast, is incredibly slow, and this idea would be really useful in that case. It’s possible to keep at least one machine hot by constantly hitting a health check endpoint on a machine with the smallest shared CPU, but I’d really like to see this functionality implemented, because relying on such a workaround undermines Fly’s otherwise great developer experience.
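For reference, the keep-warm workaround mentioned above can be as simple as a cron entry pinging the app. The URL and `/healthz` path are placeholders for whatever endpoint your app actually exposes:

```shell
# Hypothetical crontab entry: ping the app's health endpoint every minute
# so the proxy never sees zero traffic on the last machine.
* * * * * curl -fsS https://my-app.fly.dev/healthz > /dev/null
```

It works, but it burns requests and a cron host just to paper over a missing setting, which is exactly why a first-class minimum-instances option would be nicer.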