Allowing instances to report their current load

coxley · March 27, 2022, 4:17am

After going through the docs, this is the bigger piece missing for me. Fly can already consume your Prometheus metrics. Add an option for saying which metric is your “load counter” that reports 0-1000?

Concurrent requests isn’t the best indicator for load. Let’s take an app of mine: diagram rendering. The first time a diagram is rendered, it’s (relatively) computationally expensive. But we can cache it indefinitely — so I do. I even distribute the cache to other instances in the same region.

So after the first request, I can have thousands of RPS and be fine with one instance. I really care about the amount of compute going on at a given time on an instance. Ideally this would influence which instance is picked within a region, and with a configurable saturation policy, auto-scale up.

coxley · April 7, 2022, 6:25am

Any interest here?

Topic		Replies	Views
Request: Analytics based on Region	1	291	May 24, 2021
CPU autoscaling	4	431	June 3, 2021
The fly_app_concurrency metric is now more useful Fresh Produce	0	708	January 27, 2025
See number of running machines appsv2	3	279	June 16, 2023
Prometheus resource limitation? Questions / Help	0	279	November 2, 2022

Allowing instances to report their current load

Related topics