Autoscaling not distributing properly

Autoscaling kicks in when a region receives significant traffic that is sustained for a few minutes. The threshold is based on your app’s soft limit.

Traffic from a monitoring system probably won’t trigger autoscaling, but real-world traffic should.

The way regions currently work, we don’t guarantee app instances will be placed in all the specified locations. The region list is only a set of regions where Fly may put instances.

Right now, we don’t support a “minimum per region” setting. The only ways to achieve this are Anchor scaling (incompatible with autoscaling) or using --max-per-region with fly scale count (a fixed count that will not autoscale).

If this is only for benchmarking purposes: the best way would be to set a fixed count with --max-per-region so you can get your instances exactly where you want them.
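
As a rough sketch of that fixed-count setup, something like the following should work. The app name (my-app), the three regions, and the counts are placeholders I made up for illustration, so swap in your own:

```shell
# Hypothetical app name and regions; replace with your own.
# Restrict the app to the regions you want to benchmark from.
fly regions set ams fra sin --app my-app

# Fixed count of 3 with at most 1 instance per region, which should
# end up as one instance in each of the three regions.
# Note: a fixed count like this will not autoscale.
fly scale count 3 --max-per-region 1 --app my-app
```

Running fly status afterwards should show where the instances actually landed.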

If this is for real-world traffic: you can tweak your soft limit so autoscaling is triggered more often.
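
For reference, the soft limit lives in the concurrency block of your fly.toml. This is only an illustrative fragment: the port and the numbers are assumptions, not recommended values. The idea is that a lower soft_limit means an instance counts as busy sooner, so autoscaling should be triggered with less traffic:

```toml
# Illustrative values only; tune these for your own app.
[[services]]
  internal_port = 8080
  protocol = "tcp"

  [services.concurrency]
    type = "connections"
    soft_limit = 20   # lower this to make autoscaling trigger sooner
    hard_limit = 25   # hard cap on concurrent connections per instance
```

You’d need to redeploy for the new limits to take effect.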
