Autoscaling auto_stop_machines not working

Hi @Wayrunner

The concurrency limits refer to the number of simultaneous requests a Machine is handling, so they’re not a per-minute or per-second rate. It’s the total number of requests in flight at any given moment.

Edit to add: for example, if you know your app (with its current CPU and memory) can handle at most 100 simultaneous connections, then that would be your hard_limit. You’d set your soft_limit slightly lower so that the proxy can start sending requests to another Machine before that maximum load is reached. The concurrency settings are used to auto start and stop Machines, but they’re also used for load balancing.
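
As a rough illustration of the example above, this is how those limits might look in fly.toml. The numbers are placeholders rather than recommendations: 100 is the hypothetical capacity from the example, 80 is just one possible "slightly lower" soft limit, and the internal_port and auto start/stop values are assumptions about your app, so adjust them to match your own setup.

```toml
[http_service]
  internal_port = 8080          # assumed port; use whatever your app listens on
  auto_stop_machines = "stop"   # assumed setting; stop Machines when there is excess capacity
  auto_start_machines = true    # assumed setting; start Machines again when traffic needs them
  min_machines_running = 0      # assumed; raise this to keep Machines running

  [http_service.concurrency]
    type = "connections"        # count simultaneous connections ("requests" is the other option)
    soft_limit = 80             # hypothetical value; proxy starts preferring other Machines here
    hard_limit = 100            # hypothetical value; no more traffic is sent to this Machine above this
```

With values like these, a Machine sitting at around 80 concurrent connections would be treated as busy, and one at 100 would get no additional traffic until its load drops.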
