Hmm, but I don’t necessarily want to load-balance to other regions, because my machine in region X can handle far more traffic and I care a lot about latency.
Also, what is the point of setting a soft_limit if it gets ignored?
According to the docs you linked: "Traffic will be sent to instance when it is closest instance that is under soft_limit"
My machine is way below the set limit, and Fly is still routing traffic away from it. I’m running tests from a single source close to Frankfurt, and I would expect almost all traffic to be served by the machine in Frankfurt, because I set soft_limit to 800.
In reality, though, there seems to be a hard cap at 25 requests.
The [services.concurrency] section will be ignored because it does not belong to http_service, so the app falls back to the default concurrency settings: type = connections, soft_limit = 20, hard_limit = 25. That would explain the cap at 25 you are seeing.
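For illustration, here is a minimal sketch of how the concurrency block would sit under http_service in fly.toml. The internal_port, the hard_limit value, and type = "requests" are assumptions for this example, not taken from your config; only the soft_limit of 800 comes from your post.

```toml
[http_service]
  internal_port = 8080        # assumed port, use whatever your app listens on

  # Nested under http_service, so it applies to the HTTP service
  [http_service.concurrency]
    type = "requests"         # assumed; the default type is "connections"
    soft_limit = 800          # value from the post above
    hard_limit = 1000         # assumed; should be at or above soft_limit
```

If the block is picked up, the cap at 25 should go away after the next deploy.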
Yes, I had that originally, then changed it to use http_service.concurrency, and have since deployed a bunch of times via fly --config=./apps/api/fly.toml deploy --strategy immediate