So I would assume that if my app is consistently below 100 requests then eventually my app would scale down to 2 machines. I have 6 total. Instead I am consistently seeing ~4 running. When I look at the concurrency graph I donât see any one server ever going over 100. Shouldnât that mean that my server would scale down to 2 machines and chill there until I start hitting the limits?
@w8emv ah so it looks like it has to do with long lived connections like WS, those may not be showing the in the graph as clearly. That is my best guess based on the description I saw in that resposne you linked.