While I can configure service concurrency by requests or connections, it seems I have no way of finding out what the current concurrency value per machine actually is.
I would need that value to understand the concurrency limits in the first place, and then to tune them for typical load situations and adjust auto scaling.
Not in an obvious way. I can find VM service concurrency, but that is the number of machines. For example, I can find nothing whatsoever regarding request or connection concurrency, which is the criterion the limits are actually configured by.