Hi! I’m making some load testing on a simple static web server with some basic setup: standard 1-10 scaling, micro-1x VM, FRA region.
Starting with 50RPS everything seems fine:
Increasing up to 550RPS, interestingly seems even better:
I guess it’s because of internal caching - at the time of testing
flyctl logs was not working for me at all. At this stage the app was not scaling - still 1 micro-1x.
So then I set the rate to 1550RPS and it was still catching up, however the number of concurrent requests increased to 750.
At this point I’ve had to wait for a few minutes to get the app scaled up. During this period connections started to drop and I got some error responses too
But then it scaled up to 10 VMs and became responsive again. Maybe not on that level I’d have expected, though.
I was then trying to increase the rate above 2000 RPS but surprisingly the throughput remained fluctuating around 1900RPS. Maybe fly.io rate-limited my requests? I tried to increase the number of concurrent requests (“in-flight”) but results became even worst. So it seems to be the max throughput. I also observed that auto-scaling was a bit behind the actual demand.
Any comments or ideas on how to improve the throughput are welcomed. Thanks.