Load balancer documentation

havarnov · September 16, 2021, 9:08am

Is there any documentation on how the load balancer work? Let’s say that I have multiple instances running in the same region, how will the load balancer determine which instance is hit?

kurt · September 16, 2021, 9:50pm

We don’t have any docs on load balancing, I don’t think. It’s relatively simple, here’s the priority list:

Send request to the nearest VM under the soft limit set in the app config
a. If there are multiple options, take two least loaded and pick one at random
If all VMs are over soft limit, send request to VM that’s under the hard limit set in the app config

There’s extra logic in there to retry different backends, but that’s mostly transparent.

havarnov · September 17, 2021, 6:25am

Thanks, this is the information I was looking for! One more question: how does the load balancer take the two least loaded? Is it using load averages, cpu or something else?

jerome · September 17, 2021, 12:24pm

It’s using the number of connections / requests as a metric for least loaded. Eventually we’d like to be able to load balance based on other metrics.

mathiasn · February 4, 2022, 8:25am

Hello,
is there a difference for services.concurrency: type = "connections"?

Topic		Replies	Views
Load balancing within a region	1	876	July 12, 2022
Load balancing with the concurrency soft limit parameter	1	634	February 23, 2022
Load balancing based on the pool of connections to the database databases , proxy	2	37	June 11, 2025
understanding load balancing autoscaling , proxy	11	256	September 23, 2024
Load balancing that does not distribute traffic Questions / Help machines , proxy	2	46	April 17, 2025

Load balancer documentation

Related topics