I have something similar. Here's my architecture, which is also how I keep background jobs on multiple machines from contending with each other:
I am building an API in my middle-tier app, Distributor. It receives requests from Web that write to the database. I also have several Supervisord-managed processes in the background that watch for database changes using simple loop polling:
- If a status is `requested` and the record has not been processed, clone a template (stopped) machine and start it, and change the status to `starting`
- If a status is `starting` and the target machine can receive requests, send the request and change the status to `started`
- If there are too many machines running, add a minute to the request start time and change the status to `delayed`
- There's a process to put `delayed` requests back to `requested` if the running machine count drops
- Every time a request is rejected it bumps up a `retry` count, and there's a process to permanently reject requests that are rejected too many times (e.g. invalid request)
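To make the lifecycle concrete, here's a minimal sketch of the statuses (Python purely for illustration; the transition map is my reading of the list above, and where exactly the rejection edge hangs off is up to you):

```python
from enum import Enum

class Status(str, Enum):
    REQUESTED = "requested"  # written by the API when Web submits a request
    STARTING = "starting"    # a template machine has been cloned and is booting
    STARTED = "started"      # the request has been handed to the running machine
    DELAYED = "delayed"      # too many machines running; try again in a minute
    REJECTED = "rejected"    # rejected too many times (e.g. invalid request)

# Each background process owns exactly one of these edges.
TRANSITIONS = {
    Status.REQUESTED: {Status.STARTING, Status.DELAYED},
    Status.STARTING: {Status.STARTED},
    Status.DELAYED: {Status.REQUESTED},
}
```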
This may seem like a lot of work, but each process is only a bit of SQL run against a managed database, plus a bit of language logic to move records between statuses. Some queries have to be protected against race conditions, bearing in mind that each of these processes will have redundant copies running.
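As an example of the "bit of SQL plus a bit of language logic", here is a rough sketch of the `requested` → `starting` worker. It assumes PostgreSQL and psycopg2, and the table/column names (`requests`, `requested_at`, `template_machine`) and the `clone_and_start_machine` helper are invented for the example. The `FOR UPDATE SKIP LOCKED` claim (or an equivalent atomic `UPDATE` guarded by the current status) is what stops redundant copies of the same worker from grabbing the same record:

```python
import time

import psycopg2

# Claim one unprocessed request and flip it to 'starting' in a single statement.
# SKIP LOCKED means a second copy of this worker simply takes the next row.
CLAIM_SQL = """
    UPDATE requests
       SET status = 'starting'
     WHERE id = (
            SELECT id
              FROM requests
             WHERE status = 'requested'
             ORDER BY requested_at
             LIMIT 1
               FOR UPDATE SKIP LOCKED
           )
 RETURNING id, template_machine;
"""

def clone_and_start_machine(template):
    """Placeholder for the real 'clone the stopped template and boot it' call."""

def run_forever(dsn, poll_interval=5):
    conn = psycopg2.connect(dsn)
    while True:
        with conn, conn.cursor() as cur:  # 'with conn' commits (or rolls back) the claim
            cur.execute(CLAIM_SQL)
            row = cur.fetchone()
        if row is None:
            time.sleep(poll_interval)     # nothing to do: simple loop polling
            continue
        request_id, template = row
        clone_and_start_machine(template)
        print(f"request {request_id}: machine starting from template {template}")
```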
Your situation will be a little different in that, rather than starting machines on demand, you have a pool of running ones. That's just another process in a process manager that starts extra machines beyond the number occupied by real users. You might also have something to destroy machines that no longer have a user, so that the free pool is kept at a consistent size.
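For what it's worth, a pool-maintenance loop in that style might look like this sketch; the target size and the `count_free_machines` / `start_machine` / `destroy_machine` helpers are all hypothetical stand-ins for your own infrastructure calls:

```python
import time

TARGET_FREE = 5      # how many unoccupied machines to keep warm (pick your own number)
POLL_INTERVAL = 30   # seconds between checks

def count_free_machines() -> int:
    """Placeholder: count machines that currently have no user attached."""
    return 0

def start_machine() -> None:
    """Placeholder: clone the stopped template machine and boot it."""

def destroy_machine() -> None:
    """Placeholder: tear down one idle machine."""

def maintain_pool():
    while True:
        free = count_free_machines()
        if free < TARGET_FREE:
            for _ in range(TARGET_FREE - free):
                start_machine()      # top the pool back up
        elif free > TARGET_FREE:
            for _ in range(free - TARGET_FREE):
                destroy_machine()    # shrink the pool back down
        time.sleep(POLL_INTERVAL)
```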