Machines high availability

pier · March 1, 2023, 6:13pm

So the machines docs mention that:

Machines VMs are tied to hardware, even if they don’t have a Volume. If the hardware goes down, the instance goes down.

If I want to run a v2 app with HA what setup should I use?

If I create two machines in the same region, I have no way of knowing whether they are running on the same hardware.

Should I create two machines in different regions? Will the routing layer try machine A first and, if it’s down, try machine B?

Also, if one of the machines is too busy, is there a way to send a response from machine A so Fly will instead send the request to machine B?

JP_Phillips · March 1, 2023, 6:18pm

It’s not explicitly documented, but the platform defaults to spreading machines across hosts within a region, and at some point we’ll expose more control over desired placement.

I believe the best way to handle this is setting the proper concurrency on the machine’s service.

pier · March 1, 2023, 6:22pm

Say I set concurrency to 1 and have 2 machines. What happens if a third concurrent request comes in?

JP_Phillips · March 1, 2023, 6:50pm

It “queues” and retries for a while, then stops retrying and either closes the connection or returns an HTTP error code.
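A toy model of the behaviour described above may help, with the caveat that it is not how Fly Proxy is actually implemented. It assumes each machine accepts at most `hard_limit` concurrent requests and that a request which finds every machine full is retried a fixed number of times before being rejected. The names `Machine`, `route` and `max_retries` are hypothetical, and the model has no clock, so nothing frees up between retries.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Machine:
    name: str
    hard_limit: int      # assumed: max concurrent requests per machine
    in_flight: int = 0   # requests currently being served

    def try_accept(self) -> bool:
        """Take the request only if there is spare capacity."""
        if self.in_flight < self.hard_limit:
            self.in_flight += 1
            return True
        return False


def route(machines: List[Machine], max_retries: int = 3) -> Optional[str]:
    """Return the name of the machine that accepted the request,
    or None if every attempt found all machines full."""
    for _attempt in range(max_retries + 1):
        for machine in machines:
            if machine.try_accept():
                return machine.name
        # A real proxy would wait before retrying; this sketch does not.
    return None  # stands in for "closes the connection / HTTP error"


# Concurrency 1, two machines, three concurrent requests.
machines = [Machine("A", hard_limit=1), Machine("B", hard_limit=1)]
results = [route(machines) for _ in range(3)]
print(results)
```

The first two requests land on A and B; the third finds both full on every attempt and is rejected, which corresponds to the closed connection or error code in the answer above.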

ignoramous · March 1, 2023, 8:38pm

Thanks.

Fly Proxy will not retry for raw TCP / UDP connections, will it?

charsleysa · March 2, 2023, 2:54am

If this is an HTTP service, I believe there is a way. Machine A can include the Fly-Reject header in the response (it can have any value) and Fly will try a different machine instead.
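As a rough sketch of what that could look like on machine A, assuming the header works as described (the header name is taken from this post and is not confirmed by the docs): the handler below adds it whenever the machine is already at its limit. `MAX_IN_FLIGHT`, `build_response`, `serve`, the 503 status and port 8080 are all illustrative choices, not part of any Fly API.

```python
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer

MAX_IN_FLIGHT = 1            # hypothetical per-machine limit
_in_flight = 0
_lock = threading.Lock()


def build_response(busy: bool):
    """Return (status, headers, body) for a request."""
    if busy:
        # Header name as quoted in the post; unverified. Status is a guess.
        return 503, {"Fly-Reject": "busy"}, b"busy\n"
    return 200, {}, b"ok\n"


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        global _in_flight
        # Decide atomically whether this request fits under the limit.
        with _lock:
            busy = _in_flight >= MAX_IN_FLIGHT
            if not busy:
                _in_flight += 1
        try:
            status, headers, body = build_response(busy)
            self.send_response(status)
            for key, value in headers.items():
                self.send_header(key, value)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)
        finally:
            # Release the slot only if this request actually took one.
            if not busy:
                with _lock:
                    _in_flight -= 1


def serve(port: int = 8080) -> None:
    """Run the server until interrupted."""
    ThreadingHTTPServer(("0.0.0.0", port), Handler).serve_forever()
```

Calling `serve()` starts the server. Whether the proxy then retries on another machine depends entirely on the header being honoured, which this thread does not settle.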

kurt · March 2, 2023, 3:41am

No, TCP connections are one and done. We’ve experimented with retry-like features, but haven’t figured that out yet.

UDP is fire and forget; those packets don’t even hit the userland proxy right now.

pier · March 2, 2023, 3:44am

I can’t find any reference to that Fly-Reject header in the docs.

Anyway, I think a better approach would be to have another app doing the fly control and triggering machines on and off.
