Small consul cluster + litefs failover

tj1 · March 28, 2023, 10:23pm

Thanks for the additional information.

So, in theory, what I’m looking for is a floating dns name that can be moved to the appropriate host on failover, however, this doesn’t fully solve the problem as litefs also has to decide that it is now the primary (there are a few old-school ways of doing this VeritasFS for Oracle comes to mind where it actually killed all the IO on the primary).

We have two possibilities right now for advertise_url, but neither will work with a static lease.
<alloc_id>.vm.<appname>.internal
<region>.<appname>.internal

I know it’s rather silly, but even if litefs queried a static url for the master name / advertise url to failover, that could work ok with some decent monitoring.

Anyway, it looks like for failover, we’re going to need a small consul cluster atm for v2.

Did some more testing. When litefs process is killed on the machine, it fails over immediately. When fly machines stop is used, it hangs for a bit. Perhaps a tcp socket is lingering?

Topic		Replies	Views
LiteFS with consul leasing: Cannot connect to consul due to failed cert verification elixir , litefs	13	1072	June 2, 2023
litefs on fly with consul Questions / Help consul , litefs	2	801	August 22, 2022
My site is in a really bad state litefs	19	737	November 5, 2023
Issues connecting to Consul with LiteFS consul , litefs	7	427	December 14, 2023
Connecting to a litefs cluster from outside of fly.io Questions / Help litefs	4	429	September 29, 2023

Small consul cluster + litefs failover

Related topics