Do bluegreen deployments wait for machines to stop?

morse · August 7, 2025, 11:21am

I want to support long running requests for streaming AI responses. Requests can take minutes. I currently prevent fly.io from scaling down the machines by capturing SIGINT signals and witing until requests are done to stop the machine.

When doing a bluegreen deployment will the fly deploy command wait for machines to exit? This could take up to 5 minutes in my case.

Is there a way to prevent fly deploy to wait for old machines to stop?

morse · August 7, 2025, 11:30am

I think fly deploy –-detach is what I am looking for

jfent · August 8, 2025, 4:50pm

–detach doesn’t change the semantics of a deployment strategy at all, so it’s not a solution for the problem you stated.

Bluegreen will cordon old machines before destroying them, but I don’t believe it will wait for minutes for those machines to gracefully shutdown.

If your app cannot tolerate killing a streaming response and restarting it for a user when a deploy happens then you should probably write your own custom deployment code. The basic gist would be to create a bunch of new machines, cordon the old ones, somehow tell the cordoned machines that they need to shutdown and then poll all the cordoned machines, destroying them as each of them moves into the stopped state. Deploy is complete once all the old machines are destroyed.

Or you can do the really naive thing and cordon old machines, wait 30 minutes then destroy, under the assumption that all streaming will have finished after 30 minutes of being cordoned. This won’t work if you deploy really frequently, because bluegreen doubles the number of machines.

I believe all the code for bluegreen is actually in the flyctl repo so you can use that as the basis of your own strategy.

morse · August 9, 2025, 9:20am

I don’t think my use case is to exotic to need custom deployment code, what do you think instead if the max kill_timeout was increased? I opened a topic for discussing this here: Feature Request: increase maximum `kill_timeout` to 1 hour - #2 by khuezy

Topic		Replies	Views
Question on deployment lifecycle (wait before kill) Questions / Help	3	692	June 19, 2023
When does `fly deploy` return? Questions / Help flyctl	1	12	January 22, 2025
Extended maximum kill_timeout option	2	141	June 13, 2024
How do I eliminate the `~20s` downtime when running `flyctl deploy --strategy bluegreen`? Questions / Help	7	1431	December 29, 2023
Don't kill running machines during fly deploy ? Questions / Help	6	550	December 6, 2023

Do bluegreen deployments wait for machines to stop?

Related topics