kill_timeout not working as expected

jivanovic · August 22, 2023, 9:39am

Hello Fly Community, I have 2 app instances running on performance-2x machines. I configured kill_timeout in fly.toml, but noticed that the value I set is not respected when an instance is downscaled. I set kill_timeout to 300 (5 minutes), yet once the shutdown signal is sent the instance shuts down instantly. Should I also set a kill_signal value?

Here is my fly.toml file:

app = "..."
primary_region = "iad"
kill_timeout = 300

[build]

[http_service]
  internal_port = 3000
  force_https = true
  auto_stop_machines = true
  auto_start_machines = true
  min_machines_running = 0
  processes = ["app"]

Here is a screenshot of the shutdown process logs. As you can see the instance is shut down as soon as the signal is sent.

roadmr · August 22, 2023, 1:30pm

Hi there!

The log seems to indicate your kill_timeout change didn’t get applied - the interval between SIGINT (signal the application to shut down gracefully) and SIGTERM (ok, you didn’t shut down gracefully, now just go away) is 5 seconds, which is the default.

If you can get your app to respond to SIGINT and shut down gracefully, though, that’s definitely the best option; kill_timeout is there in case your app’s shutdown takes more than 5 seconds.
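As a rough illustration of "respond to SIGINT and shut down gracefully", here is a minimal sketch. It assumes a Python app, which the thread does not confirm, and `ShutdownFlag`, `run_until_shutdown`, and `do_one_unit_of_work` are hypothetical names made up for this example. The idea is only that the handler records the request and the main loop finishes its current unit of work before exiting.

```python
import signal
import time


class ShutdownFlag:
    """Records that a shutdown signal arrived so the main loop can exit cleanly."""

    def __init__(self, signals=(signal.SIGINT,)):
        self.requested = False
        self.received = None  # signal number that triggered shutdown, if any
        for sig in signals:
            # signal.signal must be called from the main thread.
            signal.signal(sig, self._handle)

    def _handle(self, signum, frame):
        # Keep the handler tiny: just record the request.
        self.requested = True
        self.received = signum


def run_until_shutdown(flag, do_one_unit_of_work, idle_seconds=0.1):
    """Run units of work until shutdown is requested; return how many completed."""
    completed = 0
    while not flag.requested:
        do_one_unit_of_work()  # hypothetical callback supplied by the app
        completed += 1
        time.sleep(idle_seconds)
    return completed
```

With a loop like this, the time between SIGINT and the forced kill (the kill_timeout window) is what the app has available to finish the unit of work already in progress.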

Did you fly deploy when you changed the kill_timeout configuration? You can double-check the effective configuration with fly config show.

Daniel

jivanovic · August 22, 2023, 3:32pm

Hey @roadmr, thank you for the reply, I really appreciate it!

I would prefer to use kill_timeout since it is somewhat specific to my use case. You can read more about it in this thread: Instance Downscaling.

I checked fly config show and it looks like kill_timeout should be applied. However, I am still testing and it does not seem to be working. Here is the fly config show output:
[Screenshot 2023-08-22 at 17.29.46]

Do you have any further ideas on why that could be?

One more thing: the Billing Dashboard has been stuck loading for me today, so I cannot see the anticipated monthly charge. I checked the status page and everything seems to be working on your end, but the Billing Dashboard never loads fully.

MatthewIngwersen · August 22, 2023, 9:19pm

Hey @jivanovic—after looking into it further, we discovered a backend bug where the kill_timeout was being ignored when a machine was fully stopped. The patch is being rolled out, so hopefully this will work for you now.

Thanks for the report, and if this doesn’t seem to fix the issue for you, please let us know!

(The dashboard issue should be resolved now too.)

jivanovic · August 23, 2023, 8:04am

Hello @MatthewIngwersen, the kill_timeout is applied as expected! Thank you for the fast patch, you guys are the best.

The dashboard issue is also resolved.

