My project deploys itself using Github Actions for CI/CD with flyctl-actions.
I’m finding that the deploy step of my workflow is hanging indefinitely and I have to kill the job manually. The app is still deployed successfully - the deploy step just seems to get stuck or never exit.
The actual logs and the output of fly checks list would help a lot debugging this — would you be able to post them here? There are a couple reasons this could happen but hard to know without the logs.
Based on the fact that the app is still deployed correctly I’m guessing one of the checks is taking too long to go green, we can confirm that.
$ fly -c fly/fly.staging.toml checks list [16:57:24]
Health Checks for <my app>
NAME STATUS ALLOCATION REGION TYPE LAST UPDATED OUTPUT
382404c10e0d588515ce107c8ca067ea passing 5349cae3 lhr HTTP 53m22s ago HTTP GET
http://172.19.1.170:4000/healthz:
200 OK Output: howdy 👋
This looks fine, can we look at the logs? There might be a clue there.
In both local and Github based deployments, you’re first pushing an image into the Fly registry and then just initiating a deploy with that image tag, right? And on GitHub actions that command does not terminate within a reasonable amount of time, but it does locally?
Just chiming in to say that I’m seeing the same issue today. AFAICT, the deploy is happening and the activity tab on my app shows a new release. It’s just that the command in the github action isn’t exiting, so it hangs indefinitely.
==> Creating release
1831
Release v250 created
1832
Release command detected: this new release will not be available until the command succeeds.
1833
You can detach the terminal anytime without stopping the deployment
==> Release command
Command: /app/bin/enaia eval Enaia.Release.migrate
Configuring virtual machine
Starting virtual machine
Starting init (commit: 7943db6)...
2021/11/10 17:42:11 listening on [fdaa:0:309a:a7b:ab8:5083:6bf8:2]:22 (DNS: [fdaa::3]:53)
Reaped child process with pid: 562 and signal: SIGUSR1, core dumped? false
17:42:14.162 [info] Migrations already up
Main child exited normally with code: 0
Error: The operation was canceled.
The latest flyctl release includes a fix for release commands hanging. The underlying issue was a delay in logs holding up the release monitoring code.