Deployments fail immediately due to unhealthy allocations

byhemechi · February 25, 2023, 11:47pm

What it says in the title. Watching the logs shows no machine is spawned to even be unhealthy

byhemechi · February 25, 2023, 11:52pm

new error output:

Recent Events
TIMESTAMP               TYPE            MESSAGE                                                                   
2023-02-25T23:51:28Z    Received        Task received by client                                                  
2023-02-25T23:51:28Z    Task Setup      Building Task Directory                                                  
2023-02-25T23:51:31Z    Driver Failure  rpc error: code = Unknown desc = could not fulfill runtime memory request
2023-02-25T23:51:31Z    Not Restarting  Error was unrecoverable                                                  
2023-02-25T23:51:31Z    Alloc Unhealthy Unhealthy because of failed task

byhemechi · February 26, 2023, 12:04am

Seems to only be an issue with nomad apps, i destroyed the app and recreated it with machines and all is working

joao · February 26, 2023, 2:15am

Also happening with my nomad apps.

Managed to deploy by, instead of using a ‘bluegreen’ strategy, choosing ‘immediate’:
fly deploy --remote-only --strategy immediate --config fly.toml

vivek-guerdon · February 26, 2023, 4:35am

Hi,

I am blocked on this since yesterday, not able to deploy my app for the same reason.

codearchitect · February 26, 2023, 5:02am

Adding --strategy immediate fixed it for me too, thank you.

anatoliy · February 26, 2023, 10:02am

--strategy immediate fixed it for me as well.

The issue is easily reproducible with Remix Run app:

Install flyct and authenticate it
Create a new Remix Run app npx create-remix@latest: choose “Just the basic”, “Fly.io”, “JavaScript”, agree to npm install
cd into the app folder and initialize Fly.io flyctl launch: choose all defaults and agree to deploy
Change literally any symbol in, say, app/routes/index.jsx of the app, try flyctl deploy and see failure
Revert the change, try flyctl deploy and see success

I guess there is probably an even easier way to reproduce, it doesn’t seem to be tied to Remix, it’s just I’m sure this one is stable.

joep · February 26, 2023, 10:45am

I’m also unable to deploy since yesterday (last succesful deploy 15h28m ago), but I get no errors. It just fails on release with:

--> release v20 created

--> You can detach the terminal anytime without stopping the deployment
==> Release command detected: /app/bin/migrate

--> This release will not be available until the release command succeeds.
Error release command failed, deployment aborted

Adding --strategy immediate doesn’t change anything so it might be another cause. Any suggestions?

fly v0.0.470
Platform = nomad

vdr · February 26, 2023, 10:55am

Seeing the same on my postgres app, I can’t use the immediate strategy there, can’t restart it, can’t scale it… Truly disappointed with all the problems past weeks.

vivek-guerdon · February 26, 2023, 11:24am

Waiting for the fix as I am blocked

Yaeger · February 26, 2023, 12:49pm

Similar issue here in ams. --strategy immediate works, but moving region also seems to work.

==> Creating release
--> release v79 created

--> You can detach the terminal anytime without stopping the deployment
==> Release command detected: php artisan migrate --force

--> This release will not be available until the release command succeeds.
Error release command failed, deployment aborted

joep · February 26, 2023, 2:04pm

Changing region (and creating a new db) worked for me.

vivek-guerdon · February 26, 2023, 3:07pm

Which region is working? I tried sin and it didn’t work.

joep · February 26, 2023, 3:14pm

ams is NOT working, but lhr and cdg are. lhr seems a bit slow.

JP_Phillips · February 26, 2023, 3:29pm

Hi all, we’re working to provision more capacity in ams and are also adjusting resource allocation. Feel free to follow Fly.io Status - Capacity Issues in AMS region for updates.

Topic		Replies	Views
Not able to deploy	3	332	February 26, 2023
Help with deploy error: Failed due to unhealthy allocations Questions / Help	7	781	April 25, 2023
Deployment suddenly fails: Failed due to unhealthy allocations, but it's listening on [::]:8080 Questions / Help	3	463	December 25, 2022
deploys failing due to "unhealthy allocations" Questions / Help	4	2072	October 26, 2022
Deployment and rollback failed with: Failed due to unhealthy allocations Questions / Help	4	385	March 1, 2023

Deployments fail immediately due to unhealthy allocations

Related topics