Hi! I’m running a application on fly and I’m making a few changes, however in my Activity Page Fly.io is releasing about every minute, I’m now on v959 (I think I was on v50 or something)?. Seems like a issue on your side, I’m just reporting as it seems it has stopped.
However, all health checks are passing, and I have only 1 application configured.
I was using GRU last week but changed to SEA, for some reason it’s trying to deploy to GRU, I think. Which is odd, as I don’t have it set in my regions anymore. Regions [app]: sea
Hmm I wonder if the machine is stuck in a bad state. Can you try to clone the machine to the same region, then once that has completed, delete the old machine.
Unfortunately didn’t work. Just made a new deploy to see if it fixed itself but its still releasing. But the ‘Monitoring’ tab doesn’t show GRU anymore, only SEA. I’m on V1256 (well, not really as flyctl releases show something else). Any other ideas? I’m thinking about creating a new app and migrating everything else there, but not sure though…
Fly has stopped the Releases on v1299. If it comes back it’ll come back now, since I just deployed that fly.toml format…
I had auto_rollback enabled earlier too. Disabled it a few hours ago, I think (when the problem started).
edit: Just noticed the UI mentioning to add checks. I’ll have a look at that config.
edit2: Oh it mentioned Setup checks because I removed them from my old fly.toml. Just fixed it.
Well, it seems to have stopped at v1299, monitoring screen is still talking about the failed deployment for v1299 though.
Nothing sticks out to me that would cause this, I would bet that the app is stuck in a bad state and fly will continue to think it needs to be deployed until its corrected either manually by the team or by knocking it back into a good state via a series of commands unfortunately.
Might be worth just creating a new app, running the ENV script from the upgrade guide to copy over ENVs and then swap out DNS to the new apps IPs.
well, the app is running successfully… but I think I’ll move everything into another one because that huge activity is really weird. But it did stop releasing though. I think it starts when I set env vars (haven’t set one in a while) but can’t be sure.
OK, so that’s strange. Ended up recreating the application with the same fly.toml but only a different app name and now its happening again. Only deployed one time yesterday. Now on V1600… really strange.
Doesn’t seem to be impacting anything, but it’s a really strange behavior.
@brenol Could you please describe the steps you used to deploy the app?
For some reasons, your app has both Machine instances and a nomad config. Nomad tries to create allocations, fails, but this leads to constantly increasing release number.
This looks like a bug on our side and I’m trying to understand how this could have happened.
After the deploy was complete, I went and updated my Cloudflare configuration with the new IPs.
I noticed I had no checks set-up (as I migrated from the [[services]] to the [http_service] block, so I added the http_check and tcp_check to my fly.toml.
Finally, I ran flyctl machines destroy <old_machine_id>.
I think thats all I did. Hope it helps to troubleshoot!
@brenol I can’t seem to reproduce with these steps. Just to double check, have you enabled autoscaling on the app (fly autoscale set min=<min> max=<max>)? It looks like enabling autoscaling on a Machine app leads to exactly this behavior.
Oh. Yes! I did enable autoscale on the machine. Sorry about that, forgot to mention it as I had to search on my shell history to identify all used commands and I completelly missed it.
So, running flyctl autoscale disable on my side should fix cause a fix, correct? I’ll have a look on how I should run autoscale on apps-v2 then. Thank you @pavel.
I think there is currently another bug that prevents it from being disabled. So a quicker way would be to re-create the app (you mentioned it’s not yet in production, right?).
I’ll have a look on how I should run autoscale on apps-v2 then.
For this you would need to pre-create additional machines and make sure that you have auto_stop_machines = true and auto_start_machines = true in fly.toml