My apps are all of a sudden not responding at all, looks like they’ve been down for about 4 hours (I haven’t touched them since Friday when they were working fine). My error monitor shows nothing, and the fly dashboard says they are pending.
If I try to redeploy them I get stuck on Running release task (pending)....
Any direction on what I can try would be massively helpful, I have customers that are unable to use our platform right now. (I have also emailed my dedicated support but not heard anything)
2022-10-03T05:05:34Z app[54e4b028] ewr [info]05:05:34.385 request_id=[REDACTED] [info] Sent 200 in 919µs
2022-10-03T05:07:42Z runner[54e4b028] ewr [info]Shutting down virtual machine
2022-10-03T05:07:42Z app[54e4b028] ewr [info]Sending signal SIGTERM to main child process w/ PID 516
This is ONLY happening for me on the EWR region. If I do any other region my app deploys fine. It looks to me like there’s some sort of outage in part of the EWR region?
My database in EWR is doing fine for now, although I’m sweating bullets over here…
I’m seeing this in my app as well (also in EWR). I can’t redeploy and it’s stuck in pending:
$ fly status --all
App
Name = whatgotdone
Owner = personal
Version = 166
Status = pending
Hostname = whatgotdone.fly.dev
Deployment Status
ID = 83c510bf-3b7b-64b4-9eb6-8f5e9a2db012
Version = v166
Status = running
Description = Deployment is running
Instances = 1 desired, 0 placed, 0 healthy, 0 unhealthy
Instances
ID PROCESS VERSION REGION DESIRED STATUS HEALTH CHECKS RESTARTS CREATED
890943ac app 165 ewr evict complete 1 total, 1 passing 0 1h40m ago
b3aa6eb6 app 165 ewr evict complete 1 total, 1 passing 0 2022-09-23T02:38:54Z
Any update here? Our primary db is in EWR, and still running, but the increased roundtrip latency is not great right now between our app and db when running write heavy loads.
Also, @kurt I love fly but I’m taking a ton of heat from repeated outages where the status page never gets updated or gets updated very late so it’s making me look lost. I know it’s probably not malicious on your end but it feels like a trend now where the status page just isn’t up to speed, it’s been over 12 hours since my apps in EWR started crying for help.
I’m going to update the status page right now. The reason you’ve seen us waffling about this is that EWR isn’t really having an outage so much as that it’s been at/near capacity (we have some customers that really hammer it with hundreds of apps). We’re done waffling, a problem is a problem, we’ll keep you posted.