Hello! I recently ran into an issue with one of my apps - I think a problem with the underlying machine and/or volume is responsible, and I’d like to find out what my options are (if any) in terms of data recovery.
The symptoms I ran into were:
- Trying to hit the application was resulting in 504s.
- `fly deploy` failed to complete due to timeouts (`fly deploy` printed repeated 408 errors). I did a `fly machine destroy` and retried to force creation of a new machine, but this had the same problem, leaving machines in a `created` state.
- Running `fly machine run node:22.10.0-slim --shell -v $VOL_ID:/data` to try to create a machine by hand times out, with the same “machine stuck in `created`” behavior. Using `fly machine run node:22.10.0-slim --shell` to create a machine without attaching a volume, or creating a machine and attaching a brand-new volume I just created, does work.
- There are multiple other volumes for the app stuck in an `enabling_remote_export` state - I’m guessing these correspond to snapshot creation.
- The volume doesn’t seem to have any snapshots - it looks like the automatic snapshotting will still delete snapshots, even if new ones aren’t getting successfully created.
- Forking the volume did create a new volume, but it’s stuck in an `enabling_remote_export` state.

I fortunately have a three-month-old backup, so not all is lost. Sadly, my backup process ran on a scheduled Fly machine that decided to just…stop running its daily schedule three months ago.
I did read the warning about using volumes, and I’m wishing I’d had better monitoring (guess what I set up yesterday!), but I’m hoping folks here can give me some ideas on anything I could try to recover my data. Thanks!
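
For anyone in a similar spot: the monitoring gap here was that nothing noticed the backups had stopped arriving. Below is a minimal sketch of a freshness check, assuming backups land as files in a single directory. The function names, the directory layout, and the 36-hour threshold are all hypothetical and only illustrate the idea; this is not a description of my actual setup.

```python
import time
from pathlib import Path


def newest_backup_age_hours(backup_dir, now=None):
    """Return the age in hours of the most recently modified file in
    backup_dir, or None if the directory contains no files."""
    now = time.time() if now is None else now
    mtimes = [p.stat().st_mtime for p in Path(backup_dir).iterdir() if p.is_file()]
    if not mtimes:
        return None
    return (now - max(mtimes)) / 3600.0


def backup_is_stale(backup_dir, max_age_hours=36.0, now=None):
    """Return True if there is no backup at all, or if the newest one is
    older than max_age_hours (hypothetical default: 36h for a daily job)."""
    age = newest_backup_age_hours(backup_dir, now=now)
    return age is None or age > max_age_hours
```

A check like this probably belongs somewhere other than the machine that runs the backup job, since in my case it was the scheduled machine itself that silently stopped.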