We’re still operating a small-scale service, so our production database runs on shared-cpu-2x.
When I run fly pg backup restore in this state, the restore never completes; it times out waiting for Postgres to exit recovery mode, as shown below:
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info]panic: failed to handle remote restore: failed to monitor recovery mode: timed out waiting for PG to exit recovery mode
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info]goroutine 1 [running]:
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info]main.panicHandler({0xa0d180, 0xc00018a0b0})
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info] /go/src/github.com/fly-apps/fly-postgres/cmd/start/main.go:190 +0x4a
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info]main.main()
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info] /go/src/github.com/fly-apps/fly-postgres/cmd/start/main.go:67 +0xe65
2025-09-26T08:35:43Z app[0802d76a3554d8] nrt [info] INFO Main child exited normally with code: 2
Because the restore VM defaults to the same shared-cpu-2x class as the production database, it quickly gets CPU-throttled during WAL replay, which is why the recovery-mode wait times out.
It would be very helpful if we could specify --vm-size (for example, performance-1x) when invoking fly pg backup restore, so that restores can run on a more powerful instance without being constrained by the production VM’s size.
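As a rough sketch of what we have in mind (the --vm-size flag does not exist today, and the other arguments are elided because this is only an illustration of the proposal, with performance-1x as one example value):

    fly pg backup restore ... --vm-size performance-1x

If the flag were omitted, the restore could keep defaulting to the source app's VM size, so existing behavior would stay unchanged.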
