fly migrate-to-v2: Apps with volumes support šŸŽ‰

The only way I got past this was by running fly scale count 1. It seems the old autoscale commands do nothing.
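For anyone else hitting the same thing, the workaround looked roughly like this (the app name is a placeholder; run migrate-to-v2 from the directory containing your fly.toml):

$ fly scale count 1 -a my-nomad-app   # pin the app to a single, fixed VM count instead of autoscale
$ fly migrate-to-v2                   # then retry the migration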


Hi! This was a bug on the backend, and should be fixed now. Sorry about that!

Is there any guide on how to migrate a nomad database to v2?
My database instance does not have a fly.toml; it was generated when I created my application.

@allison can you help me?

@rodolfosilva You can pull a fly.toml file from a running app with fly config save. That should be enough to get you migrating :slight_smile:
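For example, assuming your database app is named my-db-app (a placeholder):

$ fly config save -a my-db-app   # writes the running app's config to ./fly.toml
$ fly migrate-to-v2              # run from the same directory so it picks up that fly.toml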

Iā€™m trying to migrate my app to the v2 platform using migrate-to-v2.

It seems to have got stuck at

Waiting for nomad allocs for '<app-name>' to be destroyed

It just hangs forever. Any suggestions? The logs arenā€™t giving me much

Update:

It eventually timed out and rolled back:

==> Migrating matchhaus-prod to the V2 platform
>  Locking app to prevent changes during the migration
>  Enabling machine creation on app
>  Creating an app release to register this migration
>  Starting machines
INFO Using wait timeout: 2m0s lease timeout: 13s delay between lease refreshes: 4s
Updating existing machines in 'matchhaus-prod' with rolling strategy
  Finished deploying
>  Scaling nomad VMs down to zero now that machines are running.
Waiting for nomad allocs for 'matchhaus-prod' to be destroyed
failed while migrating: nomad allocs never reached zero, timed out
==> (!) An error has occurred. Attempting to rollback changes...
>  Setting platform version to 'nomad'
>  Successfully recovered

Now, when I attempt to migrate again, it errors out saying it cannot parse the config. The config hasnā€™t changed since the last attempt, and it doesnā€™t give any information about why it canā€™t be parsed.

$ fly migrate-to-v2 -c fly/fly.prod.toml
Error: We had trouble parsing your fly.toml. Please check https://fly.io/docs/reference/configuration for more information.

Iā€™m trying to migrate Redis apps:
flyctl migrate-to-v2 --config ./fly/fly.dev.redis.toml

or

flyctl migrate-to-v2 --config ./fly/fly.staging.redis.toml
and got the message

failed while migrating: unfortunately the worker hosting your volume vol_18l524yj5oj47zmp (redis_server_foodbank_dev) does not have capacity for another volume to support the migration; some other options: 1) try again later and there might be more space on the worker, 2) run a manual migration (see Manual migration to Apps V2), or 3) wait until we support volume migrations across workers (weā€™re working on it!)

Hi! Would you feel comfortable sharing your fly.toml? If not publicly, could you email support@fly.io? Iā€™d like to take a look at why this is happening.


I actually got it working in the end. It seems there was a problem on Flyā€™s end with the fly.toml, because after the failed migration even running fly config save resulted in the ā€œcouldnā€™t parse configā€ error.

I had to do a fly deploy, which seemed to fix the config. I then tried migrate-to-v2 once more and this time it worked. The logs didnā€™t tell me much, but the app is matchhaus-prod if you wanted to look further.
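For anyone following along, the recovery sequence was roughly:

$ fly deploy -c fly/fly.prod.toml          # redeploy, which regenerated a config flyctl could parse
$ fly migrate-to-v2 -c fly/fly.prod.toml   # retry the migration, which then went through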

FWIW the app was crashing earlier today (after months of being super stable), which I solved with another redeploy. Not sure if there have been some other Fly issues today that havenā€™t surfaced.

Yep, youā€™re right. This was a bug on our end that affected some configs, which @kwaw fixed earlier.

# fly.toml file generated for fly-foodbank-dev-redis on 2021-09-16T13:16:30+04:00

app = "fly-foodbank-dev-redis"

kill_signal = "SIGINT"
kill_timeout = 5
processes = []

[env]

[experimental]
  allowed_public_ports = []
  auto_rollback = true

[[mounts]]
  source      = "redis_server_foodbank_dev"
  destination = "/data"

[[services]]
  http_checks = []
  internal_port = 6379
  processes = ["app"]
  protocol = "tcp"
  script_checks = []

  [services.concurrency]
    hard_limit = 25
    soft_limit = 20
    type = "connections"

  [[services.ports]]
    handlers = []
    port = "10000"

  [[services.tcp_checks]]
    grace_period = "1s"
    interval = "15s"
    restart_limit = 6

We have lower than usual capacity in the region your app is deployed in. Iā€™m gonna copy @senyoā€™s great response to someone about this earlier this month:

One way around this is to create a volume in the same region and then do all the migration steps manually.
Otherwise, if youā€™re not in a rush, you can retry the migration periodically. The worker may have capacity at some point in the future and weā€™ll be able to run the migration successfully.
For what itā€™s worth, this issue comes from the way weā€™ve designed volumes: they are tied to a single host. Weā€™ve got ideas to make this work out of the box, so in the future this shouldnā€™t be a problem.
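As a rough sketch of that first manual step (the app name, volume name, region, and size below are placeholders; the full procedure is in the Manual migration to Apps V2 guide):

$ fly volumes list -a my-app                                    # note the region of the existing volume
$ fly volumes create my_data --region ord --size 10 -a my-app   # create the new volume in that same region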

We are still not able to migrate any volume apps in ORD. Any updates?

To migrate when your current host is full, youā€™ll need remote volume forking.
Itā€™s coming along pretty well! We have cross-host volume forking working, but we still have to move state information through our stack so that flyctl can know when the destination volume is fully hydrated (and therefore safe to use). When we have volume forking ready, thatā€™ll get its own Fresh Produce thread, but one of us will also post about it in here too.

It shouldnā€™t be that far out :slight_smile:

I just migrated my app to v2, and it went incredibly smoothly. Itā€™s quite a simple app, but still, good job on making the migration process so straightforward, guys! :clap:


@allison - any update on the remote volume forking?

Thanks!

We have part of it shipped, but weā€™ve been intentionally quiet about it because thereā€™s still an important piece missing. We have a hidden flag --remote-fork for fly vol fork which should (in theory) work, but we still havenā€™t finished getting volume status exposed properly. That part is what tells us when the copy is finished, or when the volume is safe to mount and use. (which means using that flag right now is very at-your-own-risk - be careful!)
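If you want to experiment with it despite that, the shape of the command is something like this (the volume ID is a placeholder, and again: entirely at your own risk until volume status is exposed):

$ fly vol fork vol_xxxxxxxxxxxx --remote-fork   # forks the volume onto another host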

We should have this properly ready soon! Once we can fork and maintain safety, the feature will truly be announced (including being hooked into migrate-to-v2 and all that) :slight_smile:


Thanks, we will stand by for the final version.

I have migrated my app successfully and it has been working well for 4 days now.
When I run fly vol list I get two volumes, one of which has the _machines suffix.

After youā€™ve migrated your app and verified that it works, you can safely delete the old volumes, the ones without the _machines suffix.
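Concretely, that looks something like this (the app name and volume ID are placeholders; take the ID from the row without the _machines suffix):

$ fly volumes list -a my-app             # find the ID of the old volume, the one without _machines
$ fly volumes destroy vol_xxxxxxxxxxxx   # destroy by ID so only that old volume is removed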

When I try to destroy the old volume, I get the following warning:

Warning! Individual volumes are pinned to individual hosts. You should create two or more volumes per application. Deleting this volume will leave you with 0 volume(s) for this application, and it is not reversible.  Learn more at https://fly.io/docs/reference/volumes/

It says 0 volumes, yet the one with the _machines suffix should remain. Iā€™m afraid Iā€™ll lose my data if I press yes, and in the meantime Iā€™m racking up charges for keeping two volumes.
How do I ensure that the volume with _machines suffix remains and I donā€™t lose data?

I wasnā€™t able to migrate a mini-app with a volume:

ID                  	STATE  	NAME	SIZE	REGION	ZONE	ENCRYPTED	ATTACHED VM	CREATED AT
vol_jlgz1vpzl0e478m3	created	perm	10GB	ams   	8aba	true     	abc845a1   	1 year ago

Error: unfortunately the worker hosting your volume vol_jlgz1vpzl0e478m3 (perm) does not have capacity for another volume to support the migration; some other options: 1) try again later and there might be more space on the worker, 2) run a manual migration (see Manual migration to Apps V2), or 3) wait until we support volume migrations across workers (weā€™re working on it!)

Seems Iā€™ll just have to wait before moving to v2?

I just attempted to migrate, but got the error Error: failed to create volume: size_gb must be between 1GB and 500GB, despite the fact that my volumes are 50GB each (two volumes).

Iā€™m considering ditching this cluster, creating a new one, and dumping/restoring the data. Is that my best way to go?