Weird. Your app has 0 restarts and it says it’s running but it’s “dead” on the dashboard?
Anyway, it seems you’re on the nomad platform, i’ll suggest you migrate to the machines platform.
See this post on how to migrate fly migrate-to-v2 - Automatic migration to Apps V2
- is there a way to configure log-shipper to restart when it dies
After the migration is done, follow this https://fly.io/docs/apps/migrate-to-v2/#make-sure-the-machines-restart-policy-suits-your-app to learn how to configure the restart policy for your machines.
- is there any built-in alerting to detect dead services like this?
Not that i’m aware of. But, the machines platform has a public api that you can use to setup your own uptime checks. Working with the Machines API · Fly Docs
- does anyone have any tips for diagnosing an outage like this? The logs stored on fly (e.g.
flyctl logs
) do not go back far enough to cover the incident.
If you share the name of your log shipper app, maybe i can check and see if i’ll see anything weird. I can’t promise i’ll find anything, but i can look