Hey, we’re trying to understand why we’re having 502s. We have metrics from Fly about it, and we think logs too, but we’re not sure why. Here’s an example:
Questions:
- Why is the log output messed up with leading
error.message
that breaks the format? Is that normal? - For
timed out while connecting to instance
what is the timeout limit? Does it retry? How many times? - Should request.id be used for request tracing?
- I see logs with app/runner/proxy following by an ID (?) in brackets. Does the ID relate logs between proxy/runner/app?
- Is there a correspondence of one 502 response for every error see in these logs? Meaning here we’d expect 3 502s to be recorded by metrics (yet our metrics suggest there was only one 502 so I’m confused)
- While the app was booting around the 17:53:22-53 time range we got an error. Is that related or coincidence? If it’s related, how is a restart/deployment supposed to be made in a flawless way? Is it even possible?
Thanks in advance for helping answer these questions!