For about 5 minutes, our Fly Machines were not responding and our team was unable to access the Fly dashboard. Our health checks failed repeatedly and requests timed out.
Everything has been running fine since that incident.
I can’t find any reports on this. Did anyone else experience the same issue? Could this be limited to our Fly Machines, and if so, how can we prevent it in the future?
I believe that’d explain some of the machine connectivity errors you saw. As for the dashboard errors, if you were hitting an impacted edge in LHR (or one of the other regions), it’s possible the same issue also affected routing to the main fly app.