So this morning I got an alarm about increased latency from my application:
I tried visiting the fly.io website and it seemed to be down as well.
The status page worked and I got this:
The application was only running in the
iad region which is probably the reason why it went down.
That being said it would be nice to have advanced notice of “scheduled maintenance” before an application goes actually down.
What worries me as well is that the issue on the status page started at 00:07 CST, but increased latency started actually more than an hour earlier at 22:50 pm CST.
Can you confirm that simply scaling the application to 3 or more instances across different regions would have avoided this issue? Will the fly proxy automatically stop serving instances with increased latency or that are down?