Metrics oddness

OldhamMade · June 21, 2021, 7:13pm

FYI, noticed an odd error when viewing metrics for 2 days:

kurt · June 22, 2021, 3:43am

This is a weird bug we’ve noticed when VMs restart in a burst. It resets counters and the way we push metrics seems to handle that badly. @steveberryman would know what’s up, I think.

steveberryman · June 22, 2021, 6:16pm

This is odd! I’ll look into it.

OldhamMade · June 24, 2021, 7:48am

Seems that, regardless of app (I have several), the metrics displays for 1 and 2 days have a single, huge spike.

Once this is fixed, would it be possible to get a 7-day view also?

kurt · June 25, 2021, 1:25am

I think @jerome and @steveberryman figured this out, and then fixed it. When we deploy our load balancers, we bring up a new instance and keep the old one running for a very long time while connections drain. During deploys, two or more instances were sending metrics with the same name + labels, so the values would flap between 0 and “very high” for a couple of minutes.

They fixed it by adding a proxy_id label to dedupe them.
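
For anyone curious why a shared name + labels produces that kind of spike, here is a minimal sketch, not the actual pipeline. The metric name `requests_total`, the `app` label, the sample values, and the reset-handling rule are all assumptions made up for illustration; only the `proxy_id` label comes from the fix described above.

```python
from collections import defaultdict

def series_key(name, labels):
    """A series is identified by its metric name plus its sorted label set."""
    return (name, tuple(sorted(labels.items())))

def total_increase(samples):
    """Sum counter increases, treating any drop as a counter reset
    (after a drop, the new value itself is counted as the increase)."""
    total = 0
    for prev, cur in zip(samples, samples[1:]):
        total += cur - prev if cur >= prev else cur
    return total

# Hypothetical pushes during a deploy: the old instance has a large counter,
# the new one starts from zero, and their samples arrive interleaved.
pushes = [
    ("old", 1_000_000), ("new", 0),
    ("old", 1_000_005), ("new", 3),
    ("old", 1_000_010), ("new", 7),
]

def collect(pushes, with_proxy_id):
    """Group pushed values into series, optionally adding a proxy_id label."""
    store = defaultdict(list)
    for instance, value in pushes:
        labels = {"app": "example"}  # assumed label, for illustration only
        if with_proxy_id:
            labels["proxy_id"] = instance
        store[series_key("requests_total", labels)].append(value)
    return store

colliding = collect(pushes, with_proxy_id=False)  # one series, values flap
deduped = collect(pushes, with_proxy_id=True)     # one series per instance

colliding_increase = sum(total_increase(s) for s in colliding.values())
deduped_increase = sum(total_increase(s) for s in deduped.values())

print(len(colliding), colliding_increase)
print(len(deduped), deduped_increase)
```

Without the extra label, both instances write into one series, so every jump from the small counter back to the large one looks like a huge increase. With `proxy_id`, each instance gets its own monotonic series and the totals stay small.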
