I am setting up an uptime check in Google Cloud Monitoring. When I add the check, I also create an alert policy. I set the check to global, which means it will check from 6 different locations, and I set the check frequency.
The alerting policy that was created (in the Cloud Console, when I created the uptime check) has the rolling window set to 20 minutes and the rolling window function set to "next older".
I am trying to understand these settings:
- Rolling window 20 min: Does that mean I will get one alert every 20 minutes, or what is affected by this 20-minute window?
- next older: Does this mean it somehow uses data from the previous 20 minute slot? I don't understand this one.
Finally, under Configure trigger there is also:
- Retest window: Does it do any retest in that period or does it just hold off on alerting for that amount of time to confirm it did not resolve?
The official documentation provides two different examples that describe when the condition is met. The following content is taken from the official docs:
Rolling window:
When using a rolling time window (or real time window), the metrics are calculated or analyzed for a particular alignment period, for example if you consider an alignment period of five minutes, at 1:00 PM, the alignment period contains the samples received between 12:55 PM and 1:00 PM. At 1:01 PM, the alignment period slides one minute and contains the samples received between 12:56 PM and 1:01 PM.
The first graph illustrates a combined case with an alignment period set to 5 minutes and a duration window of 3 minutes. The example illustrates when the aligned value exceeds the threshold and when the condition is met.
Retest window:
Even though the aligned value is greater than the threshold at time start + 2 minutes, the condition doesn't trigger until the aligned value is greater than the threshold for three minutes. That event occurs at time start + 5 minutes.
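The timing in that description can be reproduced with a small Python sketch. It assumes one aligned value per minute and treats the retest (duration) window as "the aligned value must stay above the threshold for this many minutes"; `first_trigger_minute` and the sample values are made up for illustration and are not part of any Google API.

```python
def first_trigger_minute(aligned_values, threshold, retest_minutes):
    """Return the minute offset at which the condition is first met.

    aligned_values[i] is the aligned value at start + i minutes.
    The condition is met once the value has stayed above `threshold`
    for `retest_minutes` without dropping back. Returns None otherwise.
    """
    violating_since = None
    for minute, value in enumerate(aligned_values):
        if value > threshold:
            if violating_since is None:
                violating_since = minute  # violation starts here
            if minute - violating_since >= retest_minutes:
                return minute
        else:
            violating_since = None  # dropped below threshold, start over
    return None


# Hypothetical aligned values: the threshold (5) is first exceeded at start + 2.
aligned = [1, 2, 6, 7, 8, 9, 9]
print(first_trigger_minute(aligned, threshold=5, retest_minutes=3))
```

With these values the threshold is first exceeded at start + 2 minutes, and the function returns 5, matching the "start + 5 minutes" in the quoted text. In this model nothing is actively retested; the policy simply waits to see whether the violation persists for the whole window.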
Example: