In order to move to a hard non-ok state, a service can require multiple (i.e. 
max_check_attempts) unsuccessful runs of the check plugin.
Is there a way to require multiple successful plugin runs in order to return to 
a (hard) ok state?
I have printers where, if toner is low (not empty), from time to time, the 1104 
entry vanishes from the Printer Alert Table for several seconds, only to return 
at another index shortly after.
As I'm using a combination of SNMP traps and table polling, the trap sent upon 
removal immediately makes my printer-toner service switch to OK, after which it 
takes some time (one trap sent upon reappearance in the table and two polls) to 
return to WARNING. Even if I remove traps from the equation, my table poll may 
happen during the few seconds the 1104 entry is gone.

I do know about flapping, but that's not what I want. I want protection from 
transient success, like I'm protected from transient failures.
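
To make the desired behaviour concrete, here is a minimal Python sketch of the 
asymmetric filter I have in mind. It is not an existing Icinga feature: the 
class name Debounce and the threshold required_oks are made up for 
illustration, and a real wrapper around the check plugin would also have to 
persist this state between runs (e.g. in a file), which is left out here.

```python
from dataclasses import dataclass

# Standard plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


@dataclass
class Debounce:
    """Hypothetical filter: pass non-OK results through unchanged, but only
    report OK after `required_oks` consecutive OK results."""
    required_oks: int = 3   # assumed threshold, analogous to max_check_attempts
    reported: int = OK      # state last reported to the monitoring core
    ok_streak: int = 0      # consecutive raw OK results seen so far

    def update(self, raw: int) -> int:
        if raw != OK:
            # Failures are not filtered here; max_check_attempts already
            # handles transient failures on the core side.
            self.ok_streak = 0
            self.reported = raw
        else:
            self.ok_streak += 1
            # Stay in the old non-OK state until enough OKs have accumulated.
            if self.reported == OK or self.ok_streak >= self.required_oks:
                self.reported = OK
        return self.reported


if __name__ == "__main__":
    d = Debounce(required_oks=3)
    # The 1104 entry vanishes briefly (single OK), then stays away for real.
    for raw in [WARNING, OK, WARNING, OK, OK, OK]:
        print(raw, "->", d.update(raw))
```

With this, a single OK seen while the entry is briefly missing would not change 
the reported state; only the third consecutive OK would.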
_______________________________________________
icinga-users mailing list
icinga-users@lists.icinga.org
https://lists.icinga.org/mailman/listinfo/icinga-users