On 12/04/2013, at 6:52 AM, Keith Ouellette <keith.ouelle...@airgas.com> wrote:

> We am using Nagios for network monitoring and for Pacemaker, we are using 
> check_crm to provide status for nagios. This works very well if the failure 
> is still seen by Pacemaker at the time nagios polls. If the failure 
> condidtion causes a switchover, but the recovery is before the nest Nagios 
> poll, Nagios does not report any issue, because there isn't one at the time 
> of the poll, even if the transition of the resource changed to another node.
>  
> Is there a way to know that the transistion occured without scraping the 
> messages log in a set interval and looking for only new events that state a 
> transition or "Members Joined"/"Members Left". It appears that failcount only 
> increments on a failure, but not on a transition imposed by someone manually 
> puttintg  a node into standby.
>  
> I am just trying to figure out a way to trigger an alarm for Nagios any time 
> resources get moved between nodes, so the cause can be investigated.
>  
> Any ideas?

Run crm_mon in daemon mode and use --external-agent to write the notifications 
to disk somewhere (or push them into nagios)?

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Reply via email to