Am Montag, 14. September 2009 11:36:45 schrieb Erik Hensema / HostingXS: > Hi everybody, > > I'm in the process of deploying my first pacemaker cluster. The basic setup > seems fine. > > But of course I want to monitor the cluster for intermittent failures. If a > resource crashes and is restarted automatically, I want to know about it. > Especially when it's repeatedly restarted. > > Which brings me to the question how to monitor a cluster? Without a human > tailing syslogs 24x7, that is ;-) > > I'm already running nagios, so it would be preferable to integrate the > monitoring in nagios.
pacemaker-mgmt includes a nice SNMP subagent the connects to net-snmp. It lists the state of all resoures and especially knows abount all failcounters. nagios can use check_snmp to check failcounters. You also could sum about all failcounters. Greetings, -- Dr. Michael Schwartzkopff MultiNET Services GmbH Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany Tel: +49 - 89 - 45 69 11 0 Fax: +49 - 89 - 45 69 11 21 mob: +49 - 174 - 343 28 75 mail: mi...@multinet.de web: www.multinet.de Sitz der Gesellschaft: 85630 Grasbrunn Registergericht: Amtsgericht München HRB 114375 Geschäftsführer: Günter Jurgeneit, Hubert Martens --- PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B Skype: misch42 _______________________________________________ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker