I've been thinking about this some more after some internal discussion, and one idea is to have our Zenoss monitoring server monitor only our policy servers. Right now I have two policy servers, with three more to be built, which will give us a separate policy server for each of the "runtimes" in our environment (e.g. dev, test, gold).

Since I don't have any way to directly access the contents of the tcdb files (namely cf_LastSeen.tcdb), I had the thought of having each policy server generate something like a lastseen flat file of all hosts not seen recently (say, in the last 10-15 minutes). However, the hostsseen() function operates at a granularity of hours, so the best I could do would be hostsseen("1", "notseen", "name") to generate a list of hosts not seen in the past hour, and even one hour is too long to wait.

I'm thinking of submitting an RFE along the lines of increasing the granularity of hostsseen(), or providing another function to avoid breaking compatibility, but I first wanted to ask if anyone has any further suggestions based on what I've discussed here.

One further point: apparently it would take a custom script to have Zenoss determine whether there are more than 2 cf-execd processes (including the need to run ps -fz global on Solaris global zones to prevent it from also seeing any non-global zones' processes). I don't like the thought of having our Zenoss server run an external script against hundreds of systems, since the extra load and time to complete could be significant.

Thanks,
Justin

-----Original Message-----
From: help-cfengine-boun...@cfengine.org [mailto:help-cfengine-boun...@cfengine.org] On Behalf Of Justin Lloyd
Sent: Friday, April 30, 2010 12:48 PM
To: fo...@cfengine.com; help-cfengine@cfengine.org
Subject: RE: Cfengine Help: Re: Monitoring Cfengine processes

Thanks for the responses everyone. I'm going to discuss the options with my colleagues and our operations center that manages our Zenoss configuration.

Justin

-----Original Message-----
From: help-cfengine-boun...@cfengine.org [mailto:help-cfengine-boun...@cfengine.org] On Behalf Of fo...@cfengine.com
Sent: Wednesday, April 28, 2010 10:53 AM
To: help-cfengine@cfengine.org
Subject: Cfengine Help: Re: Monitoring Cfengine processes

Forum: Cfengine Help
Subject: Re: Monitoring Cfengine processes
Author: neilhwatson
Link to topic: https://cfengine.com/forum/read.php?3,17005,17021#msg-17021

If the host is up and the CF policy is sufficiently robust then no serious monitoring should be needed. A simple TCP port check would be a nice to have but make sure that its threshold allows for Cfengine to recover before alarms go off.

_______________________________________________
Help-cfengine mailing list
Help-cfengine@cfengine.org
https://cfengine.org/mailman/listinfo/help-cfengine