Hi, Why isn't the mod_gearman the #1 hit for "nagios latency" or "nagios scaling"? It should be. Seems to be common knowledge that Nagios can only handle a certain number of hosts and services but I had never heard "just install mod_gearman and the limits are *much* higher."
We knew our Nagios server had too many hosts and services (around 520 and 11000) and other fires had been delaying a split. Splitting was going to be a pain. Then one of us stumbled upon mod_gearman, an NEB module which distributes checks via Gearman. We spent about 1.5 hours installing Gearman and mod_gearman on our dev Nagios server (and nowhere else) and our latency instantly went from way-too-high to essentially 0. Load average stayed about the same as before. We spent 30 minutes putting it all in Puppet and on our production Nagios box with the same results. "Way-too-high" latency for us was as bad as 5 minutes which is terrible when it takes 3-4 checks before a notification is sent. It is a silver bullet! It's amazing. Wow! At least, we haven't had any problems after 2 days of it being in production. What could go wrong? :) mod_gearman: http://labs.consol.de/lang/de/nagios/mod-gearman/ Gearman: http://danga.com/gearman/ _______________________________________________ Tech mailing list Tech@lists.lopsa.org https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech This list provided by the League of Professional System Administrators http://lopsa.org/