On 6/2/13 9:12 AM, Wietse Venema wrote:

> For example, periodically send email to mailboxname+timest...@example.com,
> and parse the "to=<mailboxname+timest...@example.com>" and
> "status=delivered" out of the logfile record stream.

Or, even better, try to retrieve those messages from the actual mailbox
using POP3. That way you've simulated everything a user does, end to end.

This is not hard to do. Using the Perl modules Net::SMTP and Net::POP3,
it's easy to write a Nagios/Icinga check that operates thusly:

 1. Using POP3, check for a message with a timestamp in the
    subject that indicates that it was sent in the last X
    minutes (where X is the maximum delay you're willing to
    tolerate). If not found, the result will be CRITICAL.

 2. Using POP3, delete every message in the mailbox.

 3. Using SMTP, send a new message to the mailbox with a
    timestamp in the subject.

That's all it takes. Each time the check runs, (1) should find the
message sent by (3) of the previous check. The "timestamp in the
subject" can be as simple as the epoch seconds.

As I wrote on this list 11 years ago(!), "In general, the thing I've
learned about monitoring is that when possible, check the system by
using it, rather [than] looking for changes in side effects (such as
logs, or number of processes running, etc.). That's not to say that the
other things can't give useful information, too, but if you can check
the system by using it, you don't have to worry so much about whether
you've added a regexp for every possible log entry and so forth."

-- 
Robert L Mathews, Tiger Technologies, http://www.tigertech.net/

Reply via email to