This may be kinda simple for you gurus out there, in which case I welcome your feedback and suggestions to make this better. But if anyone finds this useful...great!

I wanted a stats tool that would tell me what rules were hit on the most. Which ones ONLY trigger on spam and which ones ONLY trigger on HAM? I wanted to know what percentage of my HAM was whitelisted. Do I have my rule scores set high or low enough and do I have the required score for the SPAM threshold at the right place? I wanted something that was flexible and powerful. So I thought about ways to get my spamassassin data into mysql. Look at this screenshot and you'll get the idea:

http://pelorus.org/pictures/mailstats.gif

Obviously, with that type of granularity, I could generate any kind of report I wanted. The way I do it is I generate a few custom headers in procmail to make things easier, and I have a couple of special SA headers added, again, to make things easier. Then I pipe a carbon copy of each email through this bash script which parses it and puts all the data into mysql. I just finished it today, so I don't have any pretty charts or anything yet, but I do think it will meet my needs.

I did look at some of the other data collection utilities out there, but I didn't see any that were quite this flexible, if I do say so myself. Perhaps I am mistaken and there is one (or more) that can do what this does and more.

Here's the script, along with many (helpful, I hope) comments.
http://pastebin.com/f743e7daa

Like I said, if any of you smart guys out there see ways to improve this, I sure would appreciate the feedback.

Thanks.

Skip

--
Get my PGP Public key here:
http://pelorus.org/[EMAIL PROTECTED]

Reply via email to