RE: Rules report

2007-04-19 Thread Bowie Bailey
Matt Kettler wrote: > > If you want a "top x rules" list, sa-stats can do that for you: > > http://www.rulesemporium.com/programs/sa-stats.txt > > It will parse a spamd logfile and report the most-frequently used spam > and nonspam rules (and you can configure how many it will list for > each)

Re: Rules report

2007-04-19 Thread Craig Carriere
I utilize amavisd-maia (Maia Mailguard) which provides updated rules stats.  The program also provides an easy method to constantly train your bayes filters.  You might want to take a look at it. Best Robert Fitzpatrick wrote: On Thu, 2007-04-19 at 15:03 +0100, Chris Lear wrote:

Re: Rules report

2007-04-19 Thread Robert Fitzpatrick
On Thu, 2007-04-19 at 15:03 +0100, Chris Lear wrote: > * Matt Kettler wrote (19/04/07 14:49): > > If you want to know how accurate a particular rule is, by comparing the > > spam vs nonspam hit rates, those stats are useless, because of the bias. > > You need a manually sorted corpus to get this k

Re: Rules report

2007-04-19 Thread Chris Lear
* Matt Kettler wrote (19/04/07 14:49): Matt Kettler wrote: If you try to build it off a live feed and use SA's marking as the spam criteria, your statistics are useless. Any rule with a high enough score would get "perfect" results.. all the mail it matched would be spam, and no nonspam. You hav

Re: Rules report

2007-04-19 Thread Matt Kettler
Matt Kettler wrote: > If you try to build it off a live feed and use SA's marking as the spam > criteria, your statistics are useless. Any rule with a high enough score > would get "perfect" results.. all the mail it matched would be spam, and > no nonspam. You have, essentially, created a "self fu

Re: Rules report

2007-04-19 Thread Matt Kettler
Robert Fitzpatrick wrote: > I've seen some others on the list here show reports of the different > rules and how much they hit. Most of them are quoting the ones out of the official ruleset mass-check results. Those are in the tarball under the rules directory as STATISTICS*.txt > How can I produc

Rules report

2007-04-19 Thread Robert Fitzpatrick
I've seen some others on the list here show reports of the different rules and how much they hit. How can I produce these reports? And is it possible to produce a report like this by domain name? -- Robert