Matt Kettler wrote:
>
> If you want a "top x rules" list, sa-stats can do that for you:
>
> http://www.rulesemporium.com/programs/sa-stats.txt
>
> It will parse a spamd logfile and report the most-frequently used spam
> and nonspam rules (and you can configure how many it will list for
> each)
I utilize amavisd-maia (Maia Mailguard) which provides updated rules
stats. The program also provides an easy method to constantly train
your bayes filters. You might want to take a look at it.
Best
Robert Fitzpatrick wrote:
On Thu, 2007-04-19 at 15:03 +0100, Chris Lear wrote:
On Thu, 2007-04-19 at 15:03 +0100, Chris Lear wrote:
> * Matt Kettler wrote (19/04/07 14:49):
> > If you want to know how accurate a particular rule is, by comparing the
> > spam vs nonspam hit rates, those stats are useless, because of the bias.
> > You need a manually sorted corpus to get this k
* Matt Kettler wrote (19/04/07 14:49):
Matt Kettler wrote:
If you try to build it off a live feed and use SA's marking as the spam
criteria, your statistics are useless. Any rule with a high enough score
would get "perfect" results.. all the mail it matched would be spam, and
no nonspam. You hav
Matt Kettler wrote:
> If you try to build it off a live feed and use SA's marking as the spam
> criteria, your statistics are useless. Any rule with a high enough score
> would get "perfect" results.. all the mail it matched would be spam, and
> no nonspam. You have, essentially, created a "self fu
Robert Fitzpatrick wrote:
> I've seen some others on the list here show reports of the different
> rules and how much they hit.
Most of them are quoting the ones out of the official ruleset mass-check
results. Those are in the tarball under the rules directory as
STATISTICS*.txt
> How can I produc
I've seen some others on the list here show reports of the different
rules and how much they hit. How can I produce these reports? And is it
possible to produce a report like this by domain name?
--
Robert