> On Thu, Feb 10, 2005 at 11:48:18AM +0100, Sander Holthaus - 
> Orange XL wrote:
> > Your (mail)logs might come in handy for this, if you write out 
> > SpamAssassin's basic output there. With a basic Perl-script 
> (you can 
> > do this in almost any other script-language of course) you can see 
> > most likely everything you need. Spam, ham and mail-scores, 
> > scan-times, tests that where hit (!), etc. With only a small bit of 
> > programming, you can calculate and see everything you need! 
> You should 
> > check wat AWL and BAYES -tests are doing, especially if 
> they hit on Spam.
> 
> True.  Maybe I was to lazy to think about that ;)
> 
> I was looking at the logfile /var/log/mail.info which shows 
> which rules were used, but not with the individual values e.g.  
> 
> Feb 10 14:42:44 mail1 spamd[16031]: result: . -2 - 
> AWL,BAYES_20,DRUG_ED_CAPS,HTML_MESSAGE
> scantime=0.1,size=3491,mid=<01E4C22DDCD5E94DAC1863202903F26809
[EMAIL PROTECTED]>,
> bayes=0.0983660349113599,autolearn=disabled
> 
> But in exim's rejectlogs the full spamreport appears.

Well, I didn't get to it either until recently. I think there are not too
many who automate analysis of spamassassin output. While it is quite handy.
>From looking at the entry above, I think a few changes could be made to your
setup. Indeed you appear to have a problem with AWL, it shouldn't hit on
spam. But it think it is more likely to be related to the fact that messages
which are spam aren't getting enough hitpoints to be seen as spam.
Bayes_20 is also quite low (but not that unusual) for a spam-mail, not to
mention that only two other rules hit on the message. Do you perform any
networks-tests? (Pyzor, Razor, DCC, URIDNSBL)

> 
> > When I upgraded, (2.64 > 3.02) I noticed only a small increase in 
> > scores for spam and decrease for ham from SpamAssassin. Not the big 
> > results I had hoped for, but I'll patiently wait for 3.1. Overall 
> > results are slightly better, and technically, there should 
> be a lower 
> > possiblility of ham being marked as spam (due to 
> SPF-checking, did you install that?).
> 
> No, I did not install SPF-checking. I will have to read up about it.  

It is a nice addition, though not widely implemented (most major
webmail-providers use SPF nowadays, but many medium- and small
ISP's/webmail-providers don't). http://spf.pobox.com will tell you what it
is.

> > As to your setup. How up to date are those extra custom rules? 
> 
> A few days ago.

That's good. No problem there.

> > Any reason
> > why your are using 70_sare_html2.cf and 70_sare_html3.cf but not 
> > 70_sare_header0, cf70_sare_header1.cf, 70_sare_genlsubj0.cf, 
> > 70_sare_genlsubj1.cf, etc, etc...?
> 
> I did not know about them. 

Check out www.rulesemporium.com You will find all available rules,
descriptions and hints how to use them. There are also links to none
sare-rules, which can give excellent results too (e.g. chickenpox, weeds /
weeds2 and mangeled to name just a few).

> > There are more effective rules out there than just 
> sare_html or just 
> > sare rules!
> 
> > I use most of the Sare-rules + some extra rules, and 
> results are very 
> > good (though watch your memory and scantimes!). Have yet to see a 
> > false positive with a treshold of 9, and only 1-2% of all 
> traffic scores between 5 and 9.
> 
> I have tried now to download them with rule_du_jour and it 
> ends with an error:
> 
> 70_sare_bayes_poison_nxm.cf was up to date [skipped 
> downloading of 
> http://www.rulesemporium.com/rules/70_sare_bayes_poison_nxm.cf ] ...
> 
> No index found for ruleset named SARE_GENLSUBJ2.  Check that 
> this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ2.  Check that 
> this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ3.  Check that 
> this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ_ARC.  Check 
> that this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ_ENG.  Check 
> that this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ.  Check that 
> this ruleset is still valid.
> No files updated; No restart required.
> 
> 
> 
> 
> 
> Rules Du Jour Run Summary:RulesDuJour Run Summary on archive3:
> 
> No index found for ruleset named SARE_GENLSUBJ2.  Check that 
> this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ2.  Check that 
> this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ3.  Check that 
> this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ_ARC.  Check 
> that this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ_ENG.  Check 
> that this ruleset is still valid.
> 
> No index found for ruleset named SARE_GENLSUBJ.  Check that 
> this ruleset is still valid.

I'm not usung rules_du_jour myself, but it may be that the
nameing-convention or url of those rules has changed. You might want to
update Rules_du_jour if it is not a recent version. But I'm too unfamiliar
with rules_du_jour to give you a good/exact answer.
As a sidenote, I wouldn't start by installing any of the sare _2 and _3
rules.

Kind Regards and Success!
Sander Holthaus

Reply via email to