Re: SA bayes filter learns ham but no spam

Tom Hendrikx Wed, 17 Jun 2015 14:43:01 -0700


On 17-06-15 20:00, Dieter Scholz wrote:
> Hello,
> 
>>> My problem is: The bayes filter does (auto-)learn ham mails but no
>>> spam mails. In my logs I found spam mails that have a very high score
>>> and should be autolearned. I think my bayes setup is correct, because
>>> ham mails are learned as expected.
>> Autolearning is based on a different score to the one used for
>> classification. It uses the rule scores that would be used if Bayes
>> were disabled and ignores some type of rule altogether. For spam you
>> need 3 points from *both* the headers and the body.
>>
>> It probably is working correctly, as far as I can tell you haven't
>> actually established that no spam is being autolearned.
> 
> Ok, there's a different score - I understand. But what worries me is,
> that I found mails in the log with a really high spam score and lots of
> rule matches. For all the (SA) marked spam mails I do not get a line in
> the logs with 'autolearn=no' as is the case for all ham mails. My mail
> server is now running for a week with hundreds of ham mails in the
> filter but no spams. It's a company server with 300 users and a long
> established MX record. So for me it is hard to imagine that there is no
> spam mail suitable for filter learning.
> 
> I suspect the milter style of integration reject leads to a situation in
> which the spam learning part is suppressed. So there's no message about
> the mail being learned or not in the logs.
> 
> What do you think? Any chance to debug this?
> 
> Dieter



As far as I can see, you don't really know whether your setup is
actually learning spam or not. The logs just don't mention anything
useful about spam email. Could you check the bayes stats so we actually
know if there is a problem?

Please show us output of "sa-learn --dump magic", probably ran as the
user amavisd user.

Regards,
        Tom

Re: SA bayes filter learns ham but no spam

Reply via email to