Hello,
My problem is that the Bayes filter auto-learns ham mails but no
spam mails. In my logs I found spam mails that have a very high score
and should be autolearned. I think my Bayes setup is correct, because
ham mails are learned as expected.
Autolearning is based on a different score from the one used for
classification. It uses the rule scores that would be used if Bayes
were disabled, and it ignores some types of rules altogether. For spam
you need 3 points from *both* the headers and the body.
It is probably working correctly; as far as I can tell, you haven't
actually established that no spam is being autolearned.
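To illustrate why a high classification score alone does not guarantee
autolearning, here is a rough Python sketch of the decision described
above. It is a simplification, not SpamAssassin's actual code: the names
Hit and autolearn_decision are made up for this example, the rule names
and scores in the usage lines are invented, and the thresholds of 0.1 and
12.0 are assumed to be the stock values of
bayes_auto_learn_threshold_nonspam and bayes_auto_learn_threshold_spam.

```python
from dataclasses import dataclass

# Assumed stock defaults; check your own configuration.
HAM_THRESHOLD = 0.1     # below this autolearn score: learn as ham
SPAM_THRESHOLD = 12.0   # at or above this autolearn score: candidate for spam
MIN_PART_POINTS = 3.0   # spam needs this much from headers AND from body


@dataclass
class Hit:
    """One rule hit (hypothetical structure for this sketch)."""
    name: str
    score: float         # score taken from the non-Bayes score set
    area: str            # "header" or "body"
    counts: bool = True  # False for rules that autolearning ignores


def autolearn_decision(hits):
    """Return 'ham', 'spam' or 'no' for a list of rule hits."""
    counted = [h for h in hits if h.counts]
    total = sum(h.score for h in counted)
    header = sum(h.score for h in counted if h.area == "header")
    body = sum(h.score for h in counted if h.area == "body")

    if total < HAM_THRESHOLD:
        return "ham"
    if (total >= SPAM_THRESHOLD
            and header >= MIN_PART_POINTS
            and body >= MIN_PART_POINTS):
        return "spam"
    return "no"


# A mail that scores high overall but almost only on header rules.
header_heavy = [
    Hit("EXAMPLE_HEADER_RULE", 15.0, "header"),
    Hit("EXAMPLE_BODY_RULE", 1.5, "body"),
    Hit("EXAMPLE_BAYES_RULE", 3.5, "body", counts=False),
]
print(autolearn_decision(header_heavy))
```

In this sketch the header-heavy mail is not autolearned as spam even
though its total is well above the threshold, because the body rules
contribute fewer than 3 points.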
OK, there's a different score - I understand. But what worries me is
that I found mails in the log with a really high spam score and lots of
rule matches. For the mails that SA marks as spam, I do not even get a
line in the logs with 'autolearn=no', as I do for all ham mails. My mail
server has now been running for a week, with hundreds of ham mails in
the filter but no spam. It's a company server with 300 users and a
long-established MX record. So it is hard for me to imagine that there
is no spam mail suitable for filter learning.
I suspect that the milter-style integration, which rejects spam, leads
to a situation in which the spam learning part is suppressed. That
would explain why there's no message in the logs about the mail being
learned or not.
What do you think? Any chance to debug this?
Dieter