Simon Byrnand wrote:
Hi All,
After going from 2.64 to 3.0.3 I thought Bayes was working much better - previously certain classes of spam were being consistently reported as ham, scoring BAYES_00 no matter what I did, or how much manual training I did. (Autolearning enabled)
After upgrading to 3.0.3 and clearing the Bayes database everything seemed fine for a week or so, now it's back to its old habits :(
Particularly frustrating is the complete inability of sa-learn to correct the thinking of Bayes - all the recent flood of German spams are scoring BAYES_00, and DESPITE the fact that I have manually learnt well over two dozen of these as spam (which includes all the variations of them I've seen so far) new copies of identical spams STILL score BAYES_00. WHY ?
If the autolearn system can't be overridden with some manual learning, it makes it more of less useless :(
A few other spams that were previously getting BAYES_99 are now down to BAYES_00 for no apparent reason. It's highly unlikely that they were autolearnt as ham, as they hit several other tests too. It seems that Bayes is still exploitable... :(
Any suggestions ?
Regards, Simon
Clear your bayes database and start all over again. Switch off auto-learning and rely purely on manual learning in a feedback loop. Grab a mail box of known ham and another folder of known spam. Preferably use a thousand of each. If you ever switch on autolearning again. Set the treshold at -0.2 for ham and 10 or 15 for spam.
Enable network tests, razor2, pyzor and dcc work wonders on the site I administer.
Good luck,
Jo