Hi,

This is an observation; please take it in the spirit in which it is intended. It is not meant to be flame bait.

After using SpamAssassin for six solid months, it seems to me that the Bayes process (sa-learn [--spam | --ham]) has only very limited success in learning about new spam. Regardless of how many spams and hams we submit, the effectiveness never rises above the default level, which in our case is somewhere around 2 out of 3 spams correctly identified. By contrast, after adding the third-party rule file airmax.cf, the effectiveness went up to 99 out of 100 spams correctly identified.
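For reference, our training commands look roughly like the following (the mailbox paths are just examples, not our actual layout):

    # feed manually sorted corpora to the Bayes database
    sa-learn --spam /path/to/spam-corpus
    sa-learn --ham  /path/to/ham-corpus

    # show the message and token counts the database has absorbed
    sa-learn --dump magic

The --dump magic output reports how many spam and ham messages and tokens the database has learned, which is how we verify that the training is actually being recorded.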

So far, we have not had a single ham misidentified as spam in over one million messages examined.

Throughout the documentation, there seems to be a bias toward the Bayes filter rather than the rule system. Does anyone on the list have thoughts that would help explain why a single rule appears so successful while a million spams and hams have so little effect?

Thank you,
Jo3
