On Tue, 25 Nov 2003, Robert Menschel wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hello Aaron,
>
> Tuesday, November 25, 2003, 8:58:58 AM, you wrote:
>
> AY> ... Recently I started getting a lot of false positives with SA 2.60.
> AY> I noticed that all my mail was getting a bayesian score of 99 to
> AY> 100%. ...My best guess is that since the bayes database only holds a
> AY> limited number of tokens, my DB was filling up with spam tokens and
> AY> not enough non-spam tokens. Maybe this happened because I only get
> AY> about 10-20 legitimate emails a week versus about 100+ spam emails a
> AY> day.
>
> In November to date, I've trained my Bayes on 683 ham and 6816 spam.
> Ratio therefore seems to be about the same as yours. I haven't seen any
> evidence of the problem -- Bayes is working wonderfully here.
>
> Bob Menschel
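As a rough illustration of the effect Aaron describes, where spam tokens crowd out non-spam tokens, here is a minimal Python sketch. It is not SpamAssassin's actual Bayes code: the function names, the tiny made-up corpora, and the simple per-token frequency ratio are all assumptions chosen to show the idea.

```python
from collections import Counter


def train(messages):
    """Count, per token, how many messages contain it."""
    counts = Counter()
    for text in messages:
        counts.update(set(text.lower().split()))
    return counts


def token_spam_probability(token, spam_counts, ham_counts, n_spam, n_ham):
    """Simplified per-token estimate: spam frequency / (spam + ham frequency)."""
    spam_freq = spam_counts[token] / n_spam if n_spam else 0.0
    ham_freq = ham_counts[token] / n_ham if n_ham else 0.0
    if spam_freq + ham_freq == 0:
        return 0.5  # token never seen: no evidence either way
    return spam_freq / (spam_freq + ham_freq)


# Hypothetical corpora: lots of chatty spam, very little ham.
spam = ["hi dear friend please help me transfer funds"] * 50
ham = ["meeting notes attached", "build failed on the cluster"]

spam_counts, ham_counts = train(spam), train(ham)
p_hi_poisoned = token_spam_probability(
    "hi", spam_counts, ham_counts, len(spam), len(ham)
)

# Learning a single ham message that uses the same word pulls the estimate back.
ham_balanced = ham + ["hi bob lunch today"]
p_hi_balanced = token_spam_probability(
    "hi", spam_counts, train(ham_balanced), len(spam), len(ham_balanced)
)

print(p_hi_poisoned, p_hi_balanced)
```

In this toy setup an ordinary word that has only ever been seen in spam gets the maximum spam estimate, and one ham message containing it is enough to bring the estimate down noticeably. The real scoring is more involved, but the dependence on what was learned is the same.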
Having had an experience similar to Aaron's, I can believe that he is having problems with a poisoned Bayes database.

For example, suppose that you've received a large number of "Nigerian" spams that were learned as such. That would put spam scores on a large number of conversational words. In a fit of pique, I had tossed a whole bunch of "Nigerian" spams into my Bayes database. It got so bad that a test email that contained only one word ("Hi") got a Bayes 99% spam score. I had to trash the DB and start from scratch.

So the quality of Bayes scoring does depend upon how it is trained. It is a tool, not a magic bullet, and like any tool it can be misused or abused. Spammers seem to be learning this: I'm seeing an increasing number of spams that contain "Bayes poison".

Dave

--
Dave Funk                                  University of Iowa
<dbfunk (at) engineering.uiowa.edu>        College of Engineering
319/335-5751   FAX: 319/384-0549           1256 Seamans Center
Sys_admin/Postmaster/cell_admin            Iowa City, IA 52242-1527
#include <std_disclaimer.h>
Better is not better, 'standard' is better. B{

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk