Re: [SAtalk] question about the number of tokens analyzed in Bayes

2003-10-10 Thread Justin Mason
=?iso-8859-1?Q?Jean-S=E9bastien_Guay-Leroux?= writes: > What is the reason for Bayes in spamassassin to use the 150 most significant > tokens in a email if Paul Graham mentions that you only should use the > fifteen most significant ? It got better results in empirical testing. Check back throug

[SAtalk] question about the number of tokens analyzed in Bayes

2003-10-10 Thread Jean-Sébastien Guay-Leroux
What is the reason for Bayes in spamassassin to use the 150 most significant tokens in a email if Paul Graham mentions that you only should use the fifteen most significant ?   Quote from Paul Graham : “Fourth, they calculated probabilities differently. They used all the tokens, whereas