Re: [SAtalk] Newbie questions again

2002-02-20 Thread Craig Hughes
The spam phrases stuff is calculated statistically based on a largish corpus of spam and nonspam emails (close to 100,000 messages all together, about half of each). You can find (slightly) more details in the wordfreqs directory in CVS -- basically there's a batch run which counts the frequency

[SAtalk] Newbie questions again

2002-02-20 Thread Mike Grau
Hello I am running SpamAssassin as a milter and am very pleased indeed. Can someone give me a brief explaination of the 40_spam_phrases.cf contents? For example, as in "spamphrase 29530 seventh heaven" what are the scores in the second column and how are they determined? Is it common to add you