By chi2 "combining", does that mean it affects the way that messages are tokenized and probabilities inputted into the bayes DB, or is it the way that the end score is calculated from the DB?
Also, are there any downsides to using it? If so, what? Thanks, Matt -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Tuesday, September 30, 2003 2:47 PM To: Matt Tolton Cc: [EMAIL PROTECTED] Subject: Re: [SAtalk] chi2 combining Matt Tolton writes: > 1. I've been trying to gather information about chi2 combining which is provided as > an option in spam assassin. I've searched the list archives, the web, and > newsgroups and haven't come up with much. Could someone please fill me in on what > the difference is here, and what advantages/disadvantages it gives? It's a nifty combining scheme suggested by some folks on the spambayes project, which has some very nice properties in (a) putting messages where the classifier is "mostly sure" right at 0.0 or near 1.0, (b) avoiding "cancellation disease", and (c) still putting mails where it really is "unsure" around 0.5. I can't find the exact discussion now, but it's somewhere around: http://mail.python.org/pipermail/spambayes/2002-September/ http://mail.python.org/pipermail/spambayes/2002-October/ Cancellation disease is covered in http://mail.python.org/pipermail/spambayes/2002-October/001236.html . > 2. Will the ***SPAM*** tag that I have put in the subject line affect > bayesian learning when I use sa-learn? Do I need to take that out? > 3. Does sa-learn take out the spamassassin header information > automatically (I thought I read in the docs that it does, but I can't > seem to find it.), or do I need to specifically filter out those > headers? FAQ: http://spamassassin.taint.org/faq/index.cgi?req=show&file=faq05.002.htp --j. ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk