> From: Thomas Cameron [mailto:[EMAIL PROTECTED] > Sent: Thursday, August 25, 2005 6:03 PM > To: users@spamassassin.apache.org > Subject: Re: phish/bayes > > On Thu, 2005-08-25 at 15:49 -0700, satalk (sent by Nabble.com) wrote: > > I could not find any email in this forum addressing this issue - it > > does not mean there is not one - I just could'nt find it :) > > > > MY question is as follows: > > Given that so many valid tokens from ebay/paypal sites > exist in phish > > emails, am I correct in saying that it is imperative to avoid phish > > emails entering the bayes database? > > It has been my experience that the more of them I teach > Bayes, the less get through. None of my legit eBay/PayPal > e-mail has been tagged.
Mine too -- and we likely need to remind the original poster that it is VERY important to also train some VALID emails from the real source that such phishes are targetting. This puts the real mails words in as tokens an means that the words in both types will not be strong indicators of spam (or ham) and other differences will be used to make the estimate. -- Herb Martin