On Tue, Nov 19, 2002 at 09:39:13AM +0000, Matt Sergeant wrote: > The spammers have. An even better way they've found is to include a > snippet from a legit mailing list, but put it in a white text on white > background box. This was discussed on the spambayes mailing list.
Now, I am not a statistician but I am a mathematician. If my understanding of Bayesian statistics (and if people actually are being accurate when they call this method Bayesian), this shouldn't matter at all - that's the beauty of the process. If the Bayseian analysis actaully takes into account the joint and conditional densities of word frequency, and it has a reasonable way to assign an expectation to them (ie, if the corpus is seeded with real-non spam and real spam), the fact that a spam has been seeded with real words should show up in the joint and conditional frequency analysis. This would allow the filter to assign a spam score, though perhaps with a smaller confidence interval. Of course spammer can get around this by not writing spammy emails, but that's always been (and always will) be a sure-fire way of getting around content-oriented filtering. -- Ross Vandegrift [EMAIL PROTECTED] A Pope has a Water Cannon. It is a Water Cannon. He fires Holy-Water from it. It is a Holy-Water Cannon. He Blesses it. It is a Holy Holy-Water Cannon. He Blesses the Hell out of it. It is a Wholly Holy Holy-Water Cannon. He has it pierced. It is a Holey Wholly Holy Holy-Water Cannon. He makes it official. It is a Cannon Holey Wholly Holy Holy-Water Cannon. Batman and Robin arrive. He shoots them. ------------------------------------------------------- This sf.net email is sponsored by: To learn the basics of securing your web site with SSL, click here to get a FREE TRIAL of a Thawte Server Certificate: http://www.gothawte.com/rd524.html _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk