Alex Woick wrote:
The proper usage of the Bayes filter is very simple: feed spam as spam and ham as ham. All of your mail. Don't care for content that might be mis-learned in your eyes: it will not be mis-learned. Don't try be smarter than the filter. The only exception is bounce-messages: don't feed them at all.
Why not? I've found that to be a wonderful way to discard third-party backscatter without dropping the legitimate notices regarding mail my customers actually sent (to the wrong place, mind you...)
The only things I don't feed to Bayes are very large messages (since those will fly past SA based on size alone, and the processing time isn't worth the relatively small benefit) and mail that I *can't* identify. (Yes, I *have* met some. O_o Yick.)
Results since I upgraded my personal system to 2.53 (IIRC) and the ISP systems I maintained at the time to 2.54 have been pretty darn good, IMO.
Way better than Postini seems to be doing these days. <shudder> -kgd