On Sat, 8 Nov 2003, Terry Milnes wrote: > The bayes filtering works great, but the typical user is not going to > want to jump through what he would consider the huge obstacles to train > a corpus. Furthermore implementing bayes on a system that incorporates > thousands of users can be a daunting task, and isn't even an available > option to some of us.
It is true that Bayes works best if you can customize it on a per-user basis bit that is NOT necessary. It DOES work even when left to run on a site-wide basis with just the training from auto-learn. As an administrator running SA with Bayes site-wide on a system that processes tens of thousands of messages a day for thousands of users with no per-user configs, I know of what I speak. If you cannot do any hand-correcting (re-feed it ham/spam to correct mistakes) you might want to adjust the scores so that just a Bayes score cannot be responsible for the total determination of 'spam'. IE with the default spam threshold ==5 and default Bayes 100% score ==5.4, a mistake in Bayes learning could be soely responsible for a message being marked as 'spam'. So crank up your spam threshold to 6 or so to require some other rules to corroborate the Bayes assessment. Bayes does use up a bit of memory and CPU, but it's small potatoes compaired to some of the add-in rules that have been discussed on this list (Hi Chris ;). So please give me one good reason why you say Bayes: "isn't even an available option to some of us" -- Dave Funk University of Iowa <dbfunk (at) engineering.uiowa.edu> College of Engineering 319/335-5751 FAX: 319/384-0549 1256 Seamans Center Sys_admin/Postmaster/cell_admin Iowa City, IA 52242-1527 #include <std_disclaimer.h> Better is not better, 'standard' is better. B{ ------------------------------------------------------- This SF.Net email sponsored by: ApacheCon 2003, 16-19 November in Las Vegas. Learn firsthand the latest developments in Apache, PHP, Perl, XML, Java, MySQL, WebDAV, and more! http://www.apachecon.com/ _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk