On Sat, 8 Nov 2003, Terry Milnes wrote:

> The bayes filtering works great, but the typical user is not going to
> want to jump through what he would consider the huge obstacles to train
> a corpus. Furthermore implementing bayes on a system that incorporates
> thousands of users can be a daunting task, and isn't even an available
> option to some of us.

It is true that Bayes works best if you can customize it on a per-user
basis, but that is NOT necessary. It DOES work even when left to run
on a site-wide basis with just the training from auto-learn.

As an administrator running SA with Bayes site-wide on a system that
processes tens of thousands of messages a day for thousands of users
with no per-user configs, I know of what I speak.

If you cannot do any hand-correcting (re-feeding it ham/spam to correct
mistakes), you might want to adjust the scores so that a Bayes score
alone cannot be responsible for the total determination of 'spam'.
I.e., with the default spam threshold of 5 and the default Bayes 100%
score of 5.4, a mistake in Bayes learning could be solely responsible
for a message being marked as 'spam'.
So crank up your spam threshold to 6 or so to require some other rules
to corroborate the Bayes assessment.
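
To make the arithmetic concrete, here is a rough sketch in Python (not
SpamAssassin code). It assumes a message is tagged when its summed rule
scores reach the threshold. Only the 5.4 score and the thresholds of 5
and 6 come from the text above; the names BAYES_TOP and OTHER_RULE and
the 1.0 score are made up for illustration.

```python
# Illustrative only: a toy model of additive rule scoring.
# BAYES_TOP and OTHER_RULE are hypothetical names, not real rule names.
BAYES_TOP_SCORE = 5.4  # the "Bayes 100%" score quoted above


def is_spam(rule_scores, threshold):
    """Return True when the summed rule scores reach the threshold."""
    return sum(rule_scores.values()) >= threshold


# A message that hit nothing but the top Bayes rule.
bayes_only = {"BAYES_TOP": BAYES_TOP_SCORE}

# The same message with one more (hypothetical) 1.0-point rule hit.
bayes_plus_other = {"BAYES_TOP": BAYES_TOP_SCORE, "OTHER_RULE": 1.0}

print(is_spam(bayes_only, 5.0))        # Bayes alone vs. threshold 5
print(is_spam(bayes_only, 6.0))        # Bayes alone vs. threshold 6
print(is_spam(bayes_plus_other, 6.0))  # Bayes plus one other rule vs. 6
```

With the threshold at 5 the Bayes hit alone is enough; at 6 it needs
at least one other rule to agree before the message is tagged.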

Bayes does use up a bit of memory and CPU, but it's small potatoes
compared to some of the add-in rules that have been discussed on
this list (Hi Chris ;).

So please give me one good reason why you say Bayes:
 "isn't even an available option to some of us"

-- 
Dave Funk                                  University of Iowa
<dbfunk (at) engineering.uiowa.edu>        College of Engineering
319/335-5751   FAX: 319/384-0549           1256 Seamans Center
Sys_admin/Postmaster/cell_admin            Iowa City, IA 52242-1527
#include <std_disclaimer.h>
Better is not better, 'standard' is better. B{



_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
