Re: sidewide learning need further informations

Bowie Bailey Wed, 01 Sep 2010 06:10:22 -0700

 On 9/1/2010 5:59 AM, escp wrote:
> Hi,
>
> i use spamassassin with sidewide setup.
>
> i want to know, if i can ONLY learn spam w/o learning ham. I dont want to
> get bad results.
>
> Problem is, that i get a lot of mails every day and cant have a look at all.
> so i only want to train spamassassin with the mails my users want to get rid
> off.


The bayes database (which is what you are teaching with sa-learn) works
with statistics.  It determines if a message is spam by determining if
the tokens (mostly words) in the message appear more frequently in ham
or spam.  If you only teach it spam, then it will start thinking that
everything is spam.  You need to learn both ham and spam for the bayes
engine to work right.  It doesn't have to be 50/50, but it does need a
decent amount of ham.

-- 
Bowie

Re: sidewide learning need further informations

Reply via email to