On 9/1/2010 5:59 AM, escp wrote: > Hi, > > i use spamassassin with sidewide setup. > > i want to know, if i can ONLY learn spam w/o learning ham. I dont want to > get bad results. > > Problem is, that i get a lot of mails every day and cant have a look at all. > so i only want to train spamassassin with the mails my users want to get rid > off.
The bayes database (which is what you are teaching with sa-learn) works with statistics. It determines if a message is spam by determining if the tokens (mostly words) in the message appear more frequently in ham or spam. If you only teach it spam, then it will start thinking that everything is spam. You need to learn both ham and spam for the bayes engine to work right. It doesn't have to be 50/50, but it does need a decent amount of ham. -- Bowie