On Feb 14, 2005, at 3:34 PM, Thomas Arend wrote:

Am Montag, 14. Februar 2005 20:50 schrieb Daniel Caņas:
I have over 2000 emails that I have as ham and would like to feed to
sa-learn..

You should train them as ham.

That is my plan



The emails are all mine (that is they are addresed to me) is this a problem for sa-learn?

Where is the problem? If they are not for you, why did you get them?

Will it learn the headers and mark my email address as a token for ham... causing bayes to not work correctly for my address?

The address will be one token. If you feed spam to sa-learn your address will
be also a token for spam. But bayes does not work only on one token.



I have legit spam that I want to learn but I am afraid to do it if I don't have corresponding number of ham.

To my opinion and expirience this is bullshit.

Cool.. this is good to know as I can collect tons of spam.


I guess the question is:
Is feeding a bunch of emails addressed to a single person into sa-learn
a good thing to do?

Why not? I run spamassassin on a single user system. You can have an
individual database for every user or a common db for all users. In the last
case you should train spam not only for one user.



I just switched to sitewide bayes and the spam I train is addressed to different users.
Mostly non-existent users on my system whose mail is forwarded to the admin account.



Thomas -- icq:133073900 http://www.t-arend.de



Reply via email to