On Feb 14, 2005, at 3:34 PM, Thomas Arend wrote:
On Monday, 14 February 2005 20:50, Daniel Cañas wrote:
I have over 2000 emails that I have identified as ham and would like to feed to sa-learn.
You should train them as ham.
That is my plan
The emails are all mine (that is, they are addressed to me). Is this a problem for sa-learn?
Where is the problem? If they are not for you, why did you get them?
Will it learn the headers and mark my email address as a token for ham, causing bayes to not work correctly for my address?
The address will be one token. If you feed spam to sa-learn, your address will
also be a token for spam. But bayes does not rely on a single token.
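A toy sketch of the point above, for illustration only. It is not SpamAssassin's actual Bayes code; the frequency-ratio formula, the function name and the tiny corpus are all assumptions made up for this example. It shows that a token seen equally often in ham and spam, such as the recipient's own address, ends up neutral, while other tokens carry the signal.

```python
from collections import Counter


def token_spam_probability(token, spam_counts, ham_counts, n_spam, n_ham):
    """Toy per-token spam probability from relative message frequencies.

    Hypothetical helper for illustration; not SpamAssassin's formula.
    """
    spam_freq = spam_counts[token] / n_spam
    ham_freq = ham_counts[token] / n_ham
    if spam_freq + ham_freq == 0:
        return 0.5  # unseen token: no evidence either way
    return spam_freq / (spam_freq + ham_freq)


# Made-up training data: every message, ham or spam, contains the address.
ham_msgs = [
    {"me@example.org", "meeting", "patch"},
    {"me@example.org", "patch", "thanks"},
]
spam_msgs = [
    {"me@example.org", "cheap", "pills"},
    {"me@example.org", "cheap", "offer"},
]

# Count in how many messages of each class a token appears.
ham_counts = Counter(t for msg in ham_msgs for t in msg)
spam_counts = Counter(t for msg in spam_msgs for t in msg)

p_addr = token_spam_probability(
    "me@example.org", spam_counts, ham_counts, len(spam_msgs), len(ham_msgs)
)
p_cheap = token_spam_probability(
    "cheap", spam_counts, ham_counts, len(spam_msgs), len(ham_msgs)
)
p_patch = token_spam_probability(
    "patch", spam_counts, ham_counts, len(spam_msgs), len(ham_msgs)
)

print(p_addr, p_cheap, p_patch)
```

In this sketch the address lands exactly in the middle, so it pushes a message neither way; words like "cheap" or "patch" are what separate the two classes.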
I have legit spam that I want to learn, but I am afraid to do it if I don't have a corresponding number of ham.
In my opinion and experience this is bullshit.
Cool. This is good to know, as I can collect tons of spam.
I guess the question is:
Is feeding a bunch of emails addressed to a single person into sa-learn
a good thing to do?
Why not? I run spamassassin on a single-user system. You can have an
individual database for every user or a common db for all users. In the latter
case you should not train spam for only one user.
I just switched to sitewide bayes and the spam I train is addressed to different users.
Mostly non-existent users on my system whose mail is forwarded to the admin account.
Thomas -- icq:133073900 http://www.t-arend.de