John Oliver wrote:

> I'm led to believe I can feed untagged spam (and tagged non-spam) to SA
> for it to "learn" how to be better.  How do I do this?

You were led right.  Assuming mbox format mail file -- for instance,
Netscape, Eurdora (and my FreeBSD account with qmail is mbox format) - you
can feed sa like (with D, for debug output):

./sa-learn -D --spam --mbox mailBoxFile

where the 'spam' parameter tells to learn as spam - use 'ham' for non-spam.
sa-learn automatically removes SA headers, so tagged or not, does not matter
- unless said tags are not SA's.

I've read it recommended to take out list mail when ham learning, and I too
thought this was a good idea, and have had good results, though the most
obvious reason to remove it, is I bypass filtering on list mail anyway.

You'll need 200 each of ham, spam, before Bayes will start working - the
mail should be from your own account/experience for best results.  It's also
been recommended that you should have equal numbers of both, and the more
the better, but upwards of 5000 (please correct if wrong) is overkill.

Me, I'm very light on ham, but still getting very good results.  I may
change my tune with regards to this imbalance, but so far, no false
positives.

Though several hundred caught, no spam in my in box today - again.

Bryan

> --
> John Oliver, CCNA                     http://www.john-oliver.net/
> Linux/UNIX/network consulting       http://www.john-oliver.net/resume/
> *    *    *    *    *    *    *     *    *    *    *    *    *    *    *
> Contribute to the SpamCon Legal Fund!! http://www.spamcon.org/legalfund/
>
> -------------------------------------------------------
> This SF.net email is sponsored by OSDN's Audience Survey.
> Help shape OSDN's sites and tell us what you think. Take this
> five minute survey and you could win a $250 Gift Certificate.
> http://www.wrgsurveys.com/2003/osdntech03.php?site=8

--
Nothing in the world has more potential for beauty than woman.  Nothing has
more potential to destroy it, than the world. - (Anonymous)

http://www.wecs.com/content.htm

This signature file is generated by Pick-a-Tag !
Written by Jeroen van Vaarsel
http://www.google.com/search?hl=en&ie=ISO-8859-1&q=pick-a-tag





-------------------------------------------------------
This SF.net email is sponsored by OSDN's Audience Survey.
Help shape OSDN's sites and tell us what you think. Take this
five minute survey and you could win a $250 Gift Certificate.
http://www.wrgsurveys.com/2003/osdntech03.php?site=8
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to