Hello Clive,

Wednesday, December 24, 2003, 8:09:42 AM, you wrote:

CD> I am receiving several spam messages daily in which the message body appears
CD> to consist entirely of random words.

Check again, and I think you'll find that the message has nothing to do
with those random words.  The message is either in a graphic attachment
or an HTML attachment, and the random words are to help the email slip
through filters.

CD> Spamassassin is not catching these messages.  The Bayesian filter has not yet
CD> kicked in but I am running uncaught spam through sa-learn.  I am concerned
CD> about whether the Bayesian filter will be biased by an accumulation of random
CD> words and so I am seeking advice as to whether I should continue to run this
CD> class of spam through sa-learn.

Continuing running it through sa-learn. I used to avoid this, but changed
my mind about 2 months back, and have seen no problems learning these
random words. What we're really teaching SA about are the headers and the
other useful attributes of the message; the random words are just noise
which at this level doesn't seem to affect anything (except maybe make
the Bayes database grow a little in size).

Bob Menschel





-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to