Hi!

I'm new to this list, so please forgive me if my behavior is inappropriate in 
asking this question or if it has already been asked before.

I run SpamAssassin 2.61 and it catches a lot of spam, but lately, there is 
spam getting through that has bare dictionary words in the ASCII part of a 
MIME message and all the usual junk in the multimedia part. When reading 
these e-mails in Outlook or something like that, the client renders the 
messages beautifully and displays all the HTML and executes all the arbitrary 
code that comes with it. If I have frugal message display rules in KMail for 
instance, I only see the ASCII part, which looks a lot like this;

chop horse local mandarin blue caspian chlorine run
please quest jargon technical longevity

The amount of text is varying, but it appears difficult to train a bayes 
database to distinguish these as bad words, yet retain them as good words. 
Practically these spam mails don't auto-train on my users because I have the 
auto_learn_threshold_spam and auto_learn_threshold_nospam set way out of the 
required_hits range. That ensures that only AWL mail is treated as good 
enough to train on as is really bad spam. So for me this is a nuisance more 
than anything else, but it is getting to be irritating to many of my users.

After glancing at the RFCs for MIME (1521 and onwards), it appears that the 
intention of MIME is to carry two similar copies of a message, one padded 
with multimedia junk and one in bare text. Suppose some fuzzy comparison 
between the ASCII portion and the multimedia portion could be done? I don't 
know how fuzzy or even where to start, so I am asking on this list in case I 
have missed something obvious.

So the question finally, is, how do I protect against this type of spam?

-Alex


-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to