At 04:51 PM 9/26/03 -0700, Chad Simmons wrote:

I've noticed after installing Spamassassin, I'm still getting a large number of
messages passed through without matching any of the rules.


Many spammers are using paypal.com to get their emails "whitelisted". Also I've
noticed that they will use invalid HTML tags to hide high risk words from
content filter tests (e.g. Vi<kajaneisdala>co<jakaidlalaix>din).


I was wondering if there was a way to have spam assassin strip out all the HTML
tags before it does the content searching for "bad" words, etc.

What version of SA are you running? SA has been using HTML tag stripping as a part of everyday operations for a very, very long time.
Standard procedure for a "body" rule is that all HTML tags are stripped, QP and escape codes decoded, and EOLs removed prior to running the rules. That's even in the manpage Mail::SpamAssassin::Conf.


However, some of the invalid tags confused the parser, and it's been steadily improved to adapt to the wildly invalid tags used. I've not tested 2.60's mettle against the latest wave, but I suspect it fares pretty well.. Even 2.4x had that feature, although it was easily confused by invalid tags at that point it time.




------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to