On Tue, 2002-04-30 at 11:11, rODbegbie wrote:
> OE would display it to the user, but SA would miss it for scoring.

That's a good point, especially since according to Theo it is not just
OE, it is IE's renderer that sees them as comments. We are talking about
what spammers are going to use to reach the vast majority of people who
end up seeing the HTML in IE, not web designers who want to make their
HTML work in as many browsers as possible.

Is there evidence of comments with extra spaces in either the spam or
the non-spam corpus? Or is this all hypothetical?

Perhaps the right thing to do is to strip comments out using a simple
check for '<!--' ... '-->'. If spammers start to take advantage of the
looser syntax of IE, we can add a rule that scores up comments that have
leading or trailing spaces. If it turns out that the only people who use
illegal comment syntax are spammers trying to get around SA, you can
look for that without even having to parse the text that it is trying to
hide.

So I would suggest matching for comments without the extra spaces and
only add other rules if spam starts showing up that justifies them.

 -- sidney



_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to