On Tue, 2002-04-30 at 11:11, rODbegbie wrote: > OE would display it to the user, but SA would miss it for scoring.
That's a good point, especially since according to Theo it is not just OE, it is IE's renderer that sees them as comments. We are talking about what spammers are going to use to reach the vast majority of people who end up seeing the HTML in IE, not web designers who want to make their HTML work in as many browsers as possible. Is there evidence of comments with extra spaces in either the spam or the non-spam corpus? Or is this all hypothetical? Perhaps the right thing to do is to strip comments out using a simple check for '<!--' ... '-->'. If spammers start to take advantage of the looser syntax of IE, we can add a rule that scores up comments that have leading or trailing spaces. If it turns out that the only people who use illegal comment syntax are spammers trying to get around SA, you can look for that without even having to parse the text that it is trying to hide. So I would suggest matching for comments without the extra spaces and only add other rules if spam starts showing up that justifies them. -- sidney _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk