We all know that a human is far better than a program at detecting spam; the computer has to guess. So spammers add all sorts of junk to make the message harder for the computer to recognize, while still keeping it readable enough for the victims: gappy text, letters separated by periods or other marks, HTML tricks, and other things.
I was thinking today: wouldn't it be better to just ignore all the periods, commas, and what have you in the text? Inside SA we could just drop those and then search the message from there. I've had one spammer who just puts a random period in the message, and it doesn't get tagged. When I took out all the periods, the message scored 10.6 just from the body. Just a thought.

Jason Portwood [EMAIL PROTECTED]

_______________________________________________
Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
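
A rough sketch of the "drop the punctuation, then search" idea from the message above, written in Python rather than SA's Perl. Everything here is an assumption made for illustration: the names strip_junk and count_hits and the phrase list are made up, and none of it is SpamAssassin's actual API or rule set.

```python
import string

# Hypothetical phrase list, for illustration only (not real SpamAssassin rules).
SPAM_PHRASES = ["viagra", "free money"]

# Translation table that deletes every ASCII punctuation character.
_DROP_PUNCTUATION = str.maketrans("", "", string.punctuation)


def strip_junk(text: str) -> str:
    """Delete punctuation, collapse runs of whitespace, and lowercase."""
    without_marks = text.translate(_DROP_PUNCTUATION)
    return " ".join(without_marks.split()).lower()


def count_hits(text: str, phrases=SPAM_PHRASES) -> int:
    """Count how many phrases appear in the punctuation-free text."""
    normalized = strip_junk(text)
    return sum(1 for phrase in phrases if phrase in normalized)


if __name__ == "__main__":
    message = "Buy V.i.a.g.r.a now, get F-R-E-E money!"
    print(strip_junk(message))
    print(count_hits(message))
```

This only covers marks inserted inside words. Gappy text such as "V I A G R A" would still slip through, and deleting a period with no space after it glues the neighbouring words together, so it would probably work best as an extra pass alongside the normal body checks rather than a replacement for them.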