> >I was thinking today wouldn't it be better to just ignore all the periods, > >commas, and what have you in the text? Inside SA we could just drop those > >and then search the message from that. > > > >I've had one spammer who just puts a random period in the message and it > >doesn't get tagged. Taking out all the periods in the message and it > >scored a 10.6 just from the body of the message. > .... > Up_to 500% m.ore S.PERM! > * ADD UP_TO 500% M.ORE SPER.M > * MALE MULTIPLE ORGAS.MS > * HAVE M.ORE INTENSE 0.RGASMS > * PRODUCE ST.RONGER E.RECTIONS > * HAVE A STRONGER 5.EXUAL DESIRE > * 1.NCREASED S.E..XUAL STAMINA > <http://203.197.204.156/pi/>FULLY DO.CTOR APP.ROVED! L.EARN MORE! > 100% MON.EY BAC.K SATISF.ACTION GUA.RANTEE!
Could a rule be developed to strip out the punctuation, and then process the message through the rest of the SA rules? This way, people who do not use spaces in their sentences will not cause false hits, but spammer's words will be reduced to what they really are, and then get caught as intended by SA. Bayes does help a bit, but with the random sentences at the end, and for some reason AWL hits for From and To the same (bug??) a few of these trickle through each day (The ones with periods in every word). Rob ------------------------------------------------------- This SF.Net email sponsored by: ApacheCon 2003, 16-19 November in Las Vegas. Learn firsthand the latest developments in Apache, PHP, Perl, XML, Java, MySQL, WebDAV, and more! http://www.apachecon.com/ _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk