On Thursday 02 May 2002 02:14 am, Michael Moncur wrote:
> Actually it seems harmless - unlike the old spam phrases stuff, there's
> still only one rule and PORN_3 has a score of 0.6, so it's not going to
> push too many things over the threshold.
>
> Perhaps after testing it might be good to have a separate LOTS_OF_PORN_3
> rule that checks for a higher number...

Another idea: make a single regexp that matches any occurrence of the
words in the list. Then loop over $$fulltext =~ m/PATTERN/g, calling
pos() on each iteration to get the match position, and build an array of
the positions where the matches occurred. From that array you can
determine whether the words occur close to one another, as the old
PORN_3 rule did, but much, much quicker than with the old rule's
complicated regexp.

-- 
Visit http://dmoz.org, the world's largest human edited web directory.
Give a man a match, and he'll be warm for a minute, but set him on
fire, and he'll be warm for the rest of his life.
[EMAIL PROTECTED] ICQ: 132152059
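
A rough sketch of the loop described above, in case it helps. This is
not code from SpamAssassin: the word list, the sub names
(match_positions, close_pairs) and the 20-character window are all
made up for illustration, and pos() reports the offset just past each
match rather than its start.

```perl
use strict;
use warnings;

# Hypothetical word list; the real list used by the rule is not shown here.
my @words = qw(alpha beta gamma);

# One alternation regexp covering every word in the list.
my $alt     = join '|', map { quotemeta($_) } @words;
my $pattern = qr/\b(?:$alt)\b/i;

# Collect the offset reported by pos() after each successive match.
# Takes a scalar reference, mirroring the $$fulltext usage above.
sub match_positions {
    my ($fulltext) = @_;
    my @positions;
    while ($$fulltext =~ /$pattern/g) {
        push @positions, pos($$fulltext);
    }
    return @positions;
}

# Count adjacent matches that lie within $window characters of each other.
sub close_pairs {
    my ($window, @positions) = @_;
    my $count = 0;
    for my $i (1 .. $#positions) {
        $count++ if $positions[$i] - $positions[$i - 1] <= $window;
    }
    return $count;
}

my $text = "alpha and beta are near; much later on, far away, comes gamma";
my @pos  = match_positions(\$text);    # (5, 14, 61)
my $hits = close_pairs(20, @pos);      # 1: only the first two are close
```

A rule could then fire when the count of close pairs reaches some
threshold; what threshold and window make sense would have to come
from testing against real mail.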