On Thursday 02 May 2002 02:14 am, Michael Moncur wrote:

> Actually it seems harmless - unlike the old spam phrases stuff, there's
> still only one rule and PORN_3 has a score of 0.6, so it's not going to
> push too many things over the threshold.
>
> Perhaps after testing it might be good to have a separate LOTS_OF_PORN_3
> rule that checks for a higher number...

Another idea: make a single regexp to search for any occurence of the words in 
the list.  Then do a loop over $$fultext =~ m/PATTERN/g, and call pos() each 
iteration to get the match position, so as to make an array of positions of 
where the matches occured.  Then you can determine if the words occur close 
to one another, like with the old PORN_3 rule, but much, much quicker than 
the complicated regexp for the old PORN_3.

-- 
Visit http://dmoz.org, the world's   | Give a man a match, and he'll be warm
largest human edited web directory.  | for a minute, but set him on fire, and
                                     | he'll be warm for the rest of his life.
[EMAIL PROTECTED]  ICQ: 132152059 |

_______________________________________________________________

Have big pipes? SourceForge.net is looking for download mirrors. We supply
the hardware. You get the recognition. Email Us: [EMAIL PROTECTED]
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to