I just tested those three rules below, and none of them work with 
"www.superveils . com" (ie. having a space both before and after 
that dot).

You might want to try my version of this rule instead - it's attached to avoid 
line wraps. Works well for double-spaces in a URL 
(including on either side of the dot), plus a whole bunch of other obfuscated 
URL possibilities. Score accordingly. I didn't 
actually write this rule myself, I copied it from somewhere (sorry, forgot 
where) but I've tweaked it slightly since then.

Cheers,
Jeremy





<[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED]
> On Thu, 8 Mar 2007, [EMAIL PROTECTED] wrote:
>
>> I searched the list and found this rule to catch URL with single space
>> (www.ledrx .com). Please help me in modifying this rule to catch URL
>> with double space (www.superveils . com).
>>
>> body URL_WITH_SPACE m/\bhttp:\/\/[a-z0-9\-.]+[!*%&, -]+\.?com\b/
>
> Personally I would make it something like this:
>
> # Handles www.  a.com, www.a .com, www. a .com, www . a.com, ...
> body __URL_WITH_SPACE1 /www[\ ]+?\.([a-z0-9\-]?\ [a-z0-9\-]?)+\.[
> ]+?(com|net|org)/
> # Handles www .xxx.com
> body __URL_WITH_SPACE2 /www[\ ]+\.([a-z0-9\-\ ]?)+\.[\ ]+?(com|net|org)/
> # Handles www.xxx. com
> body __URL_WITH_SPACE3 /www[\ ]+?\.([a-z0-9\-\ ]?)+\.[\ ]+(com|net|org)/
>
> meta URL_WITH_SPACE ( __URL_WITH_SPACE1 || __URL_WITH_SPACE2 ||
> __URL_WITH_SPACE3 )
> describe Body contains an URL with a space
> score URL_WITH_SPACE xx
>
> I did a few quick tests against some URL's, though it's untested against
> my ham & spam boxes :-)
>
> K.
>
> 


begin 666 jf_url_illegal_chars.txt
M=7)I(" @($I&7U523%])3$Q%1T%,7T-(05)3(" @("\H6V$M>C M.2Y?7"U=
M>S$L,S!]*#I;82UZ,"TY+E]<+5U[,2PS,'TI/UQ *3];82UZ,"TY+E]<+5U[
M,2PS,'U;7F$M>C M.2Y?7"U<+SHG7%M=6V$M>C M.2Y?7"U<0%U[,2PS,'TH
M/SHD?%PO?%P_?#I;,"TY72DO:0T*9&5S8W)I8F4@(" @2D9?55),7TE,3$5'
M04Q?0TA!4E,@(" @55),(&-O;G1A:6YS(&EL;&[EMAIL PROTECTED]<F%C=&5R<PT*
F<V-O<F4@(" @2D9?55),7TE,3$5'04Q?0TA!4E,@(" @,BXP#0H`
`
end


Reply via email to