On Thu, 21 Sep 2006, mouss wrote:

> Theo Van Dinter wrote:
> > On Tue, Sep 19, 2006 at 10:58:46PM +0200, mouss wrote:
> >   
> >> URI_NOVOWEL fires with things like href="#id"  where id is a string that 
> >> starts with 7 "no-vowel" chars.
> >>
> >> uri URI_NOVOWEL             m%^https?://[^/?]*[bcdfghjklmnpqrstvwxz]{7}%i
> >> uri URI_NOVOWEL             m%^https?://[^/?\#]*[bcdfghjklmnpqrstvwxz]{7}%i
> >>
> >> is this correct?
> >
> > That depends on your definition of "correct".  The RE looks ok, but the
> > hitrate could change dramatically.  It's hard to say without testing.
> 
> my understanding is that the rule looks for "dummy" hostnames in the 
> server part. unfortunately, the way URIs are "exposed" by SA, this rule 
> also applies to any thing that resembles a URI. This is a problem with 
> relative URIs (aka href="foo.html" if foo matches the rule).

Erm. How can it match relative and "#gibberish" URIs at all if the RE
is explicitly anchored to "https?://" at the start of the URI?

--
 John Hardin KA7OHZ    ICQ#15735746    http://www.impsec.org/~jhardin/
 [EMAIL PROTECTED]    FALaholic #11174    pgpk -a [EMAIL PROTECTED]
 key: 0xB8732E79 - 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  If someone has a gun and is trying to kill you, it would be
  reasonable to shoot back with your own gun.
                                      -- the Dalai Lama, May 15, 2001
-----------------------------------------------------------------------

Reply via email to