On Thu, 21 Sep 2006, mouss wrote: > Theo Van Dinter wrote: > > On Tue, Sep 19, 2006 at 10:58:46PM +0200, mouss wrote: > > > >> URI_NOVOWEL fires with things like href="#id" where id is a string that > >> starts with 7 "no-vowel" chars. > >> > >> uri URI_NOVOWEL m%^https?://[^/?]*[bcdfghjklmnpqrstvwxz]{7}%i > >> uri URI_NOVOWEL m%^https?://[^/?\#]*[bcdfghjklmnpqrstvwxz]{7}%i > >> > >> is this correct? > > > > That depends on your definition of "correct". The RE looks ok, but the > > hitrate could change dramatically. It's hard to say without testing. > > my understanding is that the rule looks for "dummy" hostnames in the > server part. unfortunately, the way URIs are "exposed" by SA, this rule > also applies to any thing that resembles a URI. This is a problem with > relative URIs (aka href="foo.html" if foo matches the rule).
Erm. How can it match relative and "#gibberish" URIs at all if the RE is explicitly anchored to "https?://" at the start of the URI? -- John Hardin KA7OHZ ICQ#15735746 http://www.impsec.org/~jhardin/ [EMAIL PROTECTED] FALaholic #11174 pgpk -a [EMAIL PROTECTED] key: 0xB8732E79 - 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79 ----------------------------------------------------------------------- If someone has a gun and is trying to kill you, it would be reasonable to shoot back with your own gun. -- the Dalai Lama, May 15, 2001 -----------------------------------------------------------------------