Re: URI Basics

Dan Mon, 24 Apr 2006 17:18:45 -0700

In 3 ^ is the first character of the regex, just as it is in 1 and2. It
is also inside the delimiters, just like 1 and 2. In example 3 @ is
being used as a delimiter,  and ^ is the first character after it.

Are you saying that in URIs, any character (@ in this case) can serveas the delimiter, so long as it displays after the m and again at theend of the entry?

I'm beginning to realize how many of my learning curve issues areattempts to understand the very structure of a system created with abare minimum of structure.

There is definitely a VERY significant performance penalty to using
rawbody over URI, for any rule.

Consider the size of input. A rawbody regex must be run against the
entire text of the body after QP decoding. A uri regex must be run

against all the text of the URIs that SA found. There is likely tobe at

least a 100:1 difference in size of input. There's no "penalty" for
using a uri rule, as SA will always extract all the URIs and build the
input text, even if you aren't using it.


Great information Matt, thanks.


Dan

Re: URI Basics

Reply via email to