Another newbie question here.

So uri rules find links in the body. I'm trying to get a handle on uri rule syntax and have found several disparate examples:


1) uri HTTP_CTRL_CHARS_HOST /^https?\:\/\/[^\/\s]*[\x00-\x08\x0b\x0c \x0e-\x1f]/

2) uri NORMAL_HTTP_TO_IP                m{^https?://\d+\.\d+\.\d+\.\d+}i

3) uri URI_4YOU                 [EMAIL PROTECTED](?:https?://|mailto:)[^\/[EMAIL PROTECTED]

4) uri HTTP_77                  /http:\/\/.{0,2}\%77/

5) uri BARGAIN_URL              /bargain([sz]|-\S+)?\.(?:com|biz)/

6) uri URI_OFFERS                       m/offer([sz]|-\S+)?\.(?:com|bi?z)/i

7) uri URI_AFFILIATE            /aff\w+id=/i


I have a few questions and welcome other tips. What do m{, m/, and m@ mean? Are m||, m(), and m{} interchangeable, or does each mean something different? Does it matter whether the ^ is on the outside (3) or the inside (1 and 2) of the beginning of the pattern? I see the value of uri rules for 5-7, where no anchor is needed. When the pattern starts with http, as in 1-4, is there any improvement over rawbody?
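
To make the anchor question concrete, here is a rough Python sketch (not SpamAssassin itself) of how I currently picture it. It assumes a uri rule is tested against each extracted link on its own, while a rawbody rule sees the surrounding text of the line. The sample line, the sample link, and the helper `hits` are made up for illustration. Rules 2 and 7 are rewritten for Python's `re` module, where the Perl delimiters drop away and the trailing `i` becomes `re.IGNORECASE`.

```python
import re

# Rules 2 and 7 from the list above, translated to Python's re module.
# The Perl delimiters (m{...} and /.../) are not part of the pattern itself.
NORMAL_HTTP_TO_IP = re.compile(r"^https?://\d+\.\d+\.\d+\.\d+", re.IGNORECASE)
URI_AFFILIATE = re.compile(r"aff\w+id=", re.IGNORECASE)

# Hypothetical sample data: one line of body text and the link found in it.
body_line = 'Click <a href="http://192.0.2.10/buy?affiliateid=7">here</a> now'
extracted_uris = ["http://192.0.2.10/buy?affiliateid=7"]


def hits(pattern, candidates):
    """Return the candidate strings that the pattern matches anywhere."""
    return [c for c in candidates if pattern.search(c)]


# The anchored rule matches the extracted link, because the link starts with http.
anchored_on_uri = hits(NORMAL_HTTP_TO_IP, extracted_uris)

# The same anchored rule misses the whole line, because the line starts with "Click".
anchored_on_line = hits(NORMAL_HTTP_TO_IP, [body_line])

# The unanchored rule matches both the link and the whole line.
unanchored_on_uri = hits(URI_AFFILIATE, extracted_uris)
unanchored_on_line = hits(URI_AFFILIATE, [body_line])

print(anchored_on_uri, anchored_on_line, unanchored_on_uri, unanchored_on_line)
```

If that picture is right, the ^ only means "start of the link" in a uri rule, which would be the main difference from rawbody for patterns like 1 and 2.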

Thanks,
Dan
