On Mon, 09 Jun 2003 07:13:43 +0200, you wrote:

>Does anyone have a good regular expression for capturing all http URL in 
>a string?
>now i'm using ?(http://[a-z0-9-/_;&=+-\.\?:@]+)\b?mi
>in one of my programs, that is, all strings beginning with http:// and 
>contain chars like a-z0-9 ...
>
>does anione have something better?

Hmm. I just glanced at RFC 2396, which offers this regex to validate an URI:

^(([^:/?#]+):)?(//([^/?#]*))?([^?#]*)(\?([^#]*))?(#(.*))?
 12            3  4          5       6  7        8 9

the numbers indicate the reference points for each subexpression

    1 = http:
    2 = http
    3 = //www.ics.uci.edu
    4 = www.ics.uci.edu
    5 = /pub/ietf/uri/
    6 = <undefined>
    7 = <undefined>
    8 = #Related
    9 = Related

http://www.ietf.org/rfc/rfc2396.txt


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to