On Fri, 10 Jul 2009, Sim wrote:

/\bwww(?:\s\W?\s?|\W\s)\w{3,6}\d{2,6}(?:\s\W?\s?|\W\s)(?:c\s?o\s?m|n\s?e\s?t|o\s?r\s?g)\b/i

I'm using it without good results for this format:

bla bla www. site. net. bla bla

Have you any idea?

There are no digits in that URI.

If this becomes common, change the \d{2,6} to \d{0,6}, but that will increase the risk of FP somewhat.

Dan: there are no parentheses in that RE that attempt to match the message text, they are all grouping parentheses.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  The Constitution is a written instrument. As such its meaning does
  not alter. That which it meant when adopted, it means now.
                    -- U.S. Supreme Court
                       SOUTH CAROLINA v. US, 199 U.S. 437, 448 (1905)
-----------------------------------------------------------------------
 10 days until the 40th anniversary of Apollo 11 landing on the Moon

Reply via email to