Re: Matching invalid characters in a URL

2004-01-12 Thread Rob Dixon
Kevin Zembower wrote: > > Thank you all for some first thoughts and clarifying questions. > > I'm trying to discard any URL with any character that is not an upper- or lower-case > letter, digit, or the characters $-_.+!*'(), . I realize that some other characters > can be > used in special circum

RE: Matching invalid characters in a URL

2004-01-12 Thread Charles K. Clarkson
KEVIN ZEMBOWER <[EMAIL PROTECTED]> wrote: : : I'm trying to discard any URL with any character that is not : an upper- or lower-case letter, digit, or the characters : $-_.+!*'(), . I realize that some other characters can be : used in special circumstances, but I don't have to allow for : any of

Re: Matching invalid characters in a URL

2004-01-12 Thread KEVIN ZEMBOWER
Thank you all for some first thoughts and clarifying questions. I'm trying to discard any URL with any character that is not an upper- or lower-case letter, digit, or the characters $-_.+!*'(), . I realize that some other characters can be used in special circumstances, but I don't have to allo
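
The rule stated in this message (reject any URL containing a character that is not a letter, a digit, or one of $-_.+!*'(),) can be sketched with a negated character class. This is only an illustration in Python rather than the Perl the thread discusses, and the names INVALID_CHAR, has_invalid_char, and keep_valid are hypothetical, not taken from any post. Note that under the rule exactly as stated, reserved characters such as ":" and "/" also count as invalid.

```python
import re

# Allowed set taken from the message above: ASCII letters, digits,
# and the specials $-_.+!*'(),
# The leading ^ negates the class, so the pattern matches any single
# character that is NOT in the allowed set.
INVALID_CHAR = re.compile(r"[^A-Za-z0-9$\-_.+!*'(),]")


def has_invalid_char(url: str) -> bool:
    """Return True if the string contains at least one disallowed character."""
    return INVALID_CHAR.search(url) is not None


def keep_valid(urls):
    """Return only the strings made up entirely of allowed characters."""
    return [u for u in urls if not has_invalid_char(u)]


if __name__ == "__main__":
    for candidate in ["www.example.com", "user@example.com", "a b"]:
        print(candidate, has_invalid_char(candidate))
```

Searching for one bad character is usually simpler than describing every valid string: a single match is enough to discard the URL.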

Re: Matching invalid characters in a URL

2004-01-11 Thread Dan Anderson
On Fri, 2004-01-09 at 16:54, Wiggins d Anconia wrote: > > > Any suggestions? Thanks for your help and thoughts. > > > > It is much easier to define the set all chars must be in than not. Use > > the =! which is the complement of all characters matched by =~. > > Alternatively, I believe there i

Re: Matching invalid characters in a URL

2004-01-09 Thread Wiggins d Anconia
> I'm trying to throw out URLs with any invalid characters in them, like > '@". According to http://www.ietf.org/rfc/rfc1738.txt : >Thus, only alphanumerics, the special characters "$-_.+!*'(),", and >reserved characters used for their reserved purposes may be used >unencoded within a

Re: Matching invalid characters in a URL

2004-01-09 Thread Wiggins d Anconia
> > Any suggestions? Thanks for your help and thoughts. > > It is much easier to define the set all chars must be in than not. Use > the =! which is the complement of all characters matched by =~. > Alternatively, I believe there is a c option you can use. > > -Dan That (I presume) should b

Re: Matching invalid characters in a URL

2004-01-09 Thread Dan Anderson
> Any suggestions? Thanks for your help and thoughts. It is much easier to define the set all chars must be in than not. Use the =! which is the complement of all characters matched by =~. Alternatively, I believe there is a c option you can use. -Dan

Matching invalid characters in a URL

2004-01-09 Thread KEVIN ZEMBOWER
I'm trying to throw out URLs with any invalid characters in them, like '@". According to http://www.ietf.org/rfc/rfc1738.txt : Thus, only alphanumerics, the special characters "$-_.+!*'(),", and reserved characters used for their reserved purposes may be used unencoded within a URL. I'd l