In message <[EMAIL PROTECTED]>, Support Desk wrote: > Anybody know of a good regex to parse html links from html code? The one I > am currently using seems to be cutting off the last letter of some links, > and returning links like > > http://somesite.co > > or http://somesite.ph > > the code I am using is > > > regex = r'<a href=["|\']([^"|\']+)["|\']>'
Can you post some example HTML sequences that this regexp is not handling correctly? -- http://mail.python.org/mailman/listinfo/python-list