>> there are some goodies in urllib for doing some of this splitting.
>> Example code at the bottom of my reply (though it seems to choke on
>> certain protocols such as "mailto:" and "ssh:" because urlparse doesn't
>> return the netloc properly)
>
> It doesn't? That's... bad. But for my application, probably not
> important: I only care about HTTP.

This seems to be intentional, rather than a bug. In my
python2.4/urlparse.py file, there's a uses_netloc list which clearly
does not have 'mailto' in it. I can't give an explanation or
justification for it, but it seems to me that there is a netloc
involved in a mail address.

Or maybe I have a semantic misunderstanding of what the netloc field
means when returned from urlparse.urlparse. However, since netloc is
where the hostname appears for "http", it makes me think that the
hostname from a mailto URL should also appear in this result field.

-tkc
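
A minimal sketch of the behavior discussed above, assuming a modern Python 3, where urlparse lives in urllib.parse rather than in the urlparse module from Python 2.4. There the netloc is only filled in when "//" follows the scheme, so a mailto URL leaves it empty and the whole address lands in the path. The host_of helper is hypothetical (not part of any library) and shows one way to recover the hostname in both cases.

```python
from urllib.parse import urlparse


def host_of(url):
    """Hypothetical helper: return the host of an http-style or mailto URL."""
    parts = urlparse(url)
    if parts.netloc:
        # "scheme://host/..." form: .hostname drops any userinfo and port
        return parts.hostname
    if parts.scheme == "mailto" and "@" in parts.path:
        # mailto has no "//", so the address ends up in .path, not .netloc
        return parts.path.rsplit("@", 1)[1]
    return None


http_parts = urlparse("http://www.example.com/index.html")
mail_parts = urlparse("mailto:someone@example.com")

print(repr(http_parts.netloc))  # host appears in netloc
print(repr(mail_parts.netloc))  # empty for mailto
print(repr(mail_parts.path))    # the address itself
print(host_of("mailto:someone@example.com"))
```

Splitting on the last "@" is a simplification; it ignores multiple recipients and any other address forms a mailto URL can carry.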