Francach wrote:
> Hi,
>
> I'm trying to use the Beautiful Soup package to parse through the
> "bookmarks.html" file which Firefox exports all your bookmarks into.
> I've been struggling with the documentation trying to figure out how to
> extract all the urls. Has anybody got a couple of longer examples using
> Beautiful Soup I could play around with?
>
> Thanks,
> Martin.

If the only thing you want out of the document is the URLs, why not just search for href="..."? A regular expression that matches that is pretty easy to write. I think this should just about get you there, but my regular expressions have gotten very rusty:

/href="[^"]+"/

The [^"]+ stops at the first closing quote. A plain .+ is greedy and would run on to the last quote on the line, swallowing any attributes that follow the href.
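
For what it's worth, here is a rough sketch of that href search in Python, using only the standard re module. The function name extract_urls and the sample markup are made up for illustration, and I'm assuming the export writes double-quoted href values, possibly with upper-case attribute names (hence the IGNORECASE flag). The pattern uses [^"]+ rather than .+ so that each match ends at the first closing quote.

```python
import re

# Capture whatever sits between the double quotes of an href attribute.
# [^"]+ stops at the first closing quote, so trailing attributes are not swallowed.
HREF_RE = re.compile(r'href="([^"]+)"', re.IGNORECASE)


def extract_urls(html):
    """Return every double-quoted href value in html, in document order."""
    return HREF_RE.findall(html)


# Made-up sample in the style of a bookmarks export (an assumption, not real data).
sample = (
    '<DT><A HREF="http://www.python.org/" ADD_DATE="1157000000">Python</A>\n'
    '<DT><A HREF="http://www.crummy.com/software/BeautifulSoup/">Beautiful Soup</A>\n'
)

for url in extract_urls(sample):
    print(url)
```

To run it on the real file, read bookmarks.html into a string and pass that to extract_urls. This only catches double-quoted href values, so it is a quick hack rather than a real HTML parser.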