Fredrik Lundh escreveu: > > Except it appears to be buggy or, at least, not very robust. There are > > websites for which it falsely terminates early in the parsing. > > which probably means that the sites are broken. the amount of broken > HTML on the net is staggering, as is the amount of code in a typical web > browser for dealing with all that crap. for a more tolerant parser, see: > > http://www.crummy.com/software/BeautifulSoup/ > > </F>
+1 for BeautifulSoup. The documentation is quite brief and sometimes confusing, but I've found it the easiest parser I've ever worked with. Stephen -- http://mail.python.org/mailman/listinfo/python-list