Hi, I'm going to write a program that extracts the structure of HTML documents. The structure would be in the form of a tree, separating the tags and grouping the start and end tags. I think I will use htmllib.HTMLParser, is it appropriate for my application? If so, I believe I will need to keep track of the depth reached.
Any tips for such application will be much appreciated. Cheers, Michael -- http://mail.python.org/mailman/listinfo/python-list