HTML Structure Extraction

dayzman Tue, 07 Dec 2004 22:35:04 -0800

Hi,

I'm going to write a program that extracts the structure of HTML
documents. The structure would be in the form of a tree, separating the
tags and grouping the start and end tags. I think I will use
htmllib.HTMLParser, is it appropriate for my application? If so, I
believe I will need to keep track of the depth reached.


Any tips for such application will be much appreciated.

Cheers,
Michael

-- 
http://mail.python.org/mailman/listinfo/python-list

HTML Structure Extraction

Reply via email to