Re: Parsing HTML?

Stefan Behnel Sun, 06 Apr 2008 23:08:23 -0700

Benjamin wrote:
> I'm trying to parse an HTML file.  I want to retrieve all of the text
> inside a certain tag that I find with XPath.  The DOM seems to make
> this available with the innerHTML element, but I haven't found a way
> to do it in Python.


    import lxml.html as h
    tree = h.parse("somefile.html")
    text = tree.xpath("string( some/[EMAIL PROTECTED] )")

http://codespeak.net/lxml

Stefan
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Parsing HTML?

Reply via email to