understanding htmllib

David Bear Tue, 03 Oct 2006 20:45:47 -0700

I'm trying to understand how to use the HTMLParser in htmllib but I'm not
seeing enough examples.


I just want to grab the contents of everything enclosed in a '<body>' tag,
i.e. items from where <body> begins to where </body> ends. I start by doing

class HTMLBody(HTMLParser):
   def __init__(self):
      self.contents = []

   def handle_starttag()..

Now I'm stuck. I cant see that there is a method on handle_starttag that
would return everthing to the end tag. And I haven't seen anything on how
to define my one handle_unknowntag..

Any pointers would be greatly appreciated. The documentation on this module
at python.org seems to assume a great deal about what the reader would
already know about which methods they should subclass.

-- 
David Bear
-- let me buy your intellectual property, I want to own your thoughts --
-- 
http://mail.python.org/mailman/listinfo/python-list

understanding htmllib

Reply via email to