Re: sgmllib.py

2009-08-25 Thread Nobody
On Mon, 24 Aug 2009 09:08:07 +0200, Stefan Behnel wrote: > But note that sgmllib is a particularly cumbersome way to deal with HTML. Mostly because it only provides a tokeniser, not a parser. Whoever wrote it doesn't appear to understand the difference. -- http://mail.python.org/mailman/listinf

Re: sgmllib.py

2009-08-24 Thread Stefan Behnel
Dave Angel schrieb: > elsa wrote: >> python sgmllib.py "path/to/my/file.html" example (1) > > The path in the error message simply refers to the full path string to > your Python interpreter, and reflects %0 in your shell. So I'd assume > you've got a script called 'python' on your path, whi

Re: sgmllib.py

2009-08-24 Thread Dave Angel
elsa wrote: Hi all, I'm new to both this forum and Python, and I've got a bit stuck trying to learn how to parse HTML here is my problem I'm using a textbook that uses sgmllib.py for all its examples. I'm aware that sgmllib is not in the current release, however I want to get it to work, as

Re: sgmllib.py

2009-08-24 Thread Stefan Behnel
elsa wrote: > I'm new to both this forum and Python, and I've got a bit stuck trying > to learn how to parse HTML... If what you want to do is *parse* the HTML instead of trying to *learn* how to parse it, you might want to give the existing (external) HTML parser libraries a try. There's lxml.htm