On Mon, 24 Aug 2009 09:08:07 +0200, Stefan Behnel wrote:
> But note that sgmllib is a particularly cumbersome way to deal with HTML.
Mostly because it only provides a tokeniser, not a parser. Whoever wrote
it doesn't appear to understand the difference.
--
http://mail.python.org/mailman/listinf
Dave Angel schrieb:
> elsa wrote:
>> python sgmllib.py "path/to/my/file.html" example (1)
>
> The path in the error message simply refers to the full path string to
> your Python interpreter, and reflects %0 in your shell. So I'd assume
> you've got a script called 'python' on your path, whi
elsa wrote:
Hi all,
I'm new to both this forum and Python, and I've got a bit stuck trying
to learn how to parse HTML here is my problem
I'm using a textbook that uses sgmllib.py for all its examples. I'm
aware that sgmllib is not in the current release, however I want to
get it to work, as
elsa wrote:
> I'm new to both this forum and Python, and I've got a bit stuck trying
> to learn how to parse HTML...
If what you want to do is *parse* the HTML instead of trying to *learn* how
to parse it, you might want to give the existing (external) HTML parser
libraries a try. There's lxml.htm