Re: Parsing HTML

John Nagle Fri, 23 Feb 2007 22:11:11 -0800

    BeautifulSoup does parse HTML well, but there are a few issues:

    1.  It's rather slow; it can take seconds of CPU time to parse
some larger web pages.


    2.  There's no error reporting.  It tries to do the right thing,
but when it doesn't, you have no idea what went wrong.

BeautifulSoup would be a good test case for the PyPy crowd to
work on.  It really needs the speedup.

                                John Nagle

sofeng wrote:
> On Feb 8, 11:43 am, "metaperl" <[EMAIL PROTECTED]> wrote:
>>On Feb 8, 2:38 pm, "mtuller" <[EMAIL PROTECTED]> wrote:
>>>I am trying to parse a webpage and extract information.
>>BeautifulSoup is a great Python module for this purpose:
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Parsing HTML

Reply via email to