"Justin Ezequiel" <[EMAIL PROTECTED]> wrote:
> On Mar 29, 4:08 pm, Duncan Booth <[EMAIL PROTECTED]> wrote: >> John Nagle <[EMAIL PROTECTED]> wrote: >> > title="<!--http://www.microsoft.com/usability/information.mspx->" >> >> > is supposed to be an HTML comment. But it's improperly terminated. >> >> It is an attribute value, and unescaped angle brackets are valid in >> attributes. It looks to me like a bug in BeautifulSoup. > > FWIW, see http://tinyurl.com/yjtzjz > > new fan of BeautifulSoup here as it helped me parse "BAD" XML > (although my client would disagree with that description) > I'm right behind BeautifulSoup's ability to parse bad HTML, but I still think it should give priority to being able to parse valid HTML withough messing it up. -- http://mail.python.org/mailman/listinfo/python-list