Re: BeautifulSoup bug when ">>>" found in attribute value

2006-12-28 Thread Duncan Booth
"Anne van Kesteren" <[EMAIL PROTECTED]> wrote: >> Mind you, the sentence before that says 'should' for quoting < >> characters which is just plain silly. > > For quoted attribute values it isn't silly at all. It's actually part > of how HTML works. > Yes, but the sentence I was complaining abou

Re: BeautifulSoup bug when ">>>" found in attribute value

2006-12-28 Thread Anne van Kesteren
Duncan Booth schreef: > The /> was in the original input that you gave it: > > > > You don't actually *have* to escape > when it appears in html. You don't have to escape it in XML either, except when it's preceded by ]]. > As I said before, it looks like BeautifulSoup decided that the tag ende

Re: BeautifulSoup bug when ">>>" found in attribute value

2006-12-27 Thread Duncan Booth
John Nagle <[EMAIL PROTECTED]> wrote: > It's worse than that. Look at the last line of BeautifulSoup > output: > > &linkurl;=/Europe/Spain/Madrid/Apartments/Offer/2408" /> > > That "/>" doesn't match anything. We're outside a tag at that point. > And it was introduced by Beautifu

Re: BeautifulSoup bug when ">>>" found in attribute value

2006-12-27 Thread John Nagle
Duncan Booth wrote: > John Nagle <[EMAIL PROTECTED]> wrote: > > >>And this came out, via prettify: >> >>>url="http%3A//www.apartmentsapart.com/Europe/Spain/Madrid/FAQ"> >> > value="/images/offersBanners/sw04.swf?binfot=We offer >>fantastic rates for selected weeks or days!!&blinkt=Click

Re: BeautifulSoup bug when ">>>" found in attribute value

2006-12-27 Thread Duncan Booth
John Nagle <[EMAIL PROTECTED]> wrote: > And this came out, via prettify: > > url="http%3A//www.apartmentsapart.com/Europe/Spain/Madrid/FAQ"> >value="/images/offersBanners/sw04.swf?binfot=We offer > fantastic rates for selected weeks or days!!&blinkt=Click here > >>>&linkurl=/Europe/

BeautifulSoup bug when ">>>" found in attribute value

2006-12-26 Thread John Nagle
This, which is from a real web site, went into BeautifulSoup: And this came out, via prettify: >>&linkurl;=/Europe/Spain/Madrid/Apartments/Offer/2408" /> BeautifulSoup seems to have become confused by the ">>>" within a quoted attribute value. It first parsed it right, but then stuck