Tx monty for the fix! On Fri, May 5, 2017 at 7:28 PM, monty <mon...@programmer.net> wrote:
> This should be fixed now. Thanks for the bug report. > > > Sent: Wednesday, May 03, 2017 at 4:44 PM > > From: "Udo Schneider" <udo.schnei...@homeaddress.de> > > To: pharo-users@lists.pharo.org > > Subject: [Pharo-users] XMLHTMLParser Entity Handling oddity > > > > All, > > > > I'm hitting an interesting issue with XMLHTMLParser and I'm not even > > sure if this is a bug or intended behaviour. Given an HTML Entity in a > > String it's resolved or quoted depending on the tag (header or section > tag): > > > > doc := XMLHTMLParser parse: > > '<html><head><title>Ü</title></head><body>Ü</body></html>'. > > (doc findElementNamed: 'title') contentString. "'Ü'" > > (doc findElementNamed: 'body') contentString. "'Ü'" > > > > In my understanding and according to > > https://www.w3.org/TR/html401/struct/global.html#h-7.4.2 Entities in the > > title tag are allowed and should IMHO be resolved. > > > > So both should return 'Ü' in this case. > > > > Any pointers? > > > > CU, > > > > Udo > > > > > > > >