On Jul 8, 2010, at 10:04 PM, Jerry Krinock wrote:

> Consider this XML:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <foos>
>     <foo location="San José,
>   California" />
> </foos>
>
> wherein there is a linefeed (0x0a) followed by two spaces between the comma
> and "California".  The accented é is represented by two bytes, 0xc3 0xa9.
> It's all nice UTF-8, as indicated in the header.
>
> When I parse this using NSXMLParser, the value for the key "location" in the
> attributes dictionary comes out as:
>
>     San José,   California
>
> wherein there are three spaces (0x20) between the comma and "California".
>
> The accented é gets parsed correctly, as UTF8, but the linefeed 0x0a gets
> changed to a space.  Is this a bug?

Do you know for a fact that it's the LF that gets changed? I would experiment with removing the linefeed, then removing the accented é, then removing the space, to see exactly what triggers the bad behavior.

----------------------------------------------------------------
Steve Bird
Culverson Software - Elegant software that is a pleasure to use.
www.Culverson.com
(toll free) 1-877-676-8175

_______________________________________________
Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)
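
The experiment suggested above can be sketched outside Cocoa to see whether the behavior is specific to NSXMLParser. The following is only an illustration, and it assumes Python's standard-library ElementTree (backed by expat) rather than NSXMLParser; the helper name location_attr and the variant labels are made up for this example. It may also be relevant that the XML 1.0 specification's attribute-value normalization rule replaces a literal line break inside an attribute value with a space, which would be consistent with the result reported, while a character reference such as &#10; is kept as a linefeed.

```python
import xml.etree.ElementTree as ET


def location_attr(value_text: str) -> str:
    """Wrap value_text in the sample document and return the 'location'
    attribute exactly as the parser reports it (hypothetical helper)."""
    doc = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<foos><foo location="' + value_text + '" /></foos>'
    )
    # Feed bytes so the encoding declaration in the header is honored.
    root = ET.fromstring(doc.encode("utf-8"))
    return root.find("foo").get("location")


# One variant per step of the suggested experiment, plus a character reference.
variants = {
    "original": "San José,\n  California",
    "no linefeed": "San José,  California",
    "no accent": "San Jose,\n  California",
    "no spaces": "San José,\nCalifornia",
    "char reference": "San José,&#10;  California",
}

for name, text in variants.items():
    # repr() makes spaces and any surviving linefeed visible.
    print(f"{name:15} {location_attr(text)!r}")
```

If the variants with a literal linefeed come back with a space in its place regardless of the accent, that points at the linefeed rather than the é, and suggests the parser is following the specification rather than misbehaving.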