On Jan 31, 2010, at 9:42 AM, Andrew Thompson wrote:

> 0x80 to 0x9F in codepage 1252 includes the Euro sign, the bullet (option-8 on
> the Mac), the en-dash and em-dash... i.e. all things that will be found even
> in English text.
> (Reference http://msdn.microsoft.com/en-us/goglobal/cc305145.aspx)
> These can all be represented in Unicode, but you'd have to run the text
> through a converter. Which will lead to the question, how do you know the
> encoding of what was pasted in?

This is a false alarm: the OP already clarified that he's not talking about UTF-8 encoding, but rather about Unicode characters that are illegal in XML (such as nulls). So the issue really has nothing to do with CP1252 or other encodings.

--Jens
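
To make "illegal in XML" concrete, here is a minimal sketch, assuming the target is XML 1.0, whose Char production allows only tab, LF, CR, and the ranges U+0020-U+D7FF, U+E000-U+FFFD and U+10000-U+10FFFF. Python is used purely for illustration, and `strip_xml_illegal` is a hypothetical helper name, not anything from Cocoa or from the original thread. Silently dropping the offending characters is just one possible policy; replacing or rejecting them may be more appropriate depending on the application.

```python
import re

# Matches any character outside the XML 1.0 "Char" production:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_XML10_ILLEGAL = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_xml_illegal(text: str) -> str:
    """Return text with characters that XML 1.0 forbids removed.

    Hypothetical helper for illustration only. It works on decoded
    Unicode text, so the byte encoding the text arrived in (CP1252,
    UTF-8, ...) plays no part in the check.
    """
    return _XML10_ILLEGAL.sub("", text)


if __name__ == "__main__":
    pasted = "price: 5\u20ac\x00 \u2022 done"
    # The NUL is dropped; the Euro sign and the bullet are kept.
    print(repr(strip_xml_illegal(pasted)))
```

Note that the Euro sign, bullet, and dashes from the quoted message pass through untouched once they are ordinary Unicode characters, which is why the CP1252 question is separate from the XML-legality question.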