On Jan 11, 2009, at 2:22 PM, Kyle Sluder wrote:

On Sun, Jan 11, 2009 at 1:56 PM, Greg Titus <g...@omnigroup.com> wrote:
Cocoa does use UTF-16 as the encoding for its unichar type in NSStrings, but the low 7-bits of UTF-16 (characters 0-127) are identical to the ASCII encodings, so you can cast (char)'7' or (unichar)'7' and get 0x37 either way and it'll be interpreted as the '7' character in a string as you would
expect.

Doesn't that depend on the byte order?  What happens if the string is
stored big-endian with a BOM at the beginning, and you go and read out
a unichar?

An NSString, through its character/unichar interface, always provides native endian values. Only if/when you convert to a specific encoding to produce a byte stream does endianness enter into the discussion. But then, you should never be comparing individual bytes from such a stream to character values. You would always need to decode back to (inherently native-endian) character values before doing any character- based operations.

Regards,
Ken

_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to arch...@mail-archive.com

Reply via email to