-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Mike Hearn wrote:
> On Mon, 24 Apr 2006 15:27:07 -0400, Nicolas De Rico wrote:
>> I would like to compile files created on Windows and encoded in 
>> "Unicode" (UTF-8 or UTF-16).  Microsoft puts a little header at the 
>> beginning of files to indicate that they are UTF-16, UTF-8, etc.  I 
>> believe that this header is standard unicode btw, not an extension!
> 
> Are you thinking of the byte order mark (BOM)? If so then this is a quirk
> of UTF-16 and is a Windows thing that many apps can't handle correctly ...
> UTF-8 should not have any headers at all and GCC should handle them fine.
> Try using some text editor to check it really is UTF-8.

Windows Notepad still inserts a BOM (0xEF 0xBB 0xBF) at
the beginning of files encoded with UTF-8. See:

  http://www.microsoft.com/globaldev/getwr/steps/wrg_unicode.mspx
  http://en.wikipedia.org/wiki/Byte_Order_Mark

Ranjit.

- --
Ranjit Mathew      Email: rmathew AT gmail DOT com

Bangalore, INDIA.    Web: http://rmathew.com/


-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2.2 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFETfs0Yb1hx2wRS48RAkvmAKCae/o9vD3doaDKD1VPOSUlSlhRjACdGqv0
nD0cMiSvZLu9TfmIf/BUuIU=
=lZaM
-----END PGP SIGNATURE-----

Reply via email to