Angus Leeming <[EMAIL PROTECTED]> writes:

| Abdelrazak Younes <[EMAIL PROTECTED]> writes:
| >> Can you try to change the utf-8 to ucs-4 conversion to use either
| >> "UCS-4BE" or "UCS-4LE", instead of "UCS-4"? Also the conversion the
| >> other way ucs-4 -> ucs-2, try with UCS-2BE and UCS-2LE.
| 
| > And with the attached patch where I have put LE everywhere, the text is 
| > displayed correctly but the inset buttons are not. I guess I have gone 
| > too far... we need a conbination of the second patch 
| > (unicode_little_endian) and this one (unicode_little_endian_full).
| 
| Intel-based PCs use Little Endian byte order. Often Windows file formats use 
| the BOM (http://en.wikipedia.org/wiki/Byte_Order_Mark) to make it trivial for 
| the executable decoding the UTF-8 file format to decide whether the file was 
| stored in Big Endian or Little Endian format. The BOM tends not to be used on 
| Unix machines though as it messes with the sh-bang mechanism and isn't 
| actually needed at all anyway...

UTF-8 is chars so BOM has no place there...

| The fact that you need to tell LyX to convert your files from UTF-8 to the 
| Little Endian flavour of UCS-4 etc suggests that your UTF-8 files are encoded 
| in Big Endian format.

There are no such thing.


-- 
        Lgb

Reply via email to