Re: [sword-devel] BOM and Osis2Mod

2009-01-07 Thread David Haslam
You can easily remove the BOM from a UTF-8 text file. Windows users can make use of the free version of http://www.unipad.org/ SC Unipad . This application can also convert from UTF-8 to UTF-16 etc and vice versa. http://notepad-plus.sourceforge.net/ Notepad++ can also do these conversions.

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread DM Smith
Wolfgang, I guess I wasn't clear enough. I think all detected errors should produce a message. And what I didn't say, but meant to, was that I'll output an error message for an unsupported BOM. In Him, DM On Jan 6, 2009, at 11:15 AM, Wolfgang Schultz wrote: Hello, here we see th

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread Wolfgang Schultz
Hello, here we see the difference between user centric programming and ideological influence on how to make programms :)) On my opinion i would change osis2mod ( and also vpl2mod) so that it isn't an obstacle for using in any situation BOM or not BOM. However osis2mod should print out a

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread DM Smith
Ian Wagner wrote: See http://unicode.org/faq/utf_bom.html#bom1. It indicates the encoding that the file is written in, be it big-endian (most siginificant byte to least significant byte-- the way that we write, and the way the PowerPC stores numbers in memory), or little-endian (the reverse an

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread Chris Little
Wolfgang Schultz wrote: Hello, if the utf-8 OSIS file has a BOM ( Byte Order Mark some editors insert one) Osis2Mod will fail to make a sword-modul, it were nice if this would be fixed, because it will cause lot of problems in further steps :( The BOM in UTF-8 is generally a Windows-ism. No

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread Ian Wagner
Actually, if I would have read down further, it explicitly says "Yes, UTF-8 can contain a BOM. However, it makes no difference as to the endianness of the byte stream. UTF-8 always has the same byte order. An initial BOM is only used as a signature — an indication that an otherwise unmarked

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread Ian Wagner
See http://unicode.org/faq/utf_bom.html#bom1. It indicates the encoding that the file is written in, be it big-endian (most siginificant byte to least significant byte-- the way that we write, and the way the PowerPC stores numbers in memory), or little-endian (the reverse and the way intel

Re: [sword-devel] BOM and Osis2Mod

2009-01-06 Thread DM Smith
Wolfgang Schultz wrote: Hello, if the utf-8 OSIS file has a BOM ( Byte Order Mark some editors insert one) Osis2Mod will fail to make a sword-modul, it were nice if this would be fixed, because it will cause lot of problems in further steps :( What does a byte order mark mean for a UTF-8 fi