Silvan Jegen dixit: >That sounds reasonable but requires that we convert UTF-8 to UTF-32 >which should not be strictly necessary when we only map one UTF-8 value >to another.
Arrgh, no. UTF-8 and UTF-32/UCS-4 are encodings of numerical Unicode codepoints. When working with text documents, you always operate on those codepoints. This was true for single-byte encodings as well, except there, the codepoints always fit into bytes. bye, //mirabilos -- 08:05⎜<XTaran:#grml> mika: Does grml have an tool to read Apple ⎜ System Log (asl) files? :) 08:08⎜<ft:#grml> yeah. /bin/rm. ;) 08:09⎜<mrud:#grml> hexdump -C 08:31⎜<XTaran:#grml> ft, mrud: *g*