Linas Vepstas <linasveps...@gmail.com>:
> Unicode is just a bunch of bytes that are null-terminated.
That is not what Unicode is. Maybe you are thinking of UTF-8, an ingenious way to encode a sequence of Unicode code points as a sequence of bytes. The trouble is that some byte sequences are not legal UTF-8. They are invalid as Unicode text, yet they are perfectly possible and valid as far as the (Linux) operating system is concerned.

Marko
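
To make the point about illegal byte sequences concrete, here is a minimal Python sketch. The helper name is_valid_utf8 and the sample bytes are my own choices for illustration, not anything from the thread. It checks whether a byte string decodes as UTF-8, then shows one way (the "surrogateescape" error handler, which Python itself uses for filenames) to carry such bytes through a text string without losing them.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data is a well-formed UTF-8 byte sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# "café" encoded as Latin-1: the trailing byte 0xE9 starts a multi-byte
# UTF-8 sequence that never completes, so this is not legal UTF-8.
# Linux would still accept these bytes, for instance as a filename.
raw = b"caf\xe9"

# The same word encoded properly as UTF-8 (0xC3 0xA9 for the last letter).
good = "café".encode("utf-8")

print(is_valid_utf8(raw))
print(is_valid_utf8(good))

# surrogateescape smuggles each undecodable byte through as a lone
# surrogate code point, so the original bytes can be recovered exactly.
as_text = raw.decode("utf-8", errors="surrogateescape")
roundtrip = as_text.encode("utf-8", errors="surrogateescape")
print(roundtrip == raw)
```

The string in as_text is not valid Unicode text either (it contains a lone surrogate); it is only a lossless container for bytes the operating system accepted.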