William Ballard wrote:
[...]
But back to Linux.
$ echo hi | iconv -f utf8 -t unicode | grep hi
(no output)

Not surprised; AFAIK grep only understands ASCII, so what you've actually sent it is:

$ echo hi | iconv -f utf8 -t unicode | od -t x1
0000000 ff fe 68 00 69 00 0a 00

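It can't find an 'h' and an 'i' next to each other, because in that output (UTF-16 with a leading byte-order mark, ff fe) every ASCII character is followed by a 00 byte. That's why I mentioned UTF-8: it has the nice property that anything that can be represented in plain ASCII *is*, byte for byte, and every other character is encoded using only bytes with the high bit set, which grep and friends will pass straight through.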
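
To make the byte-level difference concrete, here is a rough sketch (not a tested recipe for every system). It assumes an iconv that accepts the encoding names UTF-8 and UTF-16LE, plus a POSIX printf, od and tr; the variable names are just for illustration. It uses UTF-16LE rather than "unicode" so that no byte-order mark is emitted.

```shell
# "hi\n" as UTF-16LE: every ASCII character is followed by a 00 byte.
utf16_hex=$(printf 'hi\n' | iconv -f UTF-8 -t UTF-16LE | od -An -t x1 | tr -d ' \n')
echo "$utf16_hex"          # prints 680069000a00

# The same text as UTF-8 is byte-for-byte plain ASCII.
utf8_hex=$(printf 'hi\n' | od -An -t x1 | tr -d ' \n')
echo "$utf8_hex"           # prints 68690a

# A non-ASCII character (e-acute, given here as octal escapes) becomes
# two bytes in UTF-8, both with the high bit set; the 'h' is untouched.
accent_hex=$(printf 'h\303\251\n' | od -An -t x1 | tr -d ' \n')
echo "$accent_hex"         # prints 68c3a90a

# Stopgap: convert back to UTF-8 before grep ever sees the data.
match=$(printf 'hi\n' | iconv -f UTF-8 -t UTF-16LE | iconv -f UTF-16LE -t UTF-8 | grep hi)
echo "$match"              # prints hi
```

The last pipeline is the same ad-hoc trick in reverse: put an iconv step in front of whichever tool expects ASCII-compatible input.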

It's still an ad-hoc solution, though; does anyone know of versions of the standard textutils that know about Unicode?

--
[insert interesting .sig here]

