On Tue, Feb 12, 2002 at 09:58:58AM +0200, Shachar Shemesh wrote:
> Sorry for retreating in the thread, but an important note struck me from 
> my past.
> 
> Nadav Har'El wrote:
> 
> >No, UNIX traditionally operates on strings of "chars" (bytes/octets). No
> >special treatment is ever given by system calls to any byte except null
> >(and "/" in pathnames)
> >
> Ok, what if the locale allows "/" as a valid byte?
> 
> Think that is outragous? Then either think again, or try to port your 
> app to Japanese. I am not 100% sure about "/", but "\" is a legitimate 
> second-byte in some Japanese MBCS encoded characters. If your locale is 
> Japanese, these characters are taken in as a whole, and just make out a 
> path. If not, well, you can't use Japanese characters (unicode 
> non-withstanding).
> 
> Now, I don't know the UTF-8 encoding, so I don't know how likely it is 
> to happen there. Some attempt was made to avoid problematic characters. 
> MBCS made sure null cannot be a second byte, for example. I do know that 
> trying to use non-UTF to encode Japanese will require some OS support in 
> the parsing of the string.

In UTF-8 it's a non-issue. BTW, you don't have to go as far as STFW to
find that; simply do 'man utf-8' (which, BTW, was written in '95. Not
very new, and still few people know about it).

> 
>                 Shachar
> 
> 
> 
> =================================================================
> To unsubscribe, send mail to [EMAIL PROTECTED] with
> the word "unsubscribe" in the message body, e.g., run the command
> echo unsubscribe | mail [EMAIL PROTECTED]

        Didi


=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]

Reply via email to