on 三, 2009-02-18 at 02:43 +0800, Anthony Wong wrote: > I suggest 1. to convert all existing Chinese WML files for the Debian website > from Big5 to UTF-8 > > Any comments? > 如果全部转换成 UTF-8 格式可能会存在问题,假设有两个用户(一个简体,一个繁体)都 贡献了一个翻译:
% cat foo.tc 中國 % cat foo.sc 中国 % enca foo.sc foo.tc foo.sc: Universal transformation format 8 bits; UTF-8 foo.tc: Universal transformation format 8 bits; UTF-8 把简体用户贡献的翻译从 UTF-8 转到 GB2312 是正常的 ~% iconv -f utf8 -t gb2312 foo.sc > foo.sc.gb 但是把繁体用户贡献的翻译从 UTF-8 转到 GB2312 是错误的 ~% iconv -f utf8 -t gb2312 foo.tc > foo.tc.gb iconv: illegal input sequence at position 3 同理,把简体用户贡献的翻译从 UTF-8 转到 BIG5 也是错误的 ~% iconv -f utf8 -t big5 foo.sc > foo.sc.big iconv: illegal input sequence at position 3 ~% iconv -f utf8 -t big5 foo.tc > foo.tc.big ~% enca foo.* foo.sc: Universal transformation format 8 bits; UTF-8 foo.sc.big: Traditional Chinese Industrial Standard; Big5 foo.sc.gb: Simplified Chinese National Standard; GB2312 foo.tc: Universal transformation format 8 bits; UTF-8 foo.tc.big: Traditional Chinese Industrial Standard; Big5 foo.tc.gb: Simplified Chinese National Standard; GB2312 -- Vern 2009-02-18
signature.asc
Description: Digital signature