On Thu, Jan 02, 2003 at 02:01:03PM -0500, Colin Walters wrote: > > Umm, maybe because we use ©? > > Right, but the output documents seem to be ISO-8859-1 encoded. For > example: > > [EMAIL PROTECTED]> zcat /usr/share/doc/debian-policy/policy.txt.gz | iconv > --from-code=UTF-8 --to-code=UTF-8 1>/dev/null > iconv: illegal input sequence at position 689 > [EMAIL PROTECTED]>
I'm not seeing that with the copy of policy.txt.gz which I generated myself. Looks like debiandoc2text on Manoj's system used a different, Latin1 locale and replaced Š for © on my Latin2 system it did no such (foolish) thing. For the record, Š is a large latin letter S with a hacek/caron. :) We should probably restrict the build process with LANG=C or something like that. -- 2. That which causes joy or happiness.