Bug#99933: second attempt at more comprehensive unicode policy

2003-01-02 Thread Colin Walters
On Thu, 2003-01-02 at 13:57, Colin Walters wrote: > #99933 goes a lot farther than #174982. I have a counter-proposal to #99933, which I have attached. I believe it fixes the problems I raised with your proposal, and should also cover some new areas (like filenames). I also hopefully fixed Jame

Bug#175064: Debian policy documents should be UTF-8 encoded

2003-01-02 Thread Colin Walters
On Thu, 2003-01-02 at 16:28, Josip Rodin wrote: > I'm not seeing that with the copy of policy.txt.gz which I generated myself. > Looks like debiandoc2text on Manoj's system used a different, Latin1 locale > and replaced Š for © on my Latin2 system it did no such (foolish) thing. > For the record,

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Lars Wirzenius
> install: > . > . > iconv -f iso-8859-15 -t utf-8 < debian/changelog > > debian/tmp/usr/share/doc/$(PACKAGE)/changelog.Debian > gzip -9 debian/tmp/usr/share/doc/$(PACKAGE)/changelog.Debian Would it not make more sense to do the conversion once, in debian/changelog, rather

Bug#175064: Debian policy documents should be UTF-8 encoded

2003-01-02 Thread Josip Rodin
On Thu, Jan 02, 2003 at 02:01:03PM -0500, Colin Walters wrote: > > Umm, maybe because we use ©? > > Right, but the output documents seem to be ISO-8859-1 encoded. For > example: > > [EMAIL PROTECTED]> zcat /usr/share/doc/debian-policy/policy.txt.gz | iconv > --from-code=UTF-8 --to-code=UTF-8 1>

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Colin Walters
On Thu, 2003-01-02 at 12:37, James Troup wrote: > Colin Walters <[EMAIL PROTECTED]> writes: > > > + http://www.cis.ohio-state.edu/cgi-bin/rfc/rfc2279.html"; > > name="UTF-8"> > > Could we please have a more, err, generic URL for the RFC? The Ohio State CIS department has long been one of th

Bug#175064: Debian policy documents should be UTF-8 encoded

2003-01-02 Thread Colin Walters
On Thu, 2003-01-02 at 13:24, Josip Rodin wrote: > Umm, maybe because we use ©? Right, but the output documents seem to be ISO-8859-1 encoded. For example: [EMAIL PROTECTED]> zcat /usr/share/doc/debian-policy/policy.txt.gz | iconv --from-code=UTF-8 --to-code=UTF-8 1>/dev/null iconv: illegal inp

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread David B Harris
On 02 Jan 2003 17:37:13 + James Troup <[EMAIL PROTECTED]> wrote: > Colin Walters <[EMAIL PROTECTED]> writes: > > > + > id="http://www.cis.ohio-state.edu/cgi-bin/rfc/rfc2279.html"; > > name="UTF-8"> > > Could we please have a more, err, generic URL for the RFC? http://www.faqs.org/ftp/rf

Re: Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Colin Walters
On Thu, 2003-01-02 at 13:12, Radovan Garabik wrote: > see #99933. It has been seconded and generally accepted at the time, > but policy freeze caused it did not get into policy then. Hmm, I searched the policy bug list, I don't know how I missed those. Probably my fault for using galeon-snapshot

Bug#175064: Debian policy documents should be UTF-8 encoded

2003-01-02 Thread Josip Rodin
On Thu, Jan 02, 2003 at 12:19:39PM -0500, Colin Walters wrote: > Package: debian-policy > > On Thu, 2003-01-02 at 11:48, Clint Adams wrote: > > > This proposal is a fairly important yet easy to take first step along > > > the way of transitioning all of Debian to UTF-8. > > > > > > Attached is a

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Radovan Garabik
On Wed, Jan 01, 2003 at 10:07:08PM -0500, Colin Walters wrote: > Package: debian-policy > ... > > Attached is a patch against the latest version of policy. see #99933. It has been seconded and generally accepted at the time, but policy freeze caused it did not get into policy then. Perhaps it

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread James Troup
Colin Walters <[EMAIL PROTECTED]> writes: > + http://www.cis.ohio-state.edu/cgi-bin/rfc/rfc2279.html"; > name="UTF-8"> Could we please have a more, err, generic URL for the RFC? -- James

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Colin Watson
On Thu, Jan 02, 2003 at 11:54:08AM -0500, Colin Walters wrote: > Ok, fair enough. Here's a patch which makes it a "should". Given my > sample with lintian though, I don't think we'll have too many packages > to fix. > > Any seconds? > --- policy.sgml~ 2002-11-15 01:49:40.0 -0500 >

Bug#175064: Debian policy documents should be UTF-8 encoded

2003-01-02 Thread Colin Walters
Package: debian-policy On Thu, 2003-01-02 at 11:48, Clint Adams wrote: > > This proposal is a fairly important yet easy to take first step along > > the way of transitioning all of Debian to UTF-8. > > > > Attached is a patch against the latest version of policy. > > Seconded. The policy docume

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Clint Adams
> This proposal is a fairly important yet easy to take first step along > the way of transitioning all of Debian to UTF-8. > > Attached is a patch against the latest version of policy. Seconded. The policy documents should probably be converted to UTF-8 too. pgp1u8pDVFCTE.pgp Description: PGP

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Colin Walters
On Thu, 2003-01-02 at 09:12, Colin Watson wrote: > Policy applies to all packages, whether they state that they comply with > it yet or not. I'd also go with "should" in the meantime until a survey > is done and we know how many changelogs need to be fixed, but I think in > the long run it ought t

Bug#174982: lintian patch

2003-01-02 Thread Colin Walters
Well, I whipped up a quick lintian patch to check for this, which I will submit against lintian once this proposal gathers the necessary seconds. I've attached it for now in case anyone else wants to see how many packages are broken on their machine. Here are some statistics that may be of intere

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Junichi Uekawa
> > > | Right now, people are putting whatever random characters they feel like > > > | in Debian changelogs; > > > > > > I think we shouldn't use must just yet, since this will cause a lot of > > > packages (you know how many?) to be instantly buggy. If you change > > > the ?must? to ?should?, I

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Manoj Srivastava
>>"Junichi" == Junichi Uekawa <[EMAIL PROTECTED]> writes: >> I think we shouldn't use must just yet, since this will cause a lot of >> packages (you know how many?) to be instantly buggy. If you change >> the «must» to «should», I'll second the proposal. Junichi> Erm.. no, only those package

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Colin Watson
On Thu, Jan 02, 2003 at 06:09:31PM +0900, Junichi Uekawa wrote: > > | Right now, people are putting whatever random characters they feel like > > | in Debian changelogs; they might be encoded in ISO-8859-1, BIG5, > > | ISO-8859-2, ISO-2022-JP, or who knows what. This does come up in the > > | rea

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Josip Rodin
On Thu, Jan 02, 2003 at 06:09:31PM +0900, Junichi Uekawa wrote: > > | Right now, people are putting whatever random characters they feel like > > | in Debian changelogs; > > > > I think we shouldn't use must just yet, since this will cause a lot of > > packages (you know how many?) to be instantly

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Junichi Uekawa
> | Right now, people are putting whatever random characters they feel like > | in Debian changelogs; they might be encoded in ISO-8859-1, BIG5, > | ISO-8859-2, ISO-2022-JP, or who knows what. This does come up in the > | real world; I use apt-listchanges, and I fairly often see broken > | charact

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Colin Walters
[ No need to CC me; I am subscribed to -policy ] On Thu, 2003-01-02 at 00:23, David B Harris wrote: > Could you provide a quick background about what Unicode is Sure. Essentially Unicode is a universal character set, used to encode all the world's languages, plus other symbols from mathematics

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded

2003-01-02 Thread Tollef Fog Heen
* Colin Walters | Support for Unicode, and specifically UTF-8, is steadily increasing | among popular applications in Debian. For example, in unstable, GNOME 2 | has excellent support (almost level 2) in almost all its applications; | the big remaining one is gnome-terminal, of which one require