Re: lists.debian.org de-localization

2003-02-12 Thread Tomohiro KUBOTA
Hi, (Remember, the topic is that http://lists.debian.org pages sometimes use 8bit characters which may break all contents after the character when east Asian users browse the pages.) From: Josip Rodin <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Sun, 12 Jan 2003

Re: lists.debian.org de-localization

2003-01-12 Thread Tomohiro KUBOTA
Hi, From: Josip Rodin <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Sun, 12 Jan 2003 04:14:45 +0100 > This, on the other hand, is a hassle to handle (backporting or installation > into subdirs). master.d.o is scheduled to be upgraded to woody after samosa.

Re: lists.debian.org de-localization

2003-01-11 Thread Josip Rodin
On Sun, Jan 12, 2003 at 10:38:52AM +0900, Tomohiro KUBOTA wrote: > However, I don't think this can be a solution now because it will take a > very long time that the version will be stable, then the stable version > will be adopted into unstable/testing version of Debian distribution, then > the di

Re: lists.debian.org de-localization

2003-01-11 Thread Tomohiro KUBOTA
Hi, From: Tomohiro KUBOTA <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Tue, 07 Jan 2003 21:45:05 +0900 (JST) > I think more important problem is how to deal with raw 8bit mail > headers without encoding specification or encodings which are not > s

Re: lists.debian.org de-localization

2003-01-07 Thread Tomohiro KUBOTA
Hi, From: Josip Rodin <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Tue, 7 Jan 2003 11:41:36 +0100 > Hm, but doesn't the section on character sets cover the mails themselves as > well? There are a bit under twenty thousand indices, which is a large a

Re: lists.debian.org de-localization

2003-01-07 Thread Josip Rodin
On Tue, Jan 07, 2003 at 09:29:33AM +0900, Tomohiro KUBOTA wrote: > I have an idea about an easy modification to old list pages. > > Add the following line to all > http://lists.debian.org/*/*/threads.html , > http://lists.debian.org/*/*/maillist.html , > http://lists.debian.org/*/*/subject.html ,

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: Marco d'Itri <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Tue, 7 Jan 2003 01:10:29 +0100 > On Jan 06, Tomohiro KUBOTA <[EMAIL PROTECTED]> wrote: > > >> This is not needed, only spammers put raw latin-1 characters in mail >

Re: lists.debian.org de-localization

2003-01-06 Thread Marco d'Itri
On Jan 06, Tomohiro KUBOTA <[EMAIL PROTECTED]> wrote: >> This is not needed, only spammers put raw latin-1 characters in mail >> headers. >The key point is that when we receive a mail with raw 8bit characters, The key point is that we should not even accept mail with raw 8bit characters in the

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: Josip Rodin <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Mon, 6 Jan 2003 16:07:49 +0100 > Future only. Is there a pressing need to regenerate the old mails? > I would rather avoid it... I have an idea about an easy modification to old list pag

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: Josip Rodin <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Mon, 6 Jan 2003 16:07:49 +0100 > On Mon, Jan 06, 2003 at 11:42:44PM +0900, Tomohiro KUBOTA wrote: > > Thank you for commiting the modification of debian.rc . Does the change > > a

Re: lists.debian.org de-localization

2003-01-06 Thread Josip Rodin
On Mon, Jan 06, 2003 at 11:42:44PM +0900, Tomohiro KUBOTA wrote: > Thank you for commiting the modification of debian.rc . Does the change > affect future archives only? Or all past and future archives? Future only. Is there a pressing need to regenerate the old mails? I would rather avoid it...

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: Josip Rodin <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Mon, 6 Jan 2003 15:09:47 +0100 > > (permission of klecker:/org/www.debian.org/cron/people_scripts/people.pl > > I have no idea how you came from mhonarc to people.pl, but okay. :)

Re: lists.debian.org de-localization

2003-01-06 Thread Josip Rodin
On Mon, Jan 06, 2003 at 03:09:47PM +0100, Josip Rodin wrote: > > > > Why not use iso_8859::str2sgml; instead of mhonarc::htmlize for > > > > iso-8859-1? > > [...] > > > Sounds like a very good idea. > > > > Who should I ask for this modification? > > This is the right place to ask, I was watchin

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: Edmund GRIMLEY EVANS <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Mon, 6 Jan 2003 13:45:47 + > If the headers contain 8-bit octets and are valid as UTF-8, it's > fairly safe to assume that they really are UTF-8. Otherwise, you could >

Re: lists.debian.org de-localization

2003-01-06 Thread Josip Rodin
On Mon, Jan 06, 2003 at 10:09:11AM +0900, Tomohiro KUBOTA wrote: > > > Why not use iso_8859::str2sgml; instead of mhonarc::htmlize for > > > iso-8859-1? > [...] > > Sounds like a very good idea. > > Who should I ask for this modification? This is the right place to ask, I was watching the discus

Re: lists.debian.org de-localization

2003-01-06 Thread Edmund GRIMLEY EVANS
Tomohiro KUBOTA <[EMAIL PROTECTED]>: > The key point is that when we receive a mail with raw 8bit characters, > we don't have an easy and relyable method to tell the characters are > from ISO-8859-1 or KOI8-R or other character sets. If the headers contain 8-bit octets and are valid as UTF-8, it'

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: Marco d'Itri <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization Date: Mon, 6 Jan 2003 13:34:17 +0100 > >Again, speaking about lists.debian.org, my original idea is to assume > >all 8bit raw characters to be ISO-8859-1, though I don't

Re: lists.debian.org de-localization

2003-01-06 Thread Marco d'Itri
On Jan 06, Tomohiro KUBOTA <[EMAIL PROTECTED]> wrote: >IMO, in such a case, allowing raw 8bit mails is better (i.e., its merit >is larger than its demerit) than disabling them. > >Again, speaking about lists.debian.org, my original idea is to assume >all 8bit raw characters to be ISO-8859-1,

Re: lists.debian.org de-localization

2003-01-06 Thread Denis Barbier
On Mon, Jan 06, 2003 at 10:09:11AM +0900, Tomohiro KUBOTA wrote: > Hi, > > From: [EMAIL PROTECTED] (Denis Barbier) > Subject: Re: lists.debian.org de-localization (Re: automatically-generated > ISO-8859-1 characters in mulbibyte webpages) > Date: Sun, 5 Jan 2003 15:33:41 +010

Re: lists.debian.org de-localization

2003-01-06 Thread Tomohiro KUBOTA
Hi, From: "Stephen J. Turnbull" <[EMAIL PROTECTED]> Subject: Re: lists.debian.org de-localization (Re: automatically-generated ISO-8859-1 characters in mulbibyte webpages) Date: Sun, 05 Jan 2003 16:10:02 +0900 > This is a fairly small sample (about 100 subscribers, 2

Re: lists.debian.org de-localization

2003-01-05 Thread Tomohiro KUBOTA
Hi, From: [EMAIL PROTECTED] (Denis Barbier) Subject: Re: lists.debian.org de-localization (Re: automatically-generated ISO-8859-1 characters in mulbibyte webpages) Date: Sun, 5 Jan 2003 15:33:41 +0100 > > Why not use iso_8859::str2sgml; instead of mhonarc::htmlize for iso-8859-1? [...] &g

Re: lists.debian.org de-localization (Re: automatically-generated ISO-8859-1 characters in mulbibyte webpages)

2003-01-05 Thread Denis Barbier
On Sun, Jan 05, 2003 at 10:18:48AM +0900, Tomohiro KUBOTA wrote: [...] > > > > plain; mhonarc::htmlize; > > us-ascii; mhonarc::htmlize; > > iso-8859-1; mhonarc::htmlize; > > iso-8859-2; iso_8859::str2sgml; iso8859.pl > > iso-8859-3; iso_8859::str2sgml; iso8859.pl

Re: lists.debian.org de-localization (Re: automatically-generated ISO-8859-1 characters in mulbibyte webpages)

2003-01-05 Thread Stephen J. Turnbull
> "Marco" == Marco d'Itri <[EMAIL PROTECTED]> writes: Marco> It would be *MUCH* better to just refuse these Marco> messages. Most of them are spam anyway. At least in my Marco> country (and in all western europe, I think) raw latin-1 Marco> characters in headers are never fou

Re: lists.debian.org de-localization (Re: automatically-generated ISO-8859-1 characters in mulbibyte webpages)

2003-01-04 Thread Marco d'Itri
On Jan 05, Tomohiro KUBOTA <[EMAIL PROTECTED]> wrote: >It would be nice if we can convert raw 8bit mail headers (though it is >illegal; it sometimes happens and may cause breaking the lists.debian.org >pages) to SGML entities by assuming they are ISO-8859-1. Since this may >annoy Russian (and