You might consider requiring all character sets be able to convert to Unicode, and 
otherwise only have to know how to convert other
character sets to it's own set.

-----Original Message-----
From: Tom Hughes [mailto:[EMAIL PROTECTED]]
Sent: Monday, October 29, 2001 02:31 PM
To: [EMAIL PROTECTED]
Subject: Re: String rationale


In message <[EMAIL PROTECTED]>
          Dan Sugalski <[EMAIL PROTECTED]> wrote:

> At 04:23 PM 10/27/2001 +0100, Tom Hughes wrote:
>
> >Attached is my first pass at this - it's not fully ready yet but
> >is something for people to cast an eye over before I spend lots of
> >time going down the wrong path ;-)
>
> It looks pretty good on first glance.

I've done a bit more work now, and the latest version is attached.

This version can do transcoding. The intention is that there will be
some sort of cache in chartype_lookup_transcoder to avoid repeating
the expensive lookups by name too much.

One interesting question is who is responsible for transcoding
from character set A to character set B - is it A or B? and how
about the other way?

My code currently allows either set to provide the transform on the
grounds that otherwise the unicode module would have to either know
how to convert to everything else or from everything else.

Tom

--
Tom Hughes ([EMAIL PROTECTED])
http://www.compton.nu/

Reply via email to