Re: Questions about Exceptions & Re: What Unicode means to us

2004-08-14 Thread Larry Wall
On Sat, Aug 14, 2004 at 12:26:23AM -0400, Michael Stone wrote: : Finally, (and I'm just thinking about charset stuff here), exceptions : frequently involve a bit of string processing. So if the : charset/encoding code starts throwing exceptions, how do we write the : string processing that thos

Questions about Exceptions & Re: What Unicode means to us

2004-08-13 Thread Michael Stone
Larry Wall wrote: On Wed, Aug 11, 2004 at 04:16:48PM -0400, Dan Sugalski wrote: : If it's that handy then someone can write a library routine. This : feels very much to be in the same category as running a : speech-to-text algorithm if you send WAV data to a text filehandle. Well, you can write

Re: What Unicode means to us

2004-08-13 Thread Michael Stone
Aaron, I happen to agree with Dan about the unwieldiness of replacing characters with their full names during character translation, but your idea of using Unicode equivalents seems more palatable. I'm going to ignore the issue of how this method of handling errors fits into the scheme o

Re: What Unicode means to us

2004-08-13 Thread Larry Wall
On Wed, Aug 11, 2004 at 04:16:48PM -0400, Dan Sugalski wrote: : If it's that handy then someone can write a library routine. This : feels very much to be in the same category as running a : speech-to-text algorithm if you send WAV data to a text filehandle. Well, you can write a library routine,

Re: What Unicode means to us

2004-08-13 Thread Dan Sugalski
At 7:37 PM +1000 8/10/04, Adam Richardson wrote: I'm not sure how you plan to integrate the database level (or whether it affects what you are doing at all), but presumably you know all about the new encoding and collation sets in mySQL 4.1. Things have changed quite a bit there from 4.0, and I've

Re: What Unicode means to us

2004-08-13 Thread Dan Sugalski
At 10:48 AM -0400 8/11/04, Aaron Sherman wrote: I don't want to argue per-se (that doesn't do anyone any good), so if your mind is made up, that's cool... still, I think there's some value in exploring the options, so read on if you're so inclined. On Wed, 2004-08-11 at 04:40, Dan Sugalski wrote:

Re: What Unicode means to us

2004-08-13 Thread Aaron Sherman
I don't want to argue per-se (that doesn't do anyone any good), so if your mind is made up, that's cool... still, I think there's some value in exploring the options, so read on if you're so inclined. On Wed, 2004-08-11 at 04:40, Dan Sugalski wrote: > > Converting Unicode to non-Unicode c

Re: What Unicode means to us

2004-08-13 Thread Aaron Sherman
On Mon, 2004-08-09 at 14:14, Dan Sugalski wrote: > Additionally if we have source text which is > Latin-n, EBCDIC, ASCII, or whatever we must be > able to convert it with no loss to Unicode. > (Which I believe is now doable with Unicode 4.0) > Losslessly converting Unicode to > ASCII/EBCDIC/w

Re: What Unicode means to us

2004-08-13 Thread Dan Sugalski
At 4:15 PM -0400 8/10/04, Aaron Sherman wrote: On Mon, 2004-08-09 at 14:14, Dan Sugalski wrote: Additionally if we have source text which is Latin-n, EBCDIC, ASCII, or whatever we must be able to convert it with no loss to Unicode. (Which I believe is now doable with Unicode 4.0) Losslessly co

Re: What Unicode means to us

2004-08-10 Thread Adam Richardson
On Monday, 9 August 2004 at 4:14 AM +1000, Dan Sugalski wrote: >Since this has been a sore spot lately, and one >we need to deal with. Might as well formally >define what that is. > >We must be able to: > >*) Load in string data from an IO source, >regardless of its encoding, and treat it as >Unic

Re: What Unicode means to us

2004-08-09 Thread Nicholas Clark
On Mon, Aug 09, 2004 at 02:14:46PM -0400, Dan Sugalski wrote: > We don't care about on-screen rendering or date/time/money formatting. And whilst every language out there might need these functions available to its apps, this sounds like a module for the Comprehensive PIR Archive Network. (ie I a