Re: case mangling and binary strings

2004-11-03 Thread Leopold Toetsch
Sam Ruby <[EMAIL PROTECTED]> wrote: > Python has two data types: str and unicode. Python's unicode "features" are probably not really good examples generally. Ongoing discussion in Python lists seem to indicate that there a rather rough edges still. > - Sam Ruby leo

Re: case mangling and binary strings

2004-11-03 Thread Leopold Toetsch
Dan Sugalski <[EMAIL PROTECTED]> wrote: > At 6:16 PM +0100 11/2/04, Leopold Toetsch wrote: >> >>If it's pure binary pitch a fit. >>If it has an encoding attached, continue, > Yeah, that's the plan. > I'd like to add another entry to the internal API: >OPTIONAL_INTERNAL_EXCEPTION > which wor

Re: case mangling and binary strings

2004-11-03 Thread Leopold Toetsch
Dan Sugalski <[EMAIL PROTECTED]> wrote: > I expect I'll put together a Unicode charset that uses ICU to do its > thing, and go from there. We certainly need Unicode support, so it's > not like we can't do it. (And we still don't have a better option, > unfortunately) ICU 3.0 should be out AFAIK.

Re: case mangling and binary strings

2004-11-03 Thread Adam Thomason
On Tue, 2 Nov 2004 13:33:47 -0500, Dan Sugalski <[EMAIL PROTECTED]> wrote: > At 1:16 PM -0500 11/2/04, Sam Ruby wrote: > > Making ICU optional, at least. It's too problematic on too many > platforms, and just turns into a big headache. It seemed like a good > idea at the time, and while it's still

Re: case mangling and binary strings

2004-11-02 Thread Jeff Clites
On Nov 2, 2004, at 10:46 AM, Dan Sugalski wrote: At 1:42 PM -0500 11/2/04, Sam Ruby wrote: I don't care if Parrot uses ICU on any platform. I do care that Parrot supports utf-8 on every platform. Ah, OK. Yes, we will support all the unicode encodings, as well as the unicode character set, on all p

Re: case mangling and binary strings

2004-11-02 Thread Ben Morrow
Quoth [EMAIL PROTECTED] (Dan Sugalski): > Okay, here's a question for everyone to hash out. > > Assuming I have a parrot string which is explicitly marked as a > binary string... > > What should happen when it's told to upcase/downcase/titlecase > itself? (You may assume that we have strings w

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 1:42 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operat

Re: case mangling and binary strings

2004-11-02 Thread Sam Ruby
Dan Sugalski wrote: At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that co

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 6:16 PM +0100 11/2/04, Leopold Toetsch wrote: Dan Sugalski <[EMAIL PROTECTED]> wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? I

Re: case mangling and binary strings

2004-11-02 Thread Sam Ruby
Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii and perform a us-ascii uppercase on them.

Re: case mangling and binary strings

2004-11-02 Thread Leopold Toetsch
Dan Sugalski <[EMAIL PROTECTED]> wrote: > Okay, here's a question for everyone to hash out. > Assuming I have a parrot string which is explicitly marked as a > binary string... > What should happen when it's told to upcase/downcase/titlecase > itself? If it's pure binary pitch a fit. If it has a

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii and perform a us-ascii uppercase on them. The remaining byte

Re: case mangling and binary strings

2004-11-02 Thread Nicholas Clark
On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: > However, str has an upper() method defined on it. The way it operates > is to take the range of bytes that correspond to us-ascii and perform a > us-ascii uppercase on them. The remaining bytes are left alone. I'd prefer parrot not t

Re: case mangling and binary strings

2004-11-02 Thread James Mastros
Dan Sugalski wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we have strings which are explicitly marked at least

Re: case mangling and binary strings

2004-11-02 Thread Sam Ruby
Dan Sugalski wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we have strings which are explicitly marked at least

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 12:11 PM -0500 11/2/04, Matt Fowles wrote: Dan~ I vote for doing nothing in the up/down case options as those are frequently just used to get a cannonical form for comparison. Although I could understand an argument for throwing an exception... People better not be using binary data as a canonic

Re: case mangling and binary strings

2004-11-02 Thread Matt Fowles
Dan~ I vote for doing nothing in the up/down case options as those are frequently just used to get a cannonical form for comparison. Although I could understand an argument for throwing an exception... Matt On Tue, 2 Nov 2004 11:53:08 -0500, Dan Sugalski <[EMAIL PROTECTED]> wrote: > Okay, here

Re: case mangling and binary strings

2004-11-02 Thread Nicholas Clark
On Tue, Nov 02, 2004 at 11:53:08AM -0500, Dan Sugalski wrote: > Okay, here's a question for everyone to hash out. > > Assuming I have a parrot string which is explicitly marked as a > binary string... > > What should happen when it's told to upcase/downcase/titlecase > itself? (You may assume t