Re: More character matching bits

2001-06-12 Thread Jarkko Hietaniemi
For reference, here's how Perl 5.8 will define \p{IsFoo} character classes: # 005F: SPACING UNDERSCROE ['IsWord', '$cat =~ /^[LMN]/ or $code eq "005F"', ''], ['IsAlnum', '$cat =~ /^[LMN]/',''], ['IsAlpha', '$cat =~ /^[LM]/', ''], # 0009: HORIZONTAL TABULATION #

Re: More character matching bits

2001-06-12 Thread Bryan C . Warnock
On Wednesday 13 June 2001 12:23 am, Jarkko Hietaniemi wrote: > > RE Feature Override Create New > > > > switches 'i' only yes > > anchorsno no > > (I would call them assertions.) Bzzt. > Another gig for Bean. > >

Re: More character matching bits

2001-06-12 Thread Jarkko Hietaniemi
> RE Feature Override Create New > > switches 'i' only yes > anchorsno no (I would call them assertions.) Bzzt. > - Anchors. ^,$,\A,\Z,\z,\b, \G. Since the definition of a line (see 'm' > and 's' above) isn't

Re: More character matching bits

2001-06-12 Thread Bryan C . Warnock
On Tuesday 12 June 2001 10:58 pm, Bryan C. Warnock wrote: > On Tuesday 12 June 2001 09:16 pm, Simon Cozens wrote: > > On Tue, Jun 12, 2001 at 05:41:40PM -0700, Hong Zhang wrote: > > > We should let external collator to handle all these fancy features. > > > > Phew, I've been saying this all along.

Re: More character matching bits

2001-06-12 Thread Jarkko Hietaniemi
> I think, following my line of thought, that [a-\N{KATAKANA LETTER KI}] > should be equivalent to [\x{0061}-\x{30AD}], which would match any of I think it should be an error. If you mean the code points write the code points. Mixing symbolic names (KATAKANA LETTER KI) and native characters (th

Re: More character matching bits

2001-06-12 Thread Buddha Buck
Jarkko Hietaniemi <[EMAIL PROTECTED]> writes: > > Perl came from ASCII-centric roots, so it's likely that most of our > > biases are ASCII-centric. And for a couple of reasons, it's going to > > be hard to deal with that: > > > > 1. Backwards compatability with existing Perl practice, > > > >

Re: More character matching bits

2001-06-12 Thread Bryan C . Warnock
On Tuesday 12 June 2001 11:06 pm, Jarkko Hietaniemi wrote: > > I. Make ranges work on Unicode code-points (if they don't already). > > U, yes, they do, if you by code-point ranges mean \x{...}-\x{...} > but in general I would like to discourage the use of ranges. What do > you think [a-\N{KAT

Re: More character matching bits

2001-06-12 Thread Jarkko Hietaniemi
> Perl came from ASCII-centric roots, so it's likely that most of our > biases are ASCII-centric. And for a couple of reasons, it's going to > be hard to deal with that: > > 1. Backwards compatability with existing Perl practice, > > and > > 2. To do language-neutral right is -really- hard; lo

Re: More character matching bits

2001-06-12 Thread Bryan C . Warnock
On Tuesday 12 June 2001 09:16 pm, Simon Cozens wrote: > On Tue, Jun 12, 2001 at 05:41:40PM -0700, Hong Zhang wrote: > > We should let external collator to handle all these fancy features. > > Phew, I've been saying this all along. :) I think we've *all* been saying that. We just need to determin

Re: More character matching bits

2001-06-12 Thread Buddha Buck
Dan Sugalski <[EMAIL PROTECTED]> writes: > > We probably also ought to answer the question "How accommodating to > non-latin writing systems are we going to be?" It's an uncomfortable > question, but one that needs asking. Answering by Larry, probably, but > definitely asking. Perl's not real

Re: More character matching bits

2001-06-12 Thread Simon Cozens
We've pretty much run this subthread out of Perl content by now, so it ought to stop here, and I should start exercising some of that "restraint" thing. (Does it grow if you exercise it?) So Damien, we can take it to private mail or to sci.lang.japan or something, but if you promise to stop diggi

Re: More character matching bits

2001-06-12 Thread Simon Cozens
On Tue, Jun 12, 2001 at 06:45:31PM -0700, Damien Neil wrote: > > Hrm, no, not usually; furigana are almost always hiragana, and > > learner's textbooks - bah, they're not real Japanese. :) > > I believe you are confused; *cough*. I believe I am not. But who am I? Let's ask Kenkyusha - admittedly

Re: More character matching bits

2001-06-12 Thread Damien Neil
On Wed, Jun 13, 2001 at 02:15:16AM +0100, Simon Cozens wrote: > Or we could keep it out of core. It's up to you, really. No, it isn't. It's up to Larry, or to whoever gets the regex pumpkin. I'm withdrawing from this discussion: My intent was to clarify exactly why someone might want to treat K

RE: More character matching bits

2001-06-12 Thread Grant Mongardi
On Tue, Jun 12, 2001 at 06:44:02PM -0400, Dan Sugalski wrote: > We probably also ought to answer the question "How accommodating to > non-latin writing systems are we going to be?" What if Perl 6 simply reserved tags for extensions? This could assume processing similar to Perl 5 for compatibility

Re: More character matching bits

2001-06-12 Thread Simon Cozens
On Tue, Jun 12, 2001 at 05:41:40PM -0700, Hong Zhang wrote: > We should let external collator to handle all these fancy features. Phew, I've been saying this all along. :) > Please note regex is O(n) at best, adding an external collator > will make is O(2n). While this is very true, I think con

Re: More character matching bits

2001-06-12 Thread Simon Cozens
On Tue, Jun 12, 2001 at 05:40:32PM -0700, Damien Neil wrote: > The ability to match Hiragana as Katakana and vice-versa is almost > identical conceptually to the ability to perform case insensitive > matches on English text. I am going to choose not to disagree with you on this, but... > > What

Re: More character matching bits

2001-06-12 Thread Jarkko Hietaniemi
On Tue, Jun 12, 2001 at 05:41:40PM -0700, Hong Zhang wrote: > > We should let external collator to handle all these fancy features. > People can always normalize/canonicalize/do-whatever-you-want > and send the result text/binary to regex. All the features we > argue about here can be easily done

Re: More character matching bits

2001-06-12 Thread Damien Neil
On Wed, Jun 13, 2001 at 01:22:32AM +0100, Simon Cozens wrote: > I'd say it was about as useful as providing a regexp option to translate > the search term into French and try that instead.[1] Handy, possibly. > Essential? No. Something that should be part of the core? I'll leave > that for you to

RE: More character matching bits

2001-06-12 Thread Hong Zhang
We should let external collator to handle all these fancy features. People can always normalize/canonicalize/do-whatever-you-want and send the result text/binary to regex. All the features we argue about here can be easily done by a customized collator. Do NOT expect the Perl regex be a linguist

Re: More character matching bits

2001-06-12 Thread Simon Cozens
On Tue, Jun 12, 2001 at 05:03:17PM -0700, Damien Neil wrote: > I can say that I feel that providing a mechanism for Hiragana > characters to match Katakana and vice-versa is about as useful for a > person doing Japanese text processing as case-insensitive matching is > for a person working with En

Re: More character matching bits

2001-06-12 Thread Damien Neil
On Tue, Jun 12, 2001 at 06:44:02PM -0400, Dan Sugalski wrote: > While that's true, KATAKANA LETTER A and HIRAGANA LETTER A are also > referring to distinct things. (Though arguably not as distinct as either > with LATIN CAPITAL A) If we do one, why not the other? I'm perfectly happy > with an a

Re: More character matching bits

2001-06-12 Thread Dan Sugalski
At 03:12 PM 6/11/2001 -0700, Damien Neil wrote: >On Mon, Jun 11, 2001 at 05:03:26PM -0400, Dan Sugalski wrote: > > I don't think just /i should do that, as it seems rather extreme. (If you > > took that argument, it would seem to follow that KATAKANA LETTER A matches > > LATIN CAPITAL A, and I don

Re: Should the op dispatch loop decode?

2001-06-12 Thread Damien Neil
On Tue, Jun 12, 2001 at 06:12:35PM -0400, Dan Sugalski wrote: > At the moment I'm leaning towards the functions doing their own decoding, > as it seems likely to be faster. (Though we'd be duplicating the decoding > logic everywhere, and bigger's reasonably bad) Possibly mandating shadow > func

Re: Should the op dispatch loop decode?

2001-06-12 Thread Benjamin Stuhl
--- Dan Sugalski <[EMAIL PROTECTED]> wrote: > 'Kay, here's a question to ponder. Should the op dispatch > loop handle > argument decoding, or should that be left to the opcode > functions? [good analysis of trade-off's snipped] > At the moment I'm leaning towards the functions doing > their own

Re: tasty db data

2001-06-12 Thread Dan Sugalski
At 04:39 PM 6/12/2001 -0500, David L. Nicol wrote: >Dan Sugalski wrote: > > > > David L. Nicol [made an akward metaphor with data as summer campers] > > This is a considerably less simple problem than you (and *definitely* I) > > might like. :( > >I appear to be suggesting that deferability be an

Should the op dispatch loop decode?

2001-06-12 Thread Dan Sugalski
'Kay, here's a question to ponder. Should the op dispatch loop handle argument decoding, or should that be left to the opcode functions? The upside to the functions handling the decoding is they can special-case it. makeref (a hypothetical "make a reference to a PMC" operator), for example, wo

tasty db data

2001-06-12 Thread David L. Nicol
Since I just proposed a new paradigm I'll try to apply it, before darting down the hill and getting my sandwich. Dan Sugalski wrote: > > David L. Nicol [made an akward metaphor with data as summer campers] > That's less easy than you might think. Quick: > >$bar = bar(); > > is $bar acti

Re: deferred FETCH called "tasty"

2001-06-12 Thread Dan Sugalski
At 03:55 PM 6/12/2001 -0500, David L. Nicol wrote: >We can't simply tie in the perl5 sense, because tiedness does not persist >accross assignment. So we need to either modify Assignment to allow more >than value to get assigned, or we need to pass around something -- a magic >coderef, perhaps --

Re: Multi-dimensional arrays and relational db data

2001-06-12 Thread Dan Sugalski
At 02:07 PM 6/12/2001 -0500, David L. Nicol wrote: >Dan Sugalski wrote: > > > I'm still trying to formulate a good set of rules on how I think active > > data should perform under optimization to pass on to Larry. > > > >How about, Active data doesn't get optimized. Static data doesn't >care if y

deferred FETCH called "tasty"

2001-06-12 Thread David L. Nicol
I think I'm repeating what has been said already but here goes. After sending this I'm breaking for a sandwich. :) for database data, the problem domain is limited sensibly. We want to defer as many lookups as possible, so they can be sent as a bunch rather than sent one by one, and we would l

Re: Coupla Questions

2001-06-12 Thread David L. Nicol
Damian Conway wrote: > > Graham wrote: > >> Now I may be wrong here, but I thought I remembered something about >> >> .foo being the same as $_.foo > > It's certainly a possibility. > >> In which case you could do >> >>for (%database.$accountnumber) { >> >

Re: Multi-dimensional arrays and relational db data

2001-06-12 Thread David L. Nicol
Dan Sugalski wrote: > I'm still trying to formulate a good set of rules on how I think active > data should perform under optimization to pass on to Larry. > > Dan How about, Active data doesn't get optimized. Static data doesn't care if you access ir or

Re: Social Reform

2001-06-12 Thread Daniel S. Wilkerson
Excuse me, my mistake. David Grove wrote: > > If you have not been following this thread, then maybe that is > > the reason for > > the confused-sounding nature of your email. > > > > I would say Simon was the one "ignoring an issue and attacking a > > person", not > > Vijay. I think Vijay was

Re: Embrace polymorphic builtins, for they are cool.

2001-06-12 Thread John Porter
Daniel S. Wilkerson wrote: > Consider the beauty of a fast and complex Irish ceili dance and how > one clumsy beginner can easily break someone else's ankle, and you see > that the very strict rules of that dance are necessary for the freedom > to dance it without getting hurt. No rules lead to "

RE: Social Reform

2001-06-12 Thread David Grove
> > Well, I *have* been following the discussion. And to me, it looks indeed > > like you, Simon, were indeed attacking ME on non-technical grounds. > > Vijay just jumped in for him, like a lioness trying to protect her > > kittens. > > Which he does from time to time, as do most of us, myself lik

Re: Coupla Questions

2001-06-12 Thread Damian Conway
Graham wrote: > Now I may be wrong here, but I thought I remembered something about > > .foo being the same as $_.foo It's certainly a possibility. > In which case you could do > >for (%database.$accountnumber) { > >.interestearned += $interestrat

RE: Social Reform

2001-06-12 Thread David Grove
> -Original Message- > From: Bart Lateur [mailto:[EMAIL PROTECTED]] > Sent: Tuesday, June 12, 2001 10:48 AM > To: Perl 6 Language Mailing List > Subject: Re: Social Reform > > > On Tue, 12 Jun 2001 08:54:13 +0100, Simon Cozens wrote: > > >On Mon, Jun 11, 2001 at 05:19:26PM -0700, Daniel S.

Re: Social Reform

2001-06-12 Thread Bart Lateur
On Tue, 12 Jun 2001 08:54:13 +0100, Simon Cozens wrote: >On Mon, Jun 11, 2001 at 05:19:26PM -0700, Daniel S. Wilkerson wrote: >> I would say Simon was the one "ignoring an issue and attacking a person", not >> Vijay. > >You are wrong. Go back through the archives. Vijay has posted four >messages

RE: Social Reform

2001-06-12 Thread David Grove
> On Mon, Jun 11, 2001 at 05:19:26PM -0700, Daniel S. Wilkerson wrote: > > I would say Simon was the one "ignoring an issue and attacking > a person", not > > Vijay. > > You are wrong. Go back through the archives. Vijay has posted four > messages: two of which are critical of Perl, two of which a

Re: Multi-dimensional arrays and relational db data

2001-06-12 Thread Dan Sugalski
At 10:33 PM 6/11/2001 -0500, David L. Nicol wrote: >[EMAIL PROTECTED] wrote: > > > You may wish to read this thread about lazy arrays and object > > persistence to get an idea of what you're getting into. > > http://www.geocrawler.com/archives/3/3024/2001/3/0/5427925/ > >Taking lazy as far as we c

EXPECTED LIST BEHAVIOUR

2001-06-12 Thread Dan Sugalski
Folks, I'd like to take this opportunity to point out that it is expected that we will all conduct ourselves in a generally professional and courteous manner on these lists. If for some reason you find you can't do this, do please take a step back until you can--if you don't a step will be pro

RE: suggested properties of operator results

2001-06-12 Thread Dan Sugalski
At 09:15 AM 6/12/2001 +1000, Damian Conway wrote: >Dave Whipp asks: > > > Does it do short-circuit evaluation, too? > >I would certainly expect it to, yes. It will, unless Larry specs it out otherwise. Dan --"it's l

RE: Social Reform

2001-06-12 Thread David Grove
> If you have not been following this thread, then maybe that is > the reason for > the confused-sounding nature of your email. > > I would say Simon was the one "ignoring an issue and attacking a > person", not > Vijay. I think Vijay was the one pointing out that this person ("Me") was > contrib

Re: Social Reform

2001-06-12 Thread Simon Cozens
On Mon, Jun 11, 2001 at 05:19:26PM -0700, Daniel S. Wilkerson wrote: > I would say Simon was the one "ignoring an issue and attacking a person", not > Vijay. You are wrong. Go back through the archives. Vijay has posted four messages: two of which are critical of Perl, two of which are pretty he

Re: Coupla Questions

2001-06-12 Thread Graham Barr
On Mon, Jun 11, 2001 at 10:39:51PM -0500, David L. Nicol wrote: > Hopefully, we'll get a "with" operator and everything: > > with %database.$accountnumber { > > .interestearned += $interestrate * .balance > > } > > anything short of that, in my opinion, is merely trad