parrotcode.org version info patch
attached below sig. --cal "just take a little look from our side when you can" --- index.html Wed Mar 19 08:10:15 2003 +++ index.html.new Wed Mar 19 08:11:03 2003 @@ -62,9 +62,9 @@ How to get the code Periodic releases will appear on CPAN; the current release is -version 0.0.9, and can be found in the CPAN +version 0.0.10, and can be found in the CPAN -http://www.cpan.org/modules/by-authors/id/S/SF/SFINK/parrot-0.0.9.tar.gz";>source directory. +http://www.cpan.org/modules/by-authors/id/S/SF/SFINK/parrot-0.0.10.tar.gz";>source directory.
[RFC] maninfestation, was Re: Parrot binaries
On Mar-18, Ask Bjoern Hansen wrote: > > Hi, > > I tried making an RPM .spec file for parrot and noticed that there's > still no "make install" target. > > To be useful or at least fun to play with I think we would need to > also install things from languages/, but they seem to generally be, > uh, fairly unstructured. :-) I've attached a proof-of-concept patch that I'd like people to comment on in terms of general approach. I made a new MANIFEST.detailed that is the master record of all files as well as what should be done with them. (I haven't done this part yet, but I was thinking the plain MANIFEST could be autogenerated from this file but still committed to CVS.) MANIFEST.detailed in addition to listing files, also gives metadata for each file -- in particular, which sub-RPM (if any) a file should go into. It also includes any generated files that will need to make it into some RPM. I also provided a parrot.spec file that, in combination with tools/mk_manifests.pl, will create a parrot RPM, a parrot-devel RPM, and an RPM for each of the languages. (For now, at least, those should probably all be wrapped up into a single parrot-languages RPM.) My current implementation is quite messy, but it should give an idea for what I'm talking about. rpm-building.tar.gz Description: GNU Zip compressed data
Re: [RFC] maninfestation, was Re: Parrot binaries
On Mar-19, Steve Fink wrote: > > I've attached a proof-of-concept patch that I'd like people to comment Oops! Sorry, that was an ancient parrot.spec file. Here's a new tar file that should actually work. rpm-building.tar.gz Description: GNU Zip compressed data
Re: Rules and hypotheticals: continuations versus callbacks
I would like to express my sincere gratitude to all of you for working through these issues. I bent my brain on the Perl 5 regex engine, and that was just a "simple" recurse-on-success engine--and I'm not the only person it drove mad. I deeply appreciate that Perl 6's regex engine may drive you even madder. But such sacrifices are at the heart of why people love Perl. Thanks! Larry
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, 19 Mar 2003, Matthijs van Duin wrote: > >(4) (internals) Given that Parrot has so many different control mechanisms > >(call/ret, exceptions, closures, continuations, ...), how do we maintain > >consistency? And how much of that is parrot's responsibility (versus the > >perl6 compiler's)? > > I don't think there will be a big problem with that, but maybe I'm just > overlooking things. Exceptions, for example, will have to unwind dynamic bindings, stack frames, call user-defined exception handlers, etc. Using continuations for everything, this isn't a problem -- get it right once and it works everywhere (btw, languages/perl6 uses continuations for its exceptions now). However, they're much more expensive than normal sub calls, so we probably don't want to use them there. I'll let Dan talk about potential issues with throwing exceptions across languages, since he's given it much more thought than yours truly, /s
Re: Rules and hypotheticals: continuations versus callbacks
On Tue, 18 Mar 2003, Matthijs van Duin wrote: > and maybe also: > What is the current plan? > > although I got the impression earlier that there isn't any yet for invoking > subrules :-) See line 1014, languages/perl6/P6C/rule.pm. The hack I used was to call rules like ordinary subs, and have them push marks onto the regex stack before they return. I'm not sure if this can be made to work with hypotheticals, and I'm sure it won't interact kindly with continuation-taking, but there's _something_. As for the interaction with continuations, I was about to post some of my concerns when I received your long and well-thought-out mail. I need to think about the discussion so far a bit more, but briefly: (1) There's more than one way to go when combining dynamically-scoped variables with continuations: for example, do you use dynamic bindings from where the continuation was taken, or from where it's invoked? (see e.g. Scheme's "dynamic-wind"). (2) (internals) The functional-language people have found that full continuations are slow, and put a lot of effort into avoiding them where possible. Backtracking languages like Icon and Prolog are implemented by special mechanisms rather than general continuations, probably for this reason. So if we're forced to do a regex engine using full continuations, it will probably be dog-slow (3) On the other hand, we probably want people to intermix regex backtracking, continuation-taking, and hypothetical/dynamic variables, and have it "do the right thing", where "right" means something like "mind-bendingly difficult to reason about, but consistent". How do we want these features to play with each other? (4) (internals) Given that Parrot has so many different control mechanisms (call/ret, exceptions, closures, continuations, ...), how do we maintain consistency? And how much of that is parrot's responsibility (versus the perl6 compiler's)? /s
Re: A6 Request: Change .req to .arity
--- Steffen Mueller <[EMAIL PROTECTED]> wrote: > Damian Conway wrote: > > Larry wrote: > > > >> On the other hand, I could see an argument that said anyone who > >> doesn't know what .arity means shouldn't be writing routines that > >> depend on it... > > > That was more or less my line of thought. > > Now, I think I'll dare claim my English is not exactly bad for a 21 > year-old non-native speaker. Being a physics and CS student, I do > also have mathematical background, but it still took me a few seconds > to figure out "arity" *in this context*. Maybe that's because I can't > think of an exact German equivalent either; maybe it's because I don't > think a function's arity is quite the same as it's *minimum* number of > parameters? I mean, it makes sense in a functional language... but > you don't have functions with a variable number of arguments there. (Your English sounds pretty darned good to me. :) > To cut this short: I think req or reqargs or somesuch would be > better. Why choose the method names that sound more like computer > science for the very sake of that? FWIW, I'm a CS professional with an appalling lack of mathematical background (though I scored above the math average for Engineers on my GRE, and didn't miss any of the logic questions). I'd never *heard* of "arity", but tend to use esoteric language attributes rather often (which is not a boast -- it's a rather bad habit, but there it is). Anyway, I like writing functions and methods that DWIM, but sometimes WIM takes some doing. I'd rather have a name that means something to me, too though to be honest, "arity" would mean something to me, if someone would just explain it, lol Paul __ Do you Yahoo!? Yahoo! Platinum - Watch CBS' NCAA March Madness, live on your desktop! http://platinum.yahoo.com
Re: Rules and hypotheticals: continuations versus callbacks
At 5:54 PM + 3/19/03, Simon Cozens wrote: [EMAIL PROTECTED] (Dan Sugalski) writes: At 5:47 PM + 3/19/03, Simon Cozens wrote: >[EMAIL PROTECTED] (Dan Sugalski) writes: >> you aren't allowed to selectively redefine >> rules in the middle of a regex that uses those rules. > >This is precisely what a macro does. Not once execution starts, no. Compilation's just execution of a regex, albeit the Perl6::Grammar::program regex, and that regex will need to be modified while it's in operation in order to pick up macro "is parsed" definitions and apply them to the rest of what it's parsing. Ah, damn, I wasn't thinking far enough out. I'm not sure it'll work quite like that, with a single call to the regex engine that spits out everything in one go. More likely it'll be a set of iterative calls to the engine that terminate at natural sequence points, potentially with recursive calls into the parsing regex. -- Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: Rules and hypotheticals: continuations versus callbacks
[EMAIL PROTECTED] (Dan Sugalski) writes: > >Compilation's just execution of a regex, albeit the Perl6::Grammar::program > >regex, and that regex will need to be modified while it's in operation in > >order to pick up macro "is parsed" definitions and apply them to the rest > >of what it's parsing. > > Ah, damn, I wasn't thinking far enough out. This, you see, is precisely why some of us started work last year on a regular expression engine which could handle having its expressions rewritten during the match... ;) -- Last week I forgot how to ride a bicycle. -- Steven Wright
Re: A6 Request: Change .req to .arity
--- Paul <[EMAIL PROTECTED]> wrote: > > --- Steffen Mueller <[EMAIL PROTECTED]> wrote: > > Damian Conway wrote: > > > Larry wrote: > > > > > >> On the other hand, I could see an argument that said anyone who > > >> doesn't know what .arity means shouldn't be writing routines > that > > >> depend on it... > > > > > That was more or less my line of thought. > > > > Now, I think I'll dare claim my English is not exactly bad for a 21 > > > year-old non-native speaker. Being a physics and CS student, I do > > also have mathematical background, but it still took me a few > seconds > > to figure out "arity" *in this context*. Maybe that's because I > can't > > think of an exact German equivalent either; maybe it's because I > don't > > think a function's arity is quite the same as it's *minimum* number > of > > parameters? I mean, it makes sense in a functional language... but > > you don't have functions with a variable number of arguments there. > > (Your English sounds pretty darned good to me. :) > > > To cut this short: I think req or reqargs or somesuch would be > > better. Why choose the method names that sound more like computer > > science for the very sake of that? > > FWIW, I'm a CS professional with an appalling lack of mathematical > background (though I scored above the math average for Engineers on > my > GRE, and didn't miss any of the logic questions). I'd never *heard* > of > "arity", but tend to use esoteric language attributes rather often > (which is not a boast -- it's a rather bad habit, but there it is). > > Anyway, I like writing functions and methods that DWIM, but sometimes > WIM takes some doing. I'd rather have a name that means something to > me, too though to be honest, "arity" would mean something to me, > if > someone would just explain it, lol > http://www.dictionary.com/search?q=arity 1 entry found for arity. arity The number of arguments a function or operator takes. In some languages functions may have variable arity which sometimes means their last or only argument is actually a list of arguments. (1997-07-21) Source: The Free On-line Dictionary of Computing, © 1993-2001 Denis Howe =Austin
Re: A6 Request: Change .req to .arity
On Wed, Mar 19, 2003 at 04:38:51PM +0100, Steffen Mueller wrote: : Damian Conway wrote: : >Larry wrote: : > : >>On the other hand, I could see an argument that said anyone who : >>doesn't know what .arity means shouldn't be writing routines that : >>depend on it... : : >That was more or less my line of thought. : : Now, I think I'll dare claim my English is not exactly bad for a 21 : year-old non-native speaker. Being a physics and CS student, I do also : have mathematical background, but it still took me a few seconds to : figure out "arity" *in this context*. Maybe that's because I can't think : of an exact German equivalent either; maybe it's because I don't think a : function's arity is quite the same as it's *minimum* number of : parameters? I mean, it makes sense in a functional language... but you : don't have functions with a variable number of arguments there. Sure, but one can imagine having functions with a given arity that can nonetheless be modified adverbially. In this view, required parameters contribute to "arity", but optional parameters are only used for, er, options. Larry
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 12:35:19PM -0500, Dan Sugalski wrote: Then I wasn't clear enough, sorry. This is perl -- the state of something at compile time is just a suggestion as to how things ultimately work. Yes, hence my surprise about actually inlining stuff, luckily that was just a misunderstanding :-) I'll nudge Larry to add it explicitly, but in general redefinitons of code that you're in the middle of executing don't take effect immediately, and it's not really any different for regex rules than for subs. Ah, but we're not redefining the sub that's running, but the subs it's about to call. That works for subs, and Simon Cozens already pointed out we certainly also need it for rules :-) Actually, we should be extraordinarily liberal with the application of restrictions at this phase. It's far easier to lift a restriction later than to impose it later, This is perl 6, we can add a new restriction next week and I very much want to stomp out any constructs that will force slow code execution. Yes, I may lose, but if I don't try... You're absolutely right, and optimization is very important to me too. But you can't *only* look at the speed of constructs, or we'll be coding in C or assembly :-) We'll need to meet in the middle.. The issue of hypotheticals is complex. Well, I'm a big boy, I'm sure I can handle it. Are you even talking about semantics or implementation here? Because I already gave my insights on semantics, and I have 'em in my head for implementation too but I should probably take those to perl6-internals instead. Ultimately the question is "How do you backtrack into arbitrary code, and how do we know that the arbitrary code can be backtracked into?" My answer is we don't, but I'm not sure how popular that particular answer is. I say, make generic semantics first, and then optimize the heck out of it. That's fine. I disagree. :) Now that Simon Cozens has established that sub-rules need to be looked up at runtime, I think we can both be happy: As far as I can see, a rule will consist of two parts: The wrapper that will handle stuff when the rule is invoked as a normal method, perhaps handle modifiers, handle searches for unanchored matches, setup the state, etc; and the actual body that does a match at the current position. Now, what you want is that subrule-invocation goes directly from body to body, skipping the overhead of method invocation to the wrapper. I say, when you look up the method for a subrule, check if it is a regular rule and if so call its body directly, and otherwise use the generic mechanism. I'll get my lovely generic semantics with the direct body->body calling hidden away as an optimization details, and I get the ability to write rule-methods in perl code. You still get your low-overhead body->body calls and therefore the speed you desire (hopefully). Since you need to fetch the rule body anyway, there should be no extra overhead: where you'd normally throw an error (non-rule invoked as subrule) you'd switch to generic invocation instead. Sounds like a good deal? :-) -- Matthijs van Duin -- May the Forth be with you!
Re: Rules and hypotheticals: continuations versus callbacks
At 8:04 PM +0100 3/19/03, Matthijs van Duin wrote: On Wed, Mar 19, 2003 at 12:35:19PM -0500, Dan Sugalski wrote: I'll nudge Larry to add it explicitly, but in general redefinitons of code that you're in the middle of executing don't take effect immediately, and it's not really any different for regex rules than for subs. Ah, but we're not redefining the sub that's running, but the subs it's about to call. That works for subs, and Simon Cozens already pointed out we certainly also need it for rules :-) Well, I'm not 100% sure we need it for rules. Simon's point is well-taken, but on further reflection what we're doing is subclassing the existing grammar and reinvoking the regex engine on that subclassed grammar, rather than redefining the grammar actually in use. The former doesn't require runtime redefinitions, the latter does, and I think we're going to use the former scheme. Actually, we should be extraordinarily liberal with the application of restrictions at this phase. It's far easier to lift a restriction later than to impose it later, This is perl 6, we can add a new restriction next week We can't add them once we hit betas. I'd as soon add them now, rather than later. and I very much want to stomp out any constructs that will force slow code execution. Yes, I may lose, but if I don't try... You're absolutely right, and optimization is very important to me too. But you can't *only* look at the speed of constructs, or we'll be coding in C or assembly :-) We'll need to meet in the middle.. Well, not to be too cranky (I'm somewhat ill at the moment, so I'll apologize in advance) but... no. No, we don't actually have to, though if we could that'd be nice. The issue of hypotheticals is complex. Well, I'm a big boy, I'm sure I can handle it. Are you even talking about semantics or implementation here? Because I already gave my insights on semantics, and I have 'em in my head for implementation too but I should probably take those to perl6-internals instead. Semantics. Until Larry's nailed down what he wants, there are issues of reestablishing hypotheticals on continuation reinvocation, flushing those hypotheticals multiple times, what happens to hypotheticals when you invoke a continuation with hypotheticals in effect, what happens to hypotheticals inside of coroutines when you establish them then yield out, and when hypotheticals are visible to other threads. I read through your proposal (I'm assuming it's the one that started this thread) and it's not sufficient unless I missed something, which I may have. Ultimately the question is "How do you backtrack into arbitrary code, and how do we know that the arbitrary code can be backtracked into?" My answer is we don't, but I'm not sure how popular that particular answer is. I say, make generic semantics first, and then optimize the heck out of it. That's fine. I disagree. :) Now that Simon Cozens has established that sub-rules need to be looked up at runtime, Well Sounds like a good deal? :-) At the moment, no. It seems like a potentially large amount of overhead for no particular purpose, really. I don't see any win in the regex case, and you're not generalizing it out to the point where there's a win there. (I can see where it would be useful in the general case, but we've come nowhere near touching that) -- Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: A6: Named vs. Variadic Parameters
On Wednesday, March 19, 2003, at 09:58 AM, Larry Wall wrote: : sub foo($x, [EMAIL PROTECTED], +$k) {...}# (2) OK Fine, you can set @a using positional notation, like push(), in addition to the notations available to (1). But if you set "k =>", it has to be before the list, unless you pass the list explicitly as a "*@" named parameter. With the exception of [EMAIL PROTECTED] at the front as in (2), non-positional parameters don't pay any attention to their order of declaration. It's the "with the exception of [EMAIL PROTECTED] at the front" part that worries me quite a bit. I'd be a lot happier with having the rule be "non-positional parameters must come after positional parameters, but before any variadic elements". I think newbies are going to unquestionably try and put the parameters in the same order as they expect to see the eventual arguments, and be durn confused it doesn't work -- I know I would. Especially because: sub foo($x, +$k, *%h) {...} # (3) sub foo($x, *%h, +$k) {...} # (4) _are_ synonymous, but sub foo($x, +$k, [EMAIL PROTECTED]) {...} # (1) sub foo($x, [EMAIL PROTECTED], +$k) {...} # (2) are quite definitely not. Dunno. I'm just one datapoint, but I strongly see the difference between (1) and (2) as being a *huge* newbie trap. And it doesn't seem like you ever gain anything by specifying (1) -- I don't know why you would ever purposefully _want_ to do that, instead of (2). So I would still strongly urge that (1) and (2) be synonyms. MikeL
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 02:31:58PM -0500, Dan Sugalski wrote: Well, I'm not 100% sure we need it for rules. Simon's point is well-taken, but on further reflection what we're doing is subclassing the existing grammar and reinvoking the regex engine on that subclassed grammar, rather than redefining the grammar actually in use. The former doesn't require runtime redefinitions, the latter does, and I think we're going to use the former scheme. That's not the impression I got from Simon It would also be rather annoying.. think about balanced braces etc, take this rather contrieved, but valid example: $x ~~ m X { macro ... yada yada yada; } X; It seems to be that you're really inside a grammar rule when that macro is defined. Otherwise you'd have to keep a lot of state outside the parser to keep track of such things, which is exactly what perl grammars were supposed to avoid I think. We can't add them once we hit betas. I'd as soon add them now, rather than later. Well, I'd rather not add it at all :-) We'll need to meet in the middle.. Well, not to be too cranky (I'm somewhat ill at the moment, so I'll apologize in advance) but... no. No, we don't actually have to, though if we could that'd be nice. OK, strictly speaking that's true, but I think we can Semantics. Until Larry's nailed down what he wants, there are issues of reestablishing hypotheticals on continuation reinvocation, They should be though, if a variable was hypothesized when the continuation was taken, then it should be hypothesized when that continuation is invoked. flushing those hypotheticals multiple times, Not idea what you mean what happens to hypotheticals when you invoke a continuation with hypotheticals in effect, Basically de-hypothesize all current hypotheticals, and re-hypothesize the ones that were hypothesized when the continuation was taken. You can ofcourse optimize this by skipping the "common ancestry", if you know what I mean what happens to hypotheticals inside of coroutines when you establish them then yield out, This follows directly from the implementation of coroutines: the first yield is a normal return, so if you hypothesize $x before that it'll stay hypothesized. if you then hypothesize $y outside the coroutine and call the coroutine again, $y will be de-hypothesized. If the coroutine then hypothesizes $z and yields out, $z will be de-hypothesized and $y re-hypothesized. $x will be unaffected by all this and when hypotheticals are visible to other threads. I haven't thought of that, but to be honest I'm not a big fan of preemptive threading anyway. Cooperative threading using continuations is probably faster, has no synchronization issues. And the behavior of hypotheticals follows naturally there (you can use 'let' or 'temp' to create thread- local variables in that case) I read through your proposal (I'm assuming it's the one that started this thread) and it's not sufficient unless I missed something, which I may have. Also look at Sean O'Rourke's reply and my reply to that; it contains additional info. Sounds like a good deal? :-) At the moment, no. It seems like a potentially large amount of overhead for no particular purpose, really. I have to admit I don't know the details of how your system works, but what I had in mind didn't have any extra overhead at all -- under the (apparently still debatable) assumption that you need to look up subrules at runtime anyway. You do agree that if that is possible, is *is* a good deal? I don't see any win in the regex case, and you're not generalizing it out to the point where there's a win there. (I can see where it would be useful in the general case, but we've come nowhere near touching that) We have come near it.. backtracking is easy using continuations, and we can certainly have rules set the standard for the general case. -- Matthijs van Duin -- May the Forth be with you!
Re: Rules and hypotheticals: continuations versus callbacks
At 9:14 PM +0100 3/19/03, Matthijs van Duin wrote: On Wed, Mar 19, 2003 at 02:31:58PM -0500, Dan Sugalski wrote: Well, I'm not 100% sure we need it for rules. Simon's point is well-taken, but on further reflection what we're doing is subclassing the existing grammar and reinvoking the regex engine on that subclassed grammar, rather than redefining the grammar actually in use. The former doesn't require runtime redefinitions, the latter does, and I think we're going to use the former scheme. That's not the impression I got from Simon It would also be rather annoying.. think about balanced braces etc, take this rather contrieved, but valid example: $x ~~ m X { macro ... yada yada yada; } X; It seems to be that you're really inside a grammar rule when that macro is defined. Right. Macro definition ends, you subclass off the parser object, then immediately call into it, and it eats until the end of the regex, at which point it exits and so does the parent, for lack of input, and the resulting parse tree is turned to bytecode and executed. Otherwise you'd have to keep a lot of state outside the parser to keep track of such things, which is exactly what perl grammars were supposed to avoid I think. You, as a user-level programmer, don't have to track the state. The parser code will, but that's not a big deal. We'll need to meet in the middle.. Well, not to be too cranky (I'm somewhat ill at the moment, so I'll apologize in advance) but... no. No, we don't actually have to, though if we could that'd be nice. OK, strictly speaking that's true, but I think we can Semantics. Until Larry's nailed down what he wants, there are issues of reestablishing hypotheticals on continuation reinvocation, They should be though, if a variable was hypothesized when the continuation was taken, then it should be hypothesized when that continuation is invoked. Should they? Does hypotheticalization count as data modification (in which case it shouldn't) or control modification (in which case it should), and do you restore the hypothetical value at the time the continuation was taken or just re-hypotheticalize the variables? (Which makes continuations potentially more expensive as you need to then save off more info so on invocation you can restore the hypothetical state) What about co-routines, then? And does a yield from a coroutine count as normal or abnormal exit for pushing of hypothetical state outward, or doesn't it count at all? flushing those hypotheticals multiple times, Not idea what you mean I hypotheticalize the variables. I then take a continuation. Flow continues normally, exits off the end normally, hypothetical values get pushed out. I invoke the continuation, flow continues, exits normally. Do I push the values out again? what happens to hypotheticals when you invoke a continuation with hypotheticals in effect, Basically de-hypothesize all current hypotheticals, How? Successfully or unsuccessfully? Does it even *count* as an exit at all if there's a pending continuation that could potentially exit the hypotheticalizing block later? what happens to hypotheticals inside of coroutines when you establish them then yield out, This follows directly from the implementation of coroutines: the first yield is a normal return, so if you hypothesize $x before that it'll stay hypothesized. if you then hypothesize $y outside the coroutine and call the coroutine again, $y will be de-hypothesized. Why? That doesn't make much sense, really. If a variable is hypotheticalized outside the coroutine when I invoke it, the coroutine should see the hypothetical variable. But what about yields from within a couroutine that's hypotheticalized a variable? That's neither a normal nor an abnormal return, so what happens? If the coroutine then hypothesizes $z and yields out, $z will be de-hypothesized and $y re-hypothesized. $x will be unaffected by all this Yech. I don't think that's the right thing to do. and when hypotheticals are visible to other threads. I haven't thought of that, but to be honest I'm not a big fan of preemptive threading anyway. Doesn't matter whether you like it or not, they're a fact that must be dealt with. (And scare up a dual or better processor machine and I'll blow the doors off a cooperative threading scheme, synchronization overhead or not) I read through your proposal (I'm assuming it's the one that started this Sounds like a good deal? :-) At the moment, no. It seems like a potentially large amount of overhead for no particular purpose, really. I have to admit I don't know the details of how your system works, but what I had in mind didn't have any extra overhead at all -- under the (apparently still debatable) assumption that you need to look up subrules at runtime anyway. You do agree that if that is possible, is *is* a good deal? No. Honestly I still don't see the *point*, certainly not in regards to regular expressions and rul
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 10:38:54AM +0100, Leopold Toetsch wrote: I would propose, estimate the ops you need and test it :) I haven't completed testing yet, however it's becoming clear to me that this is likely to be a pointless effort There are so many variables that can affect performance here that the results I may find in these tests are unlikely to have any relation to the performance of rules in practice. 1. making continuations affects the performance of *other* code (COW) 2. the "let" operation is missing and all attempts to fake it are silly 3. to really test it, I'd need to make subrules full subroutines, but then the performance difference will probably disappear in the overhead of all other stuff. To test I'd need large, realistic patterns; and I'm certainly not in the mood to write PIR for them manually. And it appears that on my machine continuations and garbage collection have a quarrel, which also makes testing problematic. I guess the only way to find out is to implement both systems and compare them using a large test set of realistic grammars. Or ofcourse just implement it using continuations (system #1), since the speed difference probably isn't gonna be huge anyway. Here is my test program for continuation and the results on my machine: # "aaab" ~~ / ^ [ a | a* ] ab / set I5, 1000 sweepoff# or bus error collectoff # or segmentation fault begin: set S0, "aaab" set I0, 0 new P0, .Continuation set_addr I1, second set P0, I1 rx_literal S0, I0, "a", backtrack branch third second: new P0, .Continuation set_addr I1, fail set P0, I1 deeper: rx_literal S0, I0, "a", third save P0 # normally hypothesize new P0, .Continuation set_addr I1, unwind set P0, I1 branch deeper unwind: dec I0 # normally de-hypothesize restore P0 # normally de-hypothesize third: rx_literal S0, I0, "ab", backtrack sub I0, 2 # normally de-hypothesize backtrack: invoke fail: dec I5 if I5, begin end OPERATION PROFILE CODE OP FULL NAME CALLS TOTAL TIMEAVG TIME - --- -- -- 0 end 10.290.29 40 set_addr_i_ic 50000.0249280.05 46 set_i_ic 10010.0105730.11 60 set_s_sc 10000.0057170.06 66 set_p_i 50000.0162010.03 213 if_i_ic 10000.0028480.03 274 dec_i 40000.0113900.03 370 sub_i_ic 10000.0042270.04 675 save_p30000.1923090.64 682 restore_p 30000.2464570.82 719 branch_ic 40000.0122160.03 770 sweepoff 10.140.14 772 collectoff 10.030.03 786 new_p_ic 50000.1794030.36 819 invoke50000.0262850.05 962 rx_literal_s_i_sc_ic 10.0542600.05 - --- -- -- 16 480040.7868610.16 iBook; PPC G3; 700 Mhz -- Matthijs van Duin -- May the Forth be with you!
Re: Rules and hypotheticals: continuations versus callbacks
On Tue, Mar 18, 2003 at 09:28:43PM -0700, Luke Palmer wrote: Plan 1: Pass each rule a I continuation (rather than a backtrack one), and have it just return on failure. The big difference between this and your example is that Cs are now implemented just like Cs. Too bad C needs non-regex behavior, too. That's mechanism #2, not #1 You probably don't mean the word "continuation" here though, since a continuation never returns, so once a rule would invoke the success "continuation" it can't regain control except via another continuation You probably simply mean a closure Plan 2: Call subrules as plain old subs and have them throw a backtrack exception on failure (or just return a failure-reporting value... same difference, more or less). But.. say you have: Would would this be implemented? When bar fails, it needs to backtrack into foo, which has already returned. Are you saying every rule will be an explicit state machine? This has the advantage that C behaves consistently with the rest of Perl What do you mean? I looked around in Parrot a little, and it seems like continuations are done pretty efficiently. Yes, I noticed that do -- Matthijs van Duin -- May the Forth be with you!
Re: A6: Quick questions as I work on Perl6::Parameters
On Wed, Mar 19, 2003 at 10:34:26AM +1100, Damian Conway wrote: > Brent Dax wrote: > > > First of all, Larry, do you have *any* idea how difficult you're making > > my life? :^) This stuff is damn hard to implement--I'm at 450 lines > > and counting, and I haven't even started the semantic analysis stuff, > > let alone *tested* anything. At least I have Filter::Simple's > > FILTER_ONLY and P::RD to help me--without them, I'd be lost. Thanks, > > Damian! (Perl 6 regexes map pretty well onto P::RD, but then I guess I > > should consider the source. :^) ) > > Just wait till you see P::RD's successor: Perl6::Rules ;-) You're just trying to pre-bias all the sucks-o-meters... ( eg http://home.earthlink.net/~mrob/pub/lang_srom.html http://www.lehigh.edu/~sol0/rules.html ) Nicholas Clark
A6, parameters
If one deploys a sub with optional-positional parameters, one can't add a required parameter later without breaking any existing calls that make use of any o-p parameters. Right? A6: "An rw parameter may only default to a valid lvalue. If you find yourself wanting it to default to an ordinary value because it's undefined, perhaps you really want //= instead:" Wouldn't it make sense to allow //= in the sig and have it dwim? A6: "All unbound named arguments are bound to elements of the slurpy hash, if one was declared. If no slurpy hash is declared, an exception is thrown (although some standard methods, like BUILD, will provide an implicitly declared slurpy hash--known as %_ by analogy to @_--to handle surplus named arguments)." What about there always being an implied %_ and @_, but if either get entries, perl generates a warning (unless it's one of the methods like BUILD)? A6: "It's also possible to transition directly from positional arguments to the variadic list if optional positional arguments have been completely specified." Or if one uses pipes, right? -- ralph
Re: Rules and hypotheticals: continuations versus callbacks
At 10:05 AM +0100 3/19/03, Matthijs van Duin wrote: But.. say you have: Would would this be implemented? When bar fails, it needs to backtrack into foo, which has already returned. Are you saying every rule will be an explicit state machine? By compile-time interpolation. isn't so much a subroutine as a macro. For this to work, if we had: foo: \w+? bar: [plugh]{2,5} then what the regex engine *really* got to compile would be: (\w+?) ([plugh]{2,5}) with names attached to the two paren groups. Treating them as actual subroutines leads to madness, continuations don't quite work, and coroutines could pull it off if we could pass data back into a coroutine on reinvocation, but... We do, after all, want this fast, right? -- Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 03:46:50PM -0500, Dan Sugalski wrote: Right. Macro definition ends, you subclass off the parser object, then immediately call into it ... You, as a user-level programmer, don't have to track the state. The parser code will, but that's not a big deal. OK, I suppose that works although that still means you're moving the complexity from the perl implementation to its usage: in this case, the perl 6 parser which is written in perl 6 -- but I can well imagine other people want to do the same, and they'll have to do a similar hack. I really don't like that, perl normally moves the complexity away from the programmer and into perl. They should be though, if a variable was hypothesized when the continuation was taken, then it should be hypothesized when that continuation is invoked. Should they? Does hypotheticalization count as data modification (in which case it shouldn't) or control modification (in which case it should), Isn't that the whole point of hypotheses in perl 6? You talk about "successful" and "unsuccessful" de-hypothesizing, and about abormals exits etc.. you seem to have a much complexer model of hypotheses than what's in my head. I could be entirely missing things ofcourse, but I haven't seen any evidence to that yet. To me, what 'let' does is temporize a variable except it doesn't get restored if you leave the scope, but only if you use a continuation to go back to a point where it wasn't hypothesized yet. When the last continuation taken *before* the hypothesis is gone, so is the old version and thus the hypothesized variable becomes permanent. The behavior regarding coroutines followed naturally from this, and so does the behavior inside regexen if they use the "continuation" semantics for backtracking -- which is what I'm suggesting. This leave only behavior regarding preemptive threads, which is actually very easy to solve: disallow hypothesizing shared variables -- it simply makes no sense to do that. Now that I think of it, temporizing shared variables is equally bad news, so this isn't something new. (Which makes continuations potentially more expensive as you need to then save off more info so on invocation you can restore the hypothetical state) Actually, I think 'let' can handle this.. it's only invocation of continuations that will become more expensive because it needs to deal with the hypothesized variables What about co-routines, then? And does a yield from a coroutine count as normal or abnormal exit for pushing of hypothetical state outward, or doesn't it count at all? Your terminology gets rather foreign to me at this point. Assuming a co-routine is implemented using continuations, their behavior follows directly from the description above, and I think the resulting behavior looks fine. I don't see why people would hypothesize variables inside a co-routine anyway. I hypotheticalize the variables. I then take a continuation. Flow continues normally, exits off the end normally, hypothetical values get pushed out. I invoke the continuation, flow continues, exits normally. Do I push the values out again? If it ends normally, the variable isn't de-hypothesized at all. Also, the continuation was created *after* you hypothesized the variable, so when you invoke it nothing will happen to the variable. How? Successfully or unsuccessfully? Does it even *count* as an exit at all if there's a pending continuation that could potentially exit the hypotheticalizing block later? You're making 0% sense to me, apparently because your mental model of hypothesizing differs radically from mine. Why? That doesn't make much sense, really. Probably the same problem in opposite direction :-) (And scare up a dual or better processor machine and I'll blow the doors off a cooperative threading scheme, synchronization overhead or not) Ofcourse, for CPU-intensive applications that spread their computation over multiple threads on a multi-processor machine, you'll certainly need preemptive multithreading. When exactly is the last time you wrote such an application in perl? :-) Seriously though, I think in the common case cooperative threading is likely to be superior.. it has low overhead, it should have faster context switch time, you have no synchronization issues, and you can mostly avoid the need for explicit yielding: in many applications threads will regularly block on something anyway (which will yield to another thread) But anyway, this is getting off-topic.. I'll save it for later. Regex first No. Honestly I still don't see the *point*, certainly not in regards to regular expressions and rules. The hypothetical issues need dealing with in general for threads, coroutines, and continuations, but I don't see how any of this brings anything to rules for the parsing engine. The flow control semantics the regex/parser needs to deal with are small and simple. I just don't see the point of trying to make
Re: A6 Request: Change .req to .arity
Larry Wall wrote: [...] [I wrote:] : maybe it's because I don't think a : function's arity is quite the same as it's *minimum* number of : parameters? I mean, it makes sense in a functional language... but you : don't have functions with a variable number of arguments there. Sure, but one can imagine having functions with a given arity that can nonetheless be modified adverbially. In this view, required parameters contribute to "arity", but optional parameters are only used for, er, options. I can see your point, but I still think this is kind of warping the way people think of an n-ary function (if they do think of functions to have an arity in the first place). Steffen -- @n=([283488072,6076],[2105905181,8583184],[1823729722,9282996],[281232, 1312416],[1823790605,791604],[2104676663,884944]);$b=6;@c=' -/\_|'=~/./g ;for(@n){for$n(@$_){map{$h=int$n/$b**$_;$n-=$b**$_*$h;[EMAIL PROTECTED] 0..11;[EMAIL PROTECTED],[EMAIL PROTECTED];[EMAIL PROTECTED]"\n"[EMAIL PROTECTED];
Re: Synopsis 6 (draft 0)
Thanks for the valuable feedback, Hugo. As usual, the default response is unqualified agreement. None of the operator examples show a typed return value. Can they? Should they? Yes. And yes. And now one does. ;-) :Method invocants are specified in the parameter list, with a colon :terminating the list of invocants: Hmm, is "method" intended to refer to "method, submethod or multimethod" here? We could do with a (different) name to refer to that collection. Agreed. :=item C/C/C/C/C/etc. : :Mark blocks that are to be unconditionally executed before/after :the subroutine's C block. The return values of these blocks :are ignored. s/unconditionally/conditionally/? Not sure. Yes. I originally only had FIRST and LAST. :Another way to modify the behavior of temporized code structures :is to give them a C block: It isn't clear to me from this or the example what the effect of the TEMP block is. Welcome to the club! ;-) I'll endeavour to explain the semantics better. :The standard C> function is equivalent to C$^v} I> That looks wrong: surely C< {$^k=>$^v} > must be parsed as a hashref here, and therefore (I guess) give a syntax error? I don't think so. The rule is that the top level construct in the braces has to be a *list* for the braces to be a hash. C<$^k=>$^v> isn't a list (though C<$^k=>$^v,> would be). Besides, the presence of placeholders means the block must be a closure. -print $text while $count-->0 +print $text while $count-- > 0 No way! That's my very favorite Perl pun! ;-) Thanks again, Damian
Re: Rules and hypotheticals: continuations versus callbacks
[EMAIL PROTECTED] (Matthijs Van Duin) writes: > OK, I suppose that works although that still means you're moving the > complexity from the perl implementation to its usage: in this case, > the perl 6 parser which is written in perl 6 No, I don't believe that's what's happening. My concern is that at some point, there *will* need to be a bootstrapped parser which is written in some low level language, outputting Parrot bytecode, and it *will* need to be able to reconfigure itself mid-match. I think. I can't remember why I'm so convinced of this, and I'm too tired to think it through with examples right now, and I might be wrong anyway, but at least I can be ready with a solution if it proves necessary. :) -- "There is no safe investment. To love at all is to be vulnerable. ... The only place outside Heaven where you can be perfectly safe from all the dangers and pertubations of love is Hell." -CS Lewis "The Four Loves"
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 10:40:02AM -0500, Dan Sugalski wrote: By compile-time interpolation. isn't so much a subroutine as a macro. For this to work, if we had: foo: \w+? bar: [plugh]{2,5} then what the regex engine *really* got to compile would be: (\w+?) ([plugh]{2,5}) with names attached to the two paren groups. Treating them as actual subroutines leads to madness, Ehm, Foo.test cannot inline Foo.foo since it may be overridden: grammar Foo { rule foo { \w+? } rule bar { [plugh]{2,5} } rule test { } } grammar Bar is Foo { rule foo { +? } } What you say is only allowed if I put "is inline" on foo. continuations don't quite work Care to elaborate on that? I'd say they work fine We do, after all, want this fast, right? Ofcourse, and we should optimize as much as we can - but not optimize *more* than we can. Rules need generic backtracking semantics, and that's what I'm talking about. Optimizations to avoid the genericity of these backtracking semantics is for later. -- Matthijs van Duin -- May the Forth be with you!
Re: Rules and hypotheticals: continuations versus callbacks
At 4:52 PM +0100 3/19/03, Matthijs van Duin wrote: On Wed, Mar 19, 2003 at 10:40:02AM -0500, Dan Sugalski wrote: By compile-time interpolation. isn't so much a subroutine as a macro. For this to work, if we had: foo: \w+? bar: [plugh]{2,5} then what the regex engine *really* got to compile would be: (\w+?) ([plugh]{2,5}) with names attached to the two paren groups. Treating them as actual subroutines leads to madness, Ehm, Foo.test cannot inline Foo.foo since it may be overridden: grammar Foo { rule foo { \w+? } rule bar { [plugh]{2,5} } rule test { } } grammar Bar is Foo { rule foo { +? } } What you say is only allowed if I put "is inline" on foo. At the time I run the regex, I can inline things. There's nothing that prevents it. Yes, at compile time it's potentially an issue, since things can be overridden later, but that's going to be relatively rare, and can be dealt with by selective recompilation. By the time the regex is actually executed, it's fully specified. By definition if nothing else--you aren't allowed to selectively redefine rules in the middle of a regex that uses those rules. Or, rather, you can but the update won't take effect until after the end of the regex, the same way that you can't redefine a sub you're in the middle of executing. (And yes, I'm aware that if you do that you'll pick up the new version if you recursively call, but that won't work with regexes) continuations don't quite work Care to elaborate on that? I'd say they work fine There's issues with hypothetical variables and continuations. (And with coroutines as well) While this is a general issue, they come up most with regexes. We do, after all, want this fast, right? Ofcourse, and we should optimize as much as we can - but not optimize *more* than we can. Rules need generic backtracking semantics, and that's what I'm talking about. No. No, in fact they don't. Rules need very specific backtracking semantics, since rules are fairly specific. We're talking about backtracking in regular expressions, which is a fairly specific generality. If you want to talk about a more general backtracking that's fine, but it won't apply to how regexes backtrack. -- Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: is static?
On Wed, Mar 19, 2003 at 01:18:48PM +0100, Angel Faus wrote: : Is this equivalent to "prof $foo = 0"? If it is not, I would claim : this to be a major violation of the principle of minor surprise. :-) : Maybe it would be saner to use: : : prop $foo is default(0); I suspect you mean prop $foo is init(0); since "default" probably brings to mind //= semantics. : Otherwise the mythical unwarned user would think that "$foo = 0" is : executed on every call to the subroutine. Well, people *will* write state $foo = 0; The question is what that should mean, and which major set of people we want to give the minor surprise to, and how much effort we want to expend in training to avoid the surprise in the first place. There's something to be said for keeping = as assignment outside of sigs. But it's not clear whether that = is part of a sig... And there really are a number of possible gradations of meaning: state $foo = BEGIN { x() } # compile time state $foo = CHECK { x() } # delayed compile time state $foo = INIT { x() } # process start time state $foo = START { x() } # on first call to func? state $foo = FIRST { x() } # before each call (bad) Most use will be to init with a compile-time constant, so I suppose we could train people to just say: state $foo ::= 0; We don't have a word for "START" right now. It's somewhat equivalent to state $foo //= 0 unless $foo gets undefined, I suppose. Larry
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 11:09:01AM -0500, Dan Sugalski wrote: At the time I run the regex, I can inline things. There's nothing that prevents it. Yes, at compile time it's potentially an issue, since things can be overridden later, OK, but that's not how you initially presented it :-) you aren't allowed to selectively redefine rules in the middle of a regex that uses those rules. Or, rather, you can but the update won't take effect until after the end I don't recall having seen such a restriction mentioned in Apoc 5. While I'm a big fan of optimization, especially for something like this, I think we should be careful with introducing mandatory restrictions just to aid optimization. ("is inline" will allow such optimizations ofcourse) There's issues with hypothetical variables and continuations. (And with coroutines as well) While this is a general issue, they come up most with regexes. I'm still curious what you're referring to exactly. I've outlined possible semantics for hypothetical variables in earlier posts that should work. We do, after all, want this fast, right? Ofcourse, and we should optimize as much as we can - but not optimize *more* than we can. Rules need generic backtracking semantics, and that's what I'm talking about. No. No, in fact they don't. Rules need very specific backtracking semantics, since rules are fairly specific. We're talking about backtracking in regular expressions, which is a fairly specific generality. If you want to talk about a more general backtracking that's fine, but it won't apply to how regexes backtrack. My impression from A5 and A6 is that rules are methods. They're looked up like methods, they can be invoked like methods, etc. I certainly want to be able to write rules myself, manually, when I think it's appropriate; and use these as subrules in other methods. Generic backtracking semantics are needed for that, and should at least conceptually also apply to normal rules. When common sub-patterns are inlined, simple regexen will not use runtime subrules at all, so the issue doesn't exist there - that covers everything you would do with regexen in perl 5 for example. When you do use real sub-rules, you're getting into the domain previously held by Parse::RecDescent and the like. While these should ofcourse still be as fast as possible, a tiny bit of overhead on top of regular regex is understandable. However, such overhead might not be even needed at all: whenever possible optimizations should be applied, and rules are free to use special hacky but fast calling semantics to subrules if they determine that's possible. But I don't think a special optimization should be elevated to the official semantics. I say, make generic semantics first, and then optimize the heck out of it. -- Matthijs van Duin -- May the Forth be with you!
Re: Rules and hypotheticals: continuations versus callbacks
Matthijs van Duin wrote: sweepoff# or bus error collectoff# or segmentation fault Please try : /* set this to 1 for tracing the system stack and processor registers */ #define TRACE_SYSTEM_AREAS 1 in dod.c (works for me). Though I don't know, if processor registers on PPC gets traced by this (it might not stand optimization if not). Code is not that deeply looked at, that we can savely turn off stack tracing yet. leo
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 11:09:01AM -0500, Dan Sugalski wrote: > By the time the regex is actually executed, it's fully specified. By > definition if nothing else--you aren't allowed to selectively > redefine rules in the middle of a regex that uses those rules. Or, > rather, you can but the update won't take effect until after the end > of the regex, the same way that you can't redefine a sub you're in > the middle of executing. (And yes, I'm aware that if you do that > you'll pick up the new version if you recursively call, but that > won't work with regexes) Are you implying that $fred = rx/fred/; $string ~~ m:w/ <$fred> { $fred = rx/barney/; } rubble / won't match "barney rubble"? -Scott -- Jonathan Scott Duff [EMAIL PROTECTED]
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, 19 Mar 2003, Jonathan Scott Duff wrote: > Are you implying that > > $fred = rx/fred/; > $string ~~ m:w/ <$fred> { $fred = rx/barney/; } rubble / > > won't match "barney rubble"? Or, worse, that $fred = rx/fred/; $string ~~ m:w/ { $fred = rx/barney/; } <$fred> rubble / won't, either? /s
Re: Rules and hypotheticals: continuations versus callbacks
At 10:41 AM -0600 3/19/03, Jonathan Scott Duff wrote: On Wed, Mar 19, 2003 at 11:09:01AM -0500, Dan Sugalski wrote: By the time the regex is actually executed, it's fully specified. By definition if nothing else--you aren't allowed to selectively redefine rules in the middle of a regex that uses those rules. Or, rather, you can but the update won't take effect until after the end of the regex, the same way that you can't redefine a sub you're in the middle of executing. (And yes, I'm aware that if you do that you'll pick up the new version if you recursively call, but that won't work with regexes) Are you implying that $fred = rx/fred/; $string ~~ m:w/ <$fred> { $fred = rx/barney/; } rubble / won't match "barney rubble"? Potentially, no. What, then, should happen if you do: $barney = rx/barney/; $string = "barney rubble"; $string ~~ m:w/ <$barney> { $barney = rx/fred/; } rubble /; The regex shouldn't match, since you've invalidated part of the match in the middle. I can potentially see constructs of the form <$var> be taken as indirect rule invocations and their dispatch left to runtime, complete with the potential for bizarre after-the-fact invalidations, but as regex rules in the regex stream rather than as generic code. -- Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: Rules and hypotheticals: continuations versus callbacks
At 5:38 PM +0100 3/19/03, Matthijs van Duin wrote: On Wed, Mar 19, 2003 at 11:09:01AM -0500, Dan Sugalski wrote: At the time I run the regex, I can inline things. There's nothing that prevents it. Yes, at compile time it's potentially an issue, since things can be overridden later, OK, but that's not how you initially presented it :-) Then I wasn't clear enough, sorry. This is perl -- the state of something at compile time is just a suggestion as to how things ultimately work. The state at the time of is the only thing that really matters, and I shortcut. you aren't allowed to selectively redefine rules in the middle of a regex that uses those rules. Or, rather, you can but the update won't take effect until after the end I don't recall having seen such a restriction mentioned in Apoc 5. I'll nudge Larry to add it explicitly, but in general redefinitons of code that you're in the middle of executing don't take effect immediately, and it's not really any different for regex rules than for subs. While I'm a big fan of optimization, especially for something like this, I think we should be careful with introducing mandatory restrictions just to aid optimization. ("is inline" will allow such optimizations ofcourse) Actually, we should be extraordinarily liberal with the application of restrictions at this phase. It's far easier to lift a restriction later than to impose it later, and I very much want to stomp out any constructs that will force slow code execution. Yes, I may lose, but if I don't try... My job, after all, is to make it go fast. If you want something that'll require things to be slow then I don't want you to have it. :) There's issues with hypothetical variables and continuations. (And with coroutines as well) While this is a general issue, they come up most with regexes. I'm still curious what you're referring to exactly. I've outlined possible semantics for hypothetical variables in earlier posts that should work. The issue of hypotheticals is complex. We do, after all, want this fast, right? Ofcourse, and we should optimize as much as we can - but not optimize *more* than we can. Rules need generic backtracking semantics, and that's what I'm talking about. No. No, in fact they don't. Rules need very specific backtracking semantics, since rules are fairly specific. We're talking about backtracking in regular expressions, which is a fairly specific generality. If you want to talk about a more general backtracking that's fine, but it won't apply to how regexes backtrack. My impression from A5 and A6 is that rules are methods. They're looked up like methods, they can be invoked like methods, etc. They aren't methods, though. They're not code in general, they're regex constructions in specific. Because they live in the symbol table and in some cases can be invoked as subs/methods doesn't make them subs or methods, it makes them regex constructs with funky wrappers if you want to use them in a non-regex manner. I certainly want to be able to write rules myself, manually, when I think it's appropriate; and use these as subrules in other methods. Generic backtracking semantics are needed for that, and should at least conceptually also apply to normal rules. No, no it shouldn't. Rule are rules for regexes, they are *not* subs. If you want generic backtracking to work, then there can't be any difference between: rule foo { \w+ } and sub foo { ... } but there must be. With rules as regex constructs the semantics are much simpler. If we allow rules to be arbitrary code not only do we have to expose a fair amount of the internals of the regex engine to the sub so it can actually work on the stream and note its position (which is fine, I can do that) we also need to be able to pause foo in the middle and jump back in while passing in parameters of some sort. Neither continuations nor standard coroutines are sufficient in this instance, since the reinvocation must *both* preserve the state of the code at the time it exited but also pass in an indication as to what the sub should do. For example, if the foo sub was treated as a rule and we backtrack, should it slurp more or less? If rules are just plain regex rules and not potentially arbitrary code, the required semantics are much simpler. Then there's the issue of being able to return continuations from within arbitrary unnamed blocks, since the block in this: $foo ~~ m:w/ {...} /; should be able to participate in the backtracking activities if we're not drawing a distinction between rules and generic code. (Yeah, the syntax is wrong, but you get the point) Ultimately the question is "How do you backtrack into arbitrary code, and how do we know that the arbitrary code can be backtracked into?" My answer is we don't, but I'm not sure how popular that particular answer is. When common sub-patterns are inlined, simple regexen will not use runtime subrules at all, so the issue does
Re: A6: Quick questions as I work on Perl6::Parameters
On Tue, Mar 18, 2003 at 11:36:13PM +, Simon Cozens wrote: Seriously, someone on IRC the other day was claiming that they already had a P6RE-in-P5 implementation, and did show me some code, but I've forgotten where it lives or their real name. http://www.liacs.nl/~mavduin/P6P5_0.00_01.tar.gz That someone would be me :-) I should note that this version has wrong semantics: it is incapable of doing backtracking in various cases including subrules. It only does backtracking within subexpressions that only does "simple" regex (because I translate that to a single perl 5 regex match) My attempt at getting its semantics right is what triggered all my recent backtracking-related posts. (since continuations are unavailable, it looks like I'll have to settle for the callback system) -- Matthijs van Duin -- May the Forth be with you!
Re: Statement modifiers (yes, again)
On Tue, Mar 18, 2003 at 08:53:23PM -0700, Luke Palmer wrote: How is a left-associative operator "less special" than a non-associative one? Ehm, most operators in perl are left-associative, so you probably mean R2L short-circuiting but even then I'm not sure what you're trying to say here And you speak of consistency, but wouldn't it be better to have C be consistent with C and C rather than C and C? (Seeing as C is explicitly a control-flow construct) 'and' is a flow-control construct too.. "foo if bar" and "bar and foo" work identically. Behaviorally 'if' is grouped with 'and'. But I suppose based on the name people will group the 'if' modifier with 'for' rather than with 'and'.. Then they'll assume they can do: FOO for @BAR while $BAZ; dunno.. people try all sorts of things that can't actually be done, but I suppose in this case it's a plausible extrapolation. I guess to be honestly "consistent" all modifiers would have to become operators, which would bring us back to the multiple statement modifiers to which Larry said no.. I'll rest my case -- Matthijs van Duin -- May the Forth be with you!
random code snippet (cooperative threading)
I recently fiddled around a bit with how one might implement cooperative threading in perl 6 using call-with-current-continuation (callcc), so I thought I'd share it with you (since continuations are often poorly understood, more examples is always better :-) This is assuming callcc exists and that the continuation is topic inside it; if that's not a built-in then callcc can no doubt be defined using whatever is available to make continuations. Enjoy! my @queue; sub thread_exit () { shift(@queue)(undef); exit; } sub thread_new (&code) { push @queue, callcc { unshift @queue, $_; code; thread_exit; } } sub thread_yield () { if @queue { my $prev = callcc shift @queue; push @queue, $prev if $prev; } } thread_new { for 1..10 -> $i { print "foo: $i\n"; thread_yield; } } thread_new { for 1..10 -> $i { print "bar: $i\n"; thread_yield; } } thread_new { for 1..10 -> $i { print "baz: $i\n"; thread_yield; } } thread_exit; -- Matthijs van Duin -- May the Forth be with you!
Re: Rules and hypotheticals: continuations versus callbacks
Matthijs van Duin wrote: Which system is likely to run faster on parrot? I would propose, estimate the ops you need and test it :) E.g. call a continuation 1e6 times and communicate state with one global (a lexical is probably the same speed, i.e. a hash lookup) $ cat a.pasm new P5, .PerlInt set P5, 100 store_global "foo", P5 new P0, .Continuation set_addr I0, endcont set P0, I0 endcont: find_global P4, "foo" unless P4, done dec P4 # store_global "foo", P4 --no need to store, P4 is a reflike thingy invoke done: print "done\n" end $ time imcc -P a.pasm done real0m0.881s $ imcc -p a.pasm done OPERATION PROFILE CODE OP FULL NAMECALLS TOTAL TIMEAVG TIME - - --- -- -- 0 end 10.010.01 40 set_addr_i_ic 10.010.01 66 set_p_i 10.010.01 67 set_p_ic10.040.04 226 unless_p_ic 1010.5950250.01 276 dec_p 1000.5999460.01 758 store_global_sc_p 10.060.06 760 find_global_p_sc 1011.0379220.01 786 new_p_ic20.110.05 819 invoke1000.9140630.01 883 print_sc10.0052990.005299 - - --- -- -- 114103.1522800.01 So you can estimate that the more "heavy" opcodes take about 1us, more "light" vtable functions are ~double that speed with the slow, profiling core. CGP (or JIT) is 3 - 4 times faster. -O3 compiled parrot, Athlon 800, i386/linux leo
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 10:38:54AM +0100, Leopold Toetsch wrote: I would propose, estimate the ops you need and test it :) Hmm, good point Or even better.. I should just implement both examples and benchmark them; they're simple enough and the ops are available. I guess it's time to familiarize myself with pasm :) -- Matthijs van Duin -- May the Forth be with you!
Re: is static?
> block. Perhaps we should just go with that: > > property $foo = 0; > > Or whatever word we choose, I don't care: > > prop $foo = 0; > What about: prof $foo; $foo = 0; Is this equivalent to "prof $foo = 0"? If it is not, I would claim this to be a major violation of the principle of minor surprise. Maybe it would be saner to use: prop $foo is default(0); Otherwise the mythical unwarned user would think that "$foo = 0" is executed on every call to the subroutine. -angel
Re: Rules and hypotheticals: continuations versus callbacks
On Wed, Mar 19, 2003 at 01:01:28PM +0100, Matthijs van Duin wrote: On Wed, Mar 19, 2003 at 10:38:54AM +0100, Leopold Toetsch wrote: I would propose, estimate the ops you need and test it :) Hmm, good point Or even better.. I should just implement both examples and benchmark them; they're simple enough and the ops are available. except I forgot entirely about "let" however the implementation "let" will have impact on the performance of both systems.. oh well, I'll just have to estimate like you said :-) -- Matthijs van Duin -- May the Forth be with you!
Re: A6, parameters
On Tue, Mar 18, 2003 at 02:46:17PM -0600, Ralph Mellor wrote: : If one deploys a sub with optional-positional parameters, : one can't add a required parameter later without breaking : any existing calls that make use of any o-p parameters. : Right? Correct. : A6: "An rw parameter may only default to a valid lvalue. If : you find yourself wanting it to default to an ordinary value : because it's undefined, perhaps you really want //= instead:" : : Wouldn't it make sense to allow //= in the sig and have it dwim? Except that if you're planning to write the variable as an "out" parameter, you still have to have it bound to something, whether or not you later choose to default its value using //=. And I think putting both = and //= into the sig is confusing. Since //= has to be a run-time decision anyway, and has no influence on what gets bound to the parameter, it might as well go in the body. : A6: "All unbound named arguments are bound to elements of : the slurpy hash, if one was declared. If no slurpy hash is : declared, an exception is thrown (although some standard : methods, like BUILD, will provide an implicitly declared : slurpy hash--known as %_ by analogy to @_--to handle surplus : named arguments)." : : What about there always being an implied %_ and @_, but if : either get entries, perl generates a warning (unless it's : one of the methods like BUILD)? Then someone will turn off warnings to get the implicit %_ and @_ everywhere, and every subroutine will have to include the code to check for named parameters, which means every sub runs slower. The lack of implied %_ or @_ is meant to make it possible to compile faster sub calls when you know a sub can't have named parameters, or at least when you can know all the possible names of parameters at compile time, and can rearrange them into positional form internally from the caller end at compile time, so that the callee doesn't have to process them in random order at run time, which is slow. Even things like multimethods can have optimizations of this form. If all currently declared multimethods of a particular name have the same parameter names with different types, then on the assumption that more multimethods will not be declared under that name, you can optimize named notation into positional. Perhaps there will need to be a way to commit to a particular set of multimethods if you want to enable this optimization at CHECK time. Or maybe a way to declare that "all multimethods of this name must start with these n positional parameters with the following names..." (For the first version, we probably just restrict multimethod invocants to positional paramaters anyway. But the question is whether we can relax that at some point. Restricting multimethod invocants to positional parameter has the interesting side effect of prohibiting the currying of multimethod invocants, unfortunately.) : A6: "It's also possible to transition directly from positional : arguments to the variadic list if optional positional : arguments have been completely specified." : : Or if one uses pipes, right? Yes, that's an explicit variadic transition. Larry
Re: Rules and hypotheticals: continuations versus callbacks
[EMAIL PROTECTED] (Dan Sugalski) writes: > you aren't allowed to selectively redefine > rules in the middle of a regex that uses those rules. This is precisely what a macro does. -- "How should I know if it works? That's what beta testers are for. I only coded it." (Attributed to Linus Torvalds, somewhere in a posting)
Re: Rules and hypotheticals: continuations versus callbacks
At 5:47 PM + 3/19/03, Simon Cozens wrote: [EMAIL PROTECTED] (Dan Sugalski) writes: you aren't allowed to selectively redefine rules in the middle of a regex that uses those rules. This is precisely what a macro does. Not once execution starts, no. -- Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: A6 Request: Change .req to .arity
Damian Conway wrote: Larry wrote: On the other hand, I could see an argument that said anyone who doesn't know what .arity means shouldn't be writing routines that depend on it... That was more or less my line of thought. Now, I think I'll dare claim my English is not exactly bad for a 21 year-old non-native speaker. Being a physics and CS student, I do also have mathematical background, but it still took me a few seconds to figure out "arity" *in this context*. Maybe that's because I can't think of an exact German equivalent either; maybe it's because I don't think a function's arity is quite the same as it's *minimum* number of parameters? I mean, it makes sense in a functional language... but you don't have functions with a variable number of arguments there. To cut this short: I think req or reqargs or somesuch would be better. Why choose the method names that sound more like computer science for the very sake of that? Steffen -- sub'_{q} tsuJ}}_();sub's{seek+DATA,0,0}sub'p{print&_}sub'r{reverse$_[0]} @_=(('')x2,split" ",);s!!&s,$_=;s/}.*?}/$_[$s+1]/ if$s;s/(}.*?})/r$1/e;eval$_;p,[EMAIL PROTECTED]; __DATA__ } rehtona} } lreP} },rekcah}
Re: Rules and hypotheticals: continuations versus callbacks
[EMAIL PROTECTED] (Dan Sugalski) writes: > At 5:47 PM + 3/19/03, Simon Cozens wrote: > >[EMAIL PROTECTED] (Dan Sugalski) writes: > >> you aren't allowed to selectively redefine > >> rules in the middle of a regex that uses those rules. > > > >This is precisely what a macro does. > > Not once execution starts, no. Compilation's just execution of a regex, albeit the Perl6::Grammar::program regex, and that regex will need to be modified while it's in operation in order to pick up macro "is parsed" definitions and apply them to the rest of what it's parsing. -- * DrForr digs around for a fresh IV drip bag and proceeds to hook up. Coffee port. Firewalled, like everything else around here.
Re: A6: Named vs. Variadic Parameters
On Sat, Mar 15, 2003 at 06:46:21PM -0800, mlazzaro wrote: : Luke Palmer wrote: : : > The idea is that positional parameters are always a contiguous : > sequence in the argument list. If it looked like this: : > : > sub foo($x, ?$y, +$k, [EMAIL PROTECTED]) {...} : > : > Then one might presume to call it like: : > : > foo($x, $y, $k, 1, 2, 3); : > : > Which they can't. So it makes sense to have everything positional up : > front, while things that can go anywhere (but must be labeled) in the : > back. : : I guess. The most confusing part is this: : : sub foo($x, +$k, [EMAIL PROTECTED]) {...}# (1) WRONG : sub foo($x, [EMAIL PROTECTED], +$k) {...}# (2) OK : : sub foo($x, +$k, *%h) {...}# (3) OK(?) : sub foo($x, *%h, +$k) {...}# (4) WRONG(?) : : Not only are (1) and (4) wrong, but they're always wrong... AFAICT, there's : no possible way to get $k correctly if you use those two forms. (Or maybe : it's (1) and (3) that are wrong, or maybe both (3) and (4) are OK and only : (1) is wrong; it depends on whether *%h is considered 'positional' or : 'named'. I'm not even remotely sure.) : : So I would emphatically hope that (1) and (4) produce compile-time errors, : at minimum. If we wanted to silently accept (1) and (4) as synonyms for : (2) and (3), that would be OK too. : : But PLEASE, PLEASE make them compile-time errors if they aren't going to : work! Depends on what you mean by "work". Those are all perfectly good declarations. But only (2) allows you to tack the slurpy list positionally to the end of the positional parameters. : sub foo($x, +$k, [EMAIL PROTECTED]) {...}# (1) WRONG Not wrong. It says you must set $k using named notation. If you want a list, it goes after "k => $x". If you don't want to pass k, you should use <== to mark the variadic transition. (I suspect some style guides will require <== on all list operators. Something to be said for that.) : sub foo($x, [EMAIL PROTECTED], +$k) {...}# (2) OK Fine, you can set @a using positional notation, like push(), in addition to the notations available to (1). But if you set "k =>", it has to be before the list, unless you pass the list explicitly as a "*@" named parameter. With the exception of [EMAIL PROTECTED] at the front as in (2), non-positional parameters don't pay any attention to their order of declaration. : sub foo($x, +$k, *%h) {...}# (3) OK(?) Says bind "k => $x" to $k, then put any other named parameters in %h (and maybe k's pair too, depending on implementation). : sub foo($x, *%h, +$k) {...}# (4) WRONG(?) Says exactly the same as (3). Non-positional parameters don't care what order they're declared. Larry
Re: Synopsis 6 (draft 0)
Err...sorry all...wrong Perl 6 mailing list. Just ignore it please. (But, yes, S6 will be out RSN!) Damian
Re: 99... (hint: not red baloons)
Douglas Hunter wrote: [snip] > branch ONE_BOTTLE # 'bout time for a liver transplant > > ONE_BOTTLE: # it's been fun The ONE_BOTTLE label isn't gone to by anything else in the program, and the one instruction which does go to it, is immediately before it. So, what's the reason for it? -- $a=24;split//,240513;s/\B/ => /for@@=qw(ac ab bc ba cb ca );{push(@b,$a),($a-=6)^=1 for 2..$a/6x--$|;print "[EMAIL PROTECTED] ]\n";((6<=($a-=6))?$a+=$_[$a%6]-$a%6:($a=pop @b))&&redo;}
Re: [RFC] maninfestation, was Re: Parrot binaries
Steve Fink wrote: I've attached a proof-of-concept patch that I'd like people to comment on in terms of general approach. Looks good to me. Some minor notes: - perl Configure.pl should probably have --floatval=double (for cases, where the perl in the path happens to have Duse_long_double) - the install prefix should include the %{version} string so that you are able to keep different executables around e.g. prefix=/usr/local/parrot/%{version}. Installing should soft link parrot to e.g. /usr/local/parrot/0.0.10/bin/parrot. leo
99... (hint: not red baloons)
Thank you everyone for bringing this fun tool to such a useful state so quickly. The progress Parrot has shown is impressive. I offer you my thanks in probably not-so-elegantly written pasm emitted song and drink. -- dug # Mmmm, beer good set I1, 99 # bottles of beer set I2, 1 # single bottle set S1, " of beer on the wall,\n" set S2, " of beer,\n" set S3, "Take one down, pass it around,\n" set S4, " of beer on the wall.\n\n" set S5, " bottle" set S6, " bottles" set S7, "No more bottles of beer on the wall.\n\n" set S8, "*Buuurrp*\n" branch N_BOTTLES # start drinking N_BOTTLES: print I1 print S6 print S1 print I1 print S6 print S2 print S3 dec I1 eq I1, I2, BOTTLE_OF # Aww, we're getting low print I1 print S6 print S4 ne I1, I2, N_BOTTLES # keep on drinking BOTTLE_OF: print I1 print S5 print S4 branch ONE_BOTTLE # 'bout time for a liver transplant ONE_BOTTLE: # it's been fun print I1 print S5 print S1 print I1 print S5 print S2 print S3 print S7 print S8 end
Re: parrotcode.org version info patch
Thanks, applied. Cal Henderson wrote: > > attached below sig. > > --cal > "just take a little look from our side when you can" > > > --- index.html Wed Mar 19 08:10:15 2003 > +++ index.html.new Wed Mar 19 08:11:03 2003 > @@ -62,9 +62,9 @@ > How to get the code > > Periodic releases will appear on CPAN; the current release is > -version 0.0.9, and can be found in the CPAN > +version 0.0.10, and can be found in the CPAN > > - HREF="http://www.cpan.org/modules/by-authors/id/S/SF/SFINK/parrot-0.0.9.tar.gz";>source > directory. > + HREF="http://www.cpan.org/modules/by-authors/id/S/SF/SFINK/parrot-0.0.10.tar.gz";>source > directory. > > >
Re: some comments on pdd 14 Big Numbers
Perhaps this might sound like a foolish suggestion, but... Wouldn't it be better (for some definition of better) to design our BigInt type so that we can replace the math engine with a different math engine? Eg, so we could plug in GMP, or PARI, or Miracl, etc.? (With a fallback to our own math code) Something like Perl5's Math::BigInt does. That way, if someone has a faster math engine, they can use by simply changing a configuration option (which backend BigInt will use), rather than having to change the code which creates the bigints. -- $a=24;split//,240513;s/\B/ => /for@@=qw(ac ab bc ba cb ca );{push(@b,$a),($a-=6)^=1 for 2..$a/6x--$|;print "[EMAIL PROTECTED] ]\n";((6<=($a-=6))?$a+=$_[$a%6]-$a%6:($a=pop @b))&&redo;}
Re: 99... (hint: not red baloons)
Someone please stick this man's ditty into the examples directory :) -R! Douglas Hunter wrote: > > Thank you everyone for bringing this fun tool to such a useful state so > quickly. The progress Parrot has shown is impressive. I offer you my > thanks in probably not-so-elegantly written pasm emitted song and drink. > > -- dug