Refactoring a test program: advice sought
As part of the phalanx project, I've added quite a few new tests to 02_methods.t in the Archive::Tar test suite. Though I'm jubilant the new tests have uncovered a number of bugs, the test code itself has been getting progressively uglier, ripe for refactoring, in fact. To avoid code duplication between tests, I'd like to store common code outside the tests. To see how others have solved this problem, I took a look at YAML and ExtUtils-MakeMaker. YAML has t/TestYAML.pm and its .t files do this:

    use lib 't';
    use TestYAML;

ExtUtils-MakeMaker has t/lib/MakeMaker/Test/Utils.pm and its .t files do this:

    BEGIN {
        if( $ENV{PERL_CORE} ) {
            chdir 't' if -d 't';
            @INC = ('../lib', 'lib');
        }
        else {
            unshift @INC, 't/lib';
        }
    }
    chdir 't';

    use MakeMaker::Test::Utils;

Which model should I follow? Or are there better models out there? Anyway, the current 02_methods.t is embedded below; any advice on how it could be improved or refactored is most welcome! (I know it has a bug in that it does not skip the correct number of tests.)
    use Test::More 'no_plan';
    use strict;

    use Cwd;
    use IO::File;
    use File::Path;
    use File::Spec ();
    use File::Spec::Unix ();
    use File::Basename ();

    use Archive::Tar;
    use Archive::Tar::Constant;

    my $tar    = Archive::Tar->new;
    my $tarbin = Archive::Tar->new;
    my $tarx   = Archive::Tar->new;

    for my $obj ( $tar, $tarbin, $tarx ) {
        isa_ok( $obj, 'Archive::Tar', 'Object created' );
    }

    my $file = qq[directory/really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-really-long-directory-name/myfile];

    my $expect = {
        c => qr/^\s*$/,
        d => qr/^\s*$/,
    };

    my $all_chars = join '', "\r\n", map( chr, 0..255 ), "zzz\n\r";

    ### @expectbin is used to ensure that $tarbin is written in the right ###
    ### order and that the contents and order match exactly when extracted ###
    my @expectbin = (
        ### filename    contents        ###
        [   'bIn11',    $all_chars x 11 ],
        [   'bIn3',     $all_chars x 3  ],
        [   'bIn4',     $all_chars x 4  ],
        [   'bIn1',     $all_chars      ],
        [   'bIn2',     $all_chars x 2  ],
    );

    ### @expectx is used to ensure that $tarx is written in the right ###
    ### order and that the contents and order match exactly when extracted ###
    my $xdir    = 'x';
    my @expectx = (
        ### filename    contents    dirs      ###
        [   'k',        '',         [ $xdir ] ],
        [   $xdir,      'j',        [ $xdir ] ],    # failed before A::T 1.08
    );

    ### wintendo can't deal with too long paths, so we might have to skip tests ###
    my $TOO_LONG = ($^O eq 'MSWin32' or $^O eq 'cygwin')
                    && length( cwd() . $file ) > 247;

    if( $TOO_LONG ) {
        SKIP: {
            skip( "No long filename support - long filename extraction disabled", 0 );
        }
    } else {
        $expect->{$file} = qr/^hello\s*$/;
    }

    my @root = grep { length } File::Basename::dirname($0),
                    'src', $TOO_LONG ? 'short' : 'long';

    my $archive       = File::Spec->catfile( @root, 'bar.tar' );
    my $compressed    = File::Spec->catfile( @root, 'foo.tgz' );
    my $archivebin    = File::Spec->catfile( @root, 'outbin.tar' );
    my $compressedbin = File::Spec->catfile( @root, 'outbin.tgz' );
    my $archivex      = '0';
    my $compressedx   = '1';

    my $zlib      = eval { require IO::Zlib; 1 };
    my $NO_UNLINK = scalar @ARGV ? 1 : 0;

    ### error tests ###
    {   local $Archive::Tar::WARN = 0;

        my $init_err     = $tar->error;
        my @list         = $tar->read();
        my $read_err     = $tar->error;
        my $obj          = $tar->add_data( '' );
        my $add_data_err = $tar->error;

        is( $init_err, '',              "The error string is empty" );

        is( scalar @list, 0,            "Function read returns 0 files on error" );
        ok( $read_err,                  "   and error string is non empty" );
        like( $read_err, qr/create/,    "   and error string contains create" );
        unlike( $read_err, qr/add/,     "   and error string does not contain add" );

        ok( ! defined( $obj ),          "Function add_data returns undef on error" );
        ok( $add_data_err,              "   and error string is non empty" );
        like( $add_data_err, qr/add/,   "   and error string contains add" );
        unlike( $add_data_err, qr/create/, "   and error string does not contain create" );
    }

    ### read tests ###
    my $gzip = 0;
    for my $type ( $archive, $compressed ) {
        my $state = $gzip ? 'compressed' : 'uncompressed';

        SKIP: {
            skip( "No IO::Zlib - can not read compressed archives",
                  4 + 2 * (scalar keys %$expect) ) if( $gzip and !$zlib );

            {   my @list = $tar->read( $type );
Re: Refactoring a test program: advice sought
On Thu, Nov 13, 2003 at 07:12:01PM +1100, Andrew Savige wrote:
> Which model should I follow? Or are there better models out there?

Well, since you're not a core module you don't have to worry about the PERL_CORE stuff. So just put your .pm file somewhere under t and use lib. I use t/lib so the top-level t/ directory doesn't get cluttered (and for compatibility with the Perl core, which may be important later for A::T).

> Anyway, the current 02_methods.t is embedded below; any advice on
> how it could be improved or refactored is most welcome! (I know it
> has a bug in that it does not skip the correct number of tests).

In addition to pulling out the functions, you probably want to break this rather large test file out into several scripts. Those "### here is what the block below does" comments are a good guide. If some sort of setup is needed for the tests to run, you might want to put that into a module as well (look at MakeMaker::Test::Setup::* in a very recent MakeMaker for an example).

-- Michael G Schwern [EMAIL PROTECTED] http://www.pobox.com/~schwern/ Playstation? Of course Perl runs on Playstation. -- Jarkko Hietaniemi
seeking suggestion
Can anyone suggest a Perl source-code project that could help me out of a cumbersome situation?
IMCC fixes
I've fixed nearly all of the IMCC breakage that was introduced with the last large patch. I'm currently trying to localize all APIs to the IMC_Unit, but I'm not quite there yet. A hash test is failing, but I have no clue how my IMCC work affected that code. I'm hoping it was already failing before I synced? -Melvin
Re: Unifying call and return
Dan Sugalski <[EMAIL PROTECTED]> wrote:
> YHO would be incorrect here. There's a lot of runtime mutability, and
> there's no guarantee that a sub or method has the same prototype at
> runtime that it did at compiletime.

*if* the PDDs allow such weirdness with native types. Can we define another property for such calls:

    .sub _weird
        prototyped
        check_args

I still think that the *common* case (i.e. fixed and known arguments on both sides) should be as fast as possible.

> Dan

leo
Re: [CVS ci] hash compare
Dan Sugalski <[EMAIL PROTECTED]> wrote:
> You're going to run into problems no matter what you do, and as
> transcoding could happen with each comparison arguably you need to make a
> local copy of the string for each comparison, as otherwise you run the
> risk of significant data loss as a string gets transcoded back and forth
> across a lossy boundary.

Here is again what I already had proposed:

* as long as there are only ascii keys: noop
* on first non-ascii key, convert the whole hash to utf8 - doesn't change hash values
* then, if a key is non-ascii and non-utf8, transcode it in find_bucket() before string_compare

The hash (assuming ascii is used mainly) starts out with a compare function pointing to a strcmp()-alike compare function. Each key that enters the hash, either for insert or for find, is checked for its encoding/type. When the first non-ascii key is inserted, hash keys are converted to utf8 and the compare function pointer is changed to do utf8 compare. Non-ascii search keys are always transcoded to utf8 first - and only once.

> Regardless, I think at least a single string copy with comparison against
> that copy within the hash functions is the only way to get correct
> results.

Yes. That's the point - a single string copy. Now each compare could do a transcode, i.e. generate a new string.

> Dan

leo
Re: Arena flags and floating exception
Melvin Smith <[EMAIL PROTECTED]> wrote: > When I compile with Electric Fence (linux Athlon XP) > I get a floating point exception on startup. valgrind doesn't show this problem - strange. > -Melvin leo
Re: [CVS ci] hash compare
Peter Gibbs <[EMAIL PROTECTED]> wrote:
> I would prefer this to be done via an iterator, as it would also solve
> the skip_backward problems with DBCS encoding. Something like:

There was a discussion that current string iterators are wrong. They should take a position argument (and start of string) instead of the pointer...

> typedef struct string_iterator_t {
> String *str;
> UINTVAL bytepos;
> UINTVAL charpos;
> UINTVAL (*decode_and_advance)(struct string_iterator_t *i);
> } string_iterator;

... which is done here.

> Does anybody think this is worth implementing?

Yes. It's a nice speed-up and a clean interface.

> Peter Gibbs

leo
Re: Calling conventions. Again
I'm getting a little confused about what we're arguing about. I will take a stab at describing the playing field, so people can correct me where I'm wrong:

Nonprototyped functions: these are simpler. The only point of contention here is whether args should be passed in P5..P15, overflowing into P3; or just everything in P3. Dan has stated at least once that he much prefers the P5..P15, and there hasn't been much disagreement, so I'll assume that that's the way it'll be.

Prototyped functions: there are a range of possibilities.

1. Everything gets PMC-ized and passed in P3. (Oops, I wasn't going to mention this. I did because Joe Wilson seemed to be proposing this.) No arg counts.

2. Everything gets PMC-ized and passed in P5..P15+P3. Ix is an arg count for the number of args passed in P5..P15. P3 is empty if argcount <= 11 (so you have to completely fill P5..P15 before putting stuff in P3.)

3. Same as above, but you can start overflowing into P3 whenever you want. Mentioned for completeness. Not gonna happen. In fact, anything above this point ain't gonna happen.

4. PMCs get passed in P5..P15+P3, ints get passed in I5..I15+P3, etc. Ix is a total argument count (number of non-overflowed PMC args + number of non-overflowed int args + ...). Arguments are always ordered, so it is unambiguous which ones were omitted in a varargs situation. I think this is what Leo is arguing for.

5. PMCs get passed in P5..P15+P3, ints get passed in I5..I15+P3, etc. Ix is the number of non-overflowed PMC args, Iy is the number of non-overflowed int args, etc. I think this is what Dan is arguing for.

6. PMCs get passed in Px..P15+P3, ints get passed in I5..I15+P4, etc. Ix is the number of non-overflowed PMC args, Iy is the number of non-overflowed int args, etc. I made this one up; see below.
Given that all different types of arguments get overflowed into the same array (P3) in #4 and #5, #4 makes some sense -- if you want to separate out the types, then perhaps it should be done consistently with both argument counts _and_ overflow arrays. That's what #6 would be. Note that it burns a lot of PMC registers. The other question is how much high-level argument passing stuff (eg, default values) should be crammed in. The argument against is that it will bloat the interface and slow down calling. The argument for is that it increases the amount of shared semantics between Parrot-hosted languages. An example of how default values could be wedged in is to say that any PMC parameter can be passed a Null PMC, which is the signal to use the default value (which would need to be computed in the callee, remember), or die loudly if the parameter is required. Supporting optional integer, numeric, or string parameters would be trickier. Or disallowed. Hopefully I got all that right.
Re: IMCC fixes
Melvin Smith <[EMAIL PROTECTED]> wrote: > I've fixed nearly all of breakage with IMCC > that was introduced with the last large patch. Great, thanks. > A hash test is failing, but I have no clue how my IMCC > work affected that code. I'm hoping it was already failing > before I synced? Yep. Obviously introduced by my hash value randomization patch. > -Melvin leo
Re: Calling conventions. Again
Steve Fink wrote:

    Prototyped functions: there are a range of possibilities.
    2. Everything gets PMC-ized and passed in P5..P15+P3. Ix is an arg count
    for the number of args passed in P5..P15. P3 is empty if argcount <= 11
    (so you have to completely fill P5..P15 before putting stuff in P3.)

That is exactly the unprototyped case.

    4. PMCs get passed in P5..P15+P3, ints get passed in I5..I15+P3, etc. Ix
    is a total argument count (number of non-overflowed PMC args + number of
    non-overflowed int args + ...). Arguments are always ordered, so it is
    unambiguous which ones were omitted in a varargs situation. I think this
    is what Leo is arguing for.

Almost. An argument count doesn't help in the general case (e.g. against swapped params of the same type or such). So I'd just omit it for the probably more common case of fixed and known arguments. But we need something for varargs or even changed signatures (wherever they might come from).

The HLL compiler shouldn't know much about the underlying calling conventions. OTOH the code generated for argument binding in the callee has to interface with the HLL code for providing values for missing (default) arguments. We might need something like this:

    .sub prototyped var_args    # I1 is count of I-args
        .param int a            # I5 when present
        .if_missing_param a     # if I1 >= 1 goto a_is_valid
            # HLL code to set a
            # ...
            # a = some
            # I5 = ...
        .end                    # a_is_valid:

So while it's up to the HLL to produce the code for providing a default value, it's up to our calling conventions to allow such code to interface when a param is missing. Just having counts isn't enough to provide all we need. We must provide an interface for the HLL to fill in code for missing arguments. Changed signatures can't be handled with counts anyway.

    The other question is how much high-level argument passing stuff (eg,
    default values) should be crammed in.

See above.

    ... The argument against is that it will bloat the interface and slow
    down calling.
I'd really have the fixed and vararg cases separated, for this reason. The fib() benchmark, which mainly measures raw call speed, is already slow enough. leo
Re: [CVS ci] hash compare
On Thu, 13 Nov 2003, Leopold Toetsch wrote: > Dan Sugalski <[EMAIL PROTECTED]> wrote: > > > You're going to run into problems no matter what you do, and as > > transcoding could happen with each comparison arguably you need to make a > > local copy of the string for each comparison, as otherwise you run the > > risk of significant data loss as a string gets transcoded back and forth > > across a lossy boundary. > > Here is again what I already had proposed: > * as long as there are only ascii keys: noop > * on first non ascii key, convert all hash to utf8 - doesn't change >hash values Well... this is the place where things fall down. It does change hash values. You may find yourself transcoding from, say, Shift-JIS to Unicode, which will result in most (if not all) of the characters in the string changing code-points. That's likely to change hash values just a little... > > Regardless, I think at least a single string copy with comparison against > > that copy within the hash functions is the only way to get correct > > results. > > Yes. That's the point - a single string copy. Now each compare could do > a transcode i.e. generate a new string. When I said "at least a single copy" I meant that we might have multiple copies made, though that will definitely do nasty things to performance. Dan --"it's like this"--- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk
Re: [CVS ci] hash compare
Dan Sugalski <[EMAIL PROTECTED]> wrote: > On Thu, 13 Nov 2003, Leopold Toetsch wrote: >> * as long as there are only ascii keys: noop >> * on first non ascii key, convert all hash to utf8 - doesn't change >>hash values > Well... this is the place where things fall down. It does change hash > values. You may find yourself transcoding from, say, Shift-JIS to Unicode, Dan. It starts with ascii keys, unicode code-points 0x00..0x7f. When the first non-ascii key is to be stored, *ascii* keys are changed to utf8. > Dan leo
Re: [CVS ci] hash compare
On Thu, 13 Nov 2003, Leopold Toetsch wrote: > Dan Sugalski <[EMAIL PROTECTED]> wrote: > > On Thu, 13 Nov 2003, Leopold Toetsch wrote: > > >> * as long as there are only ascii keys: noop > >> * on first non ascii key, convert all hash to utf8 - doesn't change > >>hash values > > > Well... this is the place where things fall down. It does change hash > > values. You may find yourself transcoding from, say, Shift-JIS to Unicode, > > Dan. It starts with ascii keys, unicode code-points 0x00..0x7f. > When the first non-ascii key is to be stored, *ascii* keys are changed to > utf8. Which doesn't do much good if we've got non-ascii, non-unicode keys. Dan
Re: Calling conventions. Again
On Thu, 13 Nov 2003, Steve Fink wrote: > I'm getting a little confused about what we're arguing about. I will > take a stab at describing the playing field, so people can correct me > where I'm wrong: The current big issue is whether non-PMC parameter types get counts. There's not really anything else up for dispute. I think the for/against argument there is: for) Since we can't guarantee any sort of compiletime/runtime coherence, if we're passing parameters in I/S/N registers we need to note how many. against) That takes a lot of time and is unneeded most of the time. While I sympathize with the argument against, for it to be a feasible solution would require that runtime signatures not change from what was in place at compiletime. Given that there may be some significant amount of time between run and compile times (what with precompiled executables and libraries), and that we are in general targeting, supporting, and expecting fairly dynamic languages, I'm expecting that we're going to have issues here. While the code that uses N/I/S registers is likely going to run at a lower level than most perl/python/ruby code, we're looking to encourage their use with HLL prototypes and such, which means we're going to see more use of low-level types than we might have in the past, so we're going to get more use of them than I think folks might be expecting. > The other question is how much high-level argument passing stuff (eg, > default values) should be crammed in. The argument against is that it > will bloat the interface and slow down calling. The argument for is > that it increases the amount of shared semantics between Parrot-hosted > languages. An example of how default values could be wedged in is to > say that any PMC parameter can be passed a Null PMC, which is the > signal to use the default value (which would need to be computed in > the callee, remember), or die loudly if the parameter is required. > Supporting optional integer, numeric, or string parameters would be > trickier. Or disallowed. That's something separate, though it needs addressing. Dan
Re: This week's summary
On Tue, 11 Nov 2003 12:21, Piers Cawley wrote: > Freeze/thaw data format and PBC > Leo Tötsch is working on the data serialization/deserialization > (aka Freeze/Thaw) system discussed over the last few weeks. He > wondered if there were any plans for the frozen image data > format. Leo's plan is to use PBC constant format (with possible > extensions) so things integrate neatly into bytecode. Dan had a > bunch of comments, but the PBC based format idea seemed to be well > received, with the caveat that it should be a 'dense' format. Cool. What hooks are in place for tools like Pixie and Tangram when these objects are being stored? Pixie works (Piers, please correct me if I'm wrong) by hanging magic off the object to let Pixie know when it encounters a storage object. It then lets the serialiser freeze the object. So, it needs to intercept the serialiser on a per-object *instance* basis. I've been considering the differences and inadequacies between Tangram and Pixie for quite some time now, and I think all I need to write a persistence tool that is as useful as Pixie, but as OLTP capable as Tangram, is to be able to intercept freezing, and be able to feed the thaw'er, on a defined *Class/Property* basis. ie, I want to let the serialiser work just like Pixie, but when you encounter an object whose Class is defined to have a slightly different storage mechanism (perhaps there is a column which you want a database to add to an index for you; maybe the object is a static sized object and all the rows in it map to columns), it could be extracted without re-inventing the serialisation wheel for the process, or expecting the Classes to implement special methods. Much as I appreciate that Perl based index objects are just as functional as database indexes, I'd quite like to use my database for that. It's actually quite good at it :-). Would it be too much to ask for such hooks? Or should I come up with a sample implementation / design?
-- Sam Vilain, [EMAIL PROTECTED] We must become the change we want to see. -- Mahatma Gandhi
Re: [CVS ci] hash compare
On Wed, Nov 12, 2003 at 09:18:24PM +, Nicholas Clark wrote: > On Wed, Nov 12, 2003 at 01:57:14PM -0500, Dan Sugalski wrote: > > > You're going to run into problems no matter what you do, and as > > transcoding could happen with each comparison arguably you need to make a > > local copy of the string for each comparison, as otherwise you run the > > risk of significant data loss as a string gets transcoded back and forth > > across a lossy boundary. > > I think that this rules out what I was going to ask/suggest, having read > Leo's patch. I was wondering why there wasn't a straight memcmp of the > two strings whenever their encodings were the same. I presume that there > are some encodings where two different binary representations are considered > "equal", hence we can't blindly assume that a byte compare is sufficient. yep, AFAIK there are at least two different ways to express the German umlaut ä (I can see it on my keyboard) in Unicode. I think Simon Cozens has a good paper (somewhere) about that. re, tc
[perl #24489] intro.pod contains a slight error.
# New Ticket Created by [EMAIL PROTECTED] # Please include the string: [perl #24489] # in the subject line of all future correspondence about this issue. # http://rt.perl.org/rt2/Ticket/Display.html?id=24489

I hope this is the correct place to send this. intro.pod contains an error in one of the examples:

    304c304
    < set I2 0
    ---
    > set I2, 0

Harry
Re: [CVS ci] hash compare
Dan Sugalski <[EMAIL PROTECTED]> wrote: > On Thu, 13 Nov 2003, Leopold Toetsch wrote: >> Dan. It starts with ascii keys, unicode code-points 0x00..0x7f. >> When the first non-ascii key is to be stored, *ascii* keys are changed to >> utf8. > Which doesn't do much good if we've got non-ascii, non-unicode keys. Whatever these are (PDDs?), they aren't handled with string_compare (or string_equal) either. The hash has had a pluggable interface for the compare (and the hash_value) function for some time. Such keys need a different compare function. I just would like to replace the current slow string_compare solution with a faster equivalent. > Dan leo
Word for the day: Undocumentation
Just a reminder for new checkins. Please make sure there is a minimum of a header comment for each routine you check in, describing "just what the heck the routine does." Debugging certain parts of Parrot has become akin to mapping out a rabbit hole using marking flares. For example, just picking a random file (ok, not really so random...) which will remain nameless to protect the innocent: with a 20-second search I can locate blocks of 129 and 112 consecutive lines with no comments. Not that comments have to hit an exact ratio, but ours is approaching 1 percent. I think overall there is just too much undocumentation going on. :) Cheers, -Melvin
Review of a book about VM
Disclaimer: Pardon my French :) I have bought "Virtual Machine Design and Implementation in C++" by Bill Blunden. This book has very positive reviews (see slashdot or amazon.com). It seems to impress people by the apparent width of covered topics. Most of it is off topic. The book gives the moderately knowledgeable reader no insight about virtual machines or even about language compilation. No architectural clues. Absolutely nothing about tight run loops, copy on write, compiling, optimizing code (SSA, tail calls, continuation passing style) or jitting code. The remarks about memory management are laughable. The book presents a trivial VM and makes no attempt to compare the features of existing VMs. The author has the chutzpah, pedantry or naiveté to compare himself to Knuth: "If I had money, I could offer an award like Don Knuth (pronounced Ka-Nooth)". But it is unclear that the book has been through any reviewing or editorial process before publishing. It's not that there are obvious typos or many factual errors; the book is just vain and vacuous. It is difficult to write a review because the book has almost no substance concerning its stated subject: virtual machines. The author takes tangent after tangent and gives his opinion about almost anything concerning the computer industry, cramming in irrelevant knowledge and speaking of all the "wars" he has waged. He usually takes a paragraph to state what can be said in one sentence. Most of the code in the book has no interest whatsoever, like gigantic switches or declarations of symbolic constants. Visually disturbing notes are plain rants or state the obvious: "virtual machines are run-time systems, but not all run-time systems are virtual machines" (boy! I was thinking it was VMs all the way down). One note even propagates the old urban myth about TCP/IP: "[the DoD] was originally interested in designing a command and control network that could survive a nuclear war".
In Chapter 1, "History and Goal", the author conveys the idea that coding on top of a virtual machine is good protection against obsolescence of the real architectures. But the book will hardly convince you of the relevance of the presented VM by this criterion, because it apparently only runs on x86. The author states his goals for the VM he presents: 1/ portability 2/ simplicity 3/ performance. Modulo order, these properties are true of any sensible software project. This does not give a clue about the purpose of the VM. He cites Java, so the purpose may be to run such a language. But after reading the book, I had no further clue. Chapters 2, 5 and 7 present his assembler. Don't ask me about the logic or the interest of the presentation. Almost nothing is specific to VMs. So why bother to say more? Chapter 3 talks about a debugger. Chapter 6 is about virtual interrupts. The author seems to have used DOS as a model and is clearly limiting his discussion to x86. So much for portability. Chapter 8, called "advanced topics", explains the software industry's progression toward increased abstraction, going from binary to high-level languages. There is a comparison between Linux and Windows. Go figure. The anemic index is 6 pages and reflects the book: almost none of the entries are specific to VMs. Each chapter is followed by a reference section with a book list and an explanation of what the books cover. Certainly better than a gratuitous show-off list of academic papers. The choice of listed books is usually pretty good, but often the comments show the cluelessness of the author. I enjoyed this one concerning the Tanenbaum book about OSes: "People don't often realize that it was Minix that led to the development of Linux. I don't think people give Andrew enough credit. His ideas on modularity are very contemporary". The back cover says the book "covers in detail features such as memory management, TCP/IP networking (...)", everything expected from a commercial system. In fact the author decides that real garbage collection is too slow and too complex, does not even consider reference counting, and decides to roll his own malloc. I don't know what TCP/IP has to do with VMs... The book certainly does not fulfill the back-cover promises. It seems to me that the described VM is to VMs what a "hello world" is to real C programs. I did not bother to look at the content of the bundled CD-ROM. -- stef
Re: Review of a book about VM
At 06:30 PM 11/13/2003 +0100, Stéphane Payrard wrote: Disclaimer: Pardon my French :) I have bought "Virtual Machine Design and Implementation in C++" by Bill Blunden. This book has very positive reviews (see slashdot or amazon.com). It seems to impress people by the apparent width of covered topics. Most of it is off topic. The book gives to the moderately knowledgeable reader no insight about virtual machines or even about language compilation. Stef, Thanks for the review. Yours is the 2nd review I've read about this book. I was about to buy it but after the 1st review I changed my mind as it echoed your review in almost every sense. I did browse it at the bookstore and it seemed that the book tackled no real topics/problems associated with modern VM design. Disappointing. -Melvin
Re: Word for the day: Undocumentation
...too much undocumentation going on. One of the reasons I started putting stuff on the wiki was because I could see that updating documentation was not a high priority. On the wiki I neither have to have CVS checkin rights, nor do I have to wait for someone with those rights to act upon what I suggest. This has led to my own parallel documentation - I even document the state of the documentation. I know what I have put together is incomplete and inadequate, but I do move it forward, and I keep it up to date. When it comes to pointing out that parrot_assembly.pod is just an earlier version of PDD 6, or that the "Per-entity comments" section in PDD 7 needs some thought, or that a submissions.pod should be added, I get warnocked. I'm fine with that, I understand why - this is not a rant - but I do think that Parrot has a steep learning curve and that good documentation is essential if we want to lower it. The potential benefits seem obvious. I'd like to volunteer myself as official Parrot documentation person - a semi-autonomous process with clearly defined protocols and goals - and the necessary rights to achieve them. I'm happy to expand on what I mean by that - if I get a response. Mike
Re: Word for the day: Undocumentation
On Thu, 13 Nov 2003, Michael Scott wrote: > I'd like to volunteer myself as official Parrot documentation person - > a semi-autonomous process with clearly defined protocols and goals - > and the necessary rights to achieve them. > > I'm happy to expand on what I mean by that - if I get a response. If you're willing, fine with me. Get yourself an account on perl.org and mail me the name and we'll get you CVS checkin privs. And we can discuss what you've got in mind. Dan
Re: Word for the day: Undocumentation
At 08:10 PM 11/13/2003 +0100, Michael Scott wrote: ...too much undocumentation going on. One of the reasons I started putting stuff on the wiki was because I could see that updating documentation was not a high priority. On the wiki I neither have to have CVS checkin rights, nor do I have to wait for someone with those rights to act upon what I suggest. This has led to my own parallel documentation - I even document the state of the documentation. I know what I have put together is incomplete and inadequate, but I do move it forward, and I keep it up to date. I've used the wiki quite a few times lately. It is great. When it comes to pointing out that parrot_assembly.pod is just an earlier version of PDD 6, or that the "Per-entity comments" section in PDD 7 needs some thought, or that a submissions.pod should be added, I get warnocked. That will keep happening until our number of active, core developers grows beyond its current number of 4-5. Also, a couple of the guys split time between Parrot and P5, so it's worse. I'd like to volunteer myself as official Parrot documentation person - a semi-autonomous process with clearly defined protocols and goals - and the necessary rights to achieve them. Would commit privs to the docs directories help, for a start? -Melvin
Re: [CVS ci] hash compare
On Wed, Nov 12, 2003 at 02:07:52PM -0800, Mark A. Biggar wrote:
> And even when the sequence of Unicode code-points is the same, some
> encodings have multiple byte sequences for the same code-point. For
> example, UTF-8 has two ways to encode a code-point that is larger than
> 0xFFFF (Unicode has code-points up to 0x10FFFF), as either two 16-bit
> surrogate code points encoded as two 3-byte UTF-8 code sequences or as
> a single value encoded as a single 4 or 5 byte UTF-8 code sequence.

Is it legal to encode surrogate pairs as UTF8? Or does that count as malformed UTF8?

Nicholas Clark
Re: This week's summary
Sam Vilain <[EMAIL PROTECTED]> wrote:
> On Tue, 11 Nov 2003 12:21, Piers Cawley wrote;
> > Freeze/thaw data format and PBC
> > Leo Tötsch is working on the data serialization/deserialization
>
> Cool. How are hooks in place for tools like Pixie and Tangram when
> these objects are being stored?

Finally freeze/thaw will be overridable as well as the {de,}serializer itself. The latter has an interface looking like io->vtable->push_pmc() or i = io->vtable->shift_integer(), which is the same as a PMC has.

> Would it be too much to ask for such hooks? Or should I come up with
> a sample implementation / design?

We can discuss that better, when the code is in.

leo
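The PMC-like (de)serializer interface Leo describes can be modeled in a few lines. This is a hypothetical sketch, not Parrot's actual code: the class name and the in-memory backing store are invented; only the push_*/shift_* call shape follows the io->vtable->push_pmc() / io->vtable->shift_integer() description above.

```python
# Hypothetical model of a (de)serializer that behaves like an aggregate:
# the freeze side pushes values, the thaw side shifts them back out in
# the same order.  Tagging each value lets shift_* catch type skew.
class FreezeImage:
    def __init__(self):
        self._items = []

    # freeze side
    def push_integer(self, i):
        self._items.append(("int", int(i)))

    def push_string(self, s):
        self._items.append(("str", str(s)))

    # thaw side
    def shift_integer(self):
        tag, value = self._items.pop(0)
        assert tag == "int", "type mismatch during thaw"
        return value

    def shift_string(self):
        tag, value = self._items.pop(0)
        assert tag == "str", "type mismatch during thaw"
        return value
```

A freeze hook for a user class would then talk only to this interface, which is what would let tools like Pixie or Tangram override the transport without touching the traversal itself.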
Re: Review of a book about VM
From: "Melvin Smith" <[EMAIL PROTECTED]>

> At 06:30 PM 11/13/2003 +0100, Stéphane Payrard wrote:
> >Disclaimer: Pardon my French :)
> >
> >I have bought "Virtual Machine Design and Implementation in C++"
> >by Bill Blunden. This book has very positive reviews (see
> >slashdot or amazon.com). It seems to impress people by the
> >apparent width of covered topics. Most of it is off topic. The
> >book gives to the moderately knowledgeable reader no insight
> >about virtual machines or even about language compilation.
>
> Stef,
>
> Thanks for the review. Yours is the 2nd review I've read about this
> book. I was about to buy it but after the 1st review I changed my
> mind as it echoed your review in almost every sense.
>
> I did browse it at the bookstore and it seemed
> that the book tackled no real topics/problems associated with
> modern VM design. Disappointing.

Any chance anyone could recommend a good book about VM design and implementation? Preferably one that covers some stuff that's relevant to Parrot.

Thanks,
Jonathan
Re: Calling conventions. Again
Hi,

New to this list, so please excuse any glaring stupidity. I've been thinking about porting a small language to run on parrot, and the call/return conventions. This is what I plan to do, at least for my local routines. I'll follow the rules a bit more closely for globals and external calls (but not this week). I'd be interested to see what sort of signature changes between compile and runtime you think are likely to happen, as I have to admit I have never encountered such a beast. Doesn't that force non-prototyped calls?

As I understand it, the requirements are:

- Call a routine explicitly and be as efficient as possible/practical. You know what the parameters are and where they should go.
- Call a routine indirectly given two items: the address of the routine, and a list (or similar structure) of the parameters. You don't know, and you probably don't care what they are. Generally speaking, efficiency in this case is less important.
- Optional parameters in both calling methods.

Without optional parameters I will simply translate as follows:

    procedure fred(integer x, string y)

to:

    _u_fred:
        I5 = P3[1]
        S5 = P3[2]
    _fred:

Basically, replace the 0/1 prototype flag with a mangled label. Yes/no?

For optional parameter handling, I will do something like:

    procedure fred(integer x=0, string y="Default")

    _u_fred:
        length I1, P3        # total number of params
        lt I1, 1, _fred
        I5 = P3[1]
        lt I1, 2, _fred
        S5 = P3[2]
    _fred:
        eq I1, 2, fred_main  # where 2 is max_params
        set S5, "Default"
        eq I1, 1, fred_main
        set I5, 0
    fred_main:

In this first stab, all optional parameters must be grouped on the rhs. Not sure yet how much more complicated it gets if not. I don't know what games might need to be played with overflow parameters in P3, I can imagine needing to extend a short P3 with some defaults and chop off the first 11 somewhere in _u_fred.

Any comments?

Pete
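The default-filling scheme Pete sketches in PASM can be modeled outside the assembly. A minimal Python sketch, assuming the P3 aggregate arrives as a list and that optionals are grouped on the right as in the post (function names are hypothetical, and Parrot register semantics are simplified away):

```python
# Model of the unprototyped entry point _u_fred: count the incoming
# parameters (length I1, P3), then fill any missing trailing optionals
# with their declared defaults before falling through to the body.
DEFAULTS = [0, "Default"]  # x=0, y="Default"

def u_fred(params):
    # Extend a short parameter list with the trailing defaults.
    args = list(params) + DEFAULTS[len(params):]
    return fred(*args)

def fred(x, y):
    # The prototyped entry point: "registers" already populated.
    return (x, y)
```

Calling u_fred([7]) fills in y="Default", while u_fred([7, "hi"]) passes both values straight through, mirroring the fall-through chain of eq/set ops in the PASM version.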
Re: Review of a book about VM
On Thu, 2003-11-13 at 20:08, Melvin Smith wrote:
> At 06:30 PM 11/13/2003 +0100, Stéphane Payrard wrote:
> >I have bought "Virtual Machine Design and Implementation in C++"
> >by Bill Blunden. This book has very positive reviews (see
> >slashdot or amazon.com). It seems to impress people by the
> >apparent width of covered topics. Most of it is off topic. The
> >book gives to the moderately knowledgeable reader no insight
> >about virtual machines or even about language compilation.
>
> Thanks for the review. Yours is the 2nd review I've read about this
> book. I was about to buy it but after the 1st review I changed my
> mind as it echoed your review in almost every sense.

On a related note: what, if any, books would you recommend to someone who is eager to join parrot-development, but feels he doesn't know enough about compiler design, virtual machines etc.

Michel Rijnders

PS I'm new to this list: hi all!
Re: [CVS ci] hash compare
On Nov 13, 2003, at 2:21 PM, Nicholas Clark wrote:
> On Wed, Nov 12, 2003 at 02:07:52PM -0800, Mark A. Biggar wrote:
> > And even when the sequence of Unicode code-points is the same, some
> > encodings have multiple byte sequences for the same code-point. For
> > example, UTF-8 has two ways to encode a code-point that is larger
> > than 0xFFFF (Unicode has code-points up to 0x10FFFF), as either two
> > 16-bit surrogate code points encoded as two 3-byte UTF-8 code
> > sequences or as a single value encoded as a single 4 or 5 byte UTF-8
> > code sequence.
>
> Is it legal to encode surrogate pairs as UTF8? Or does that count as
> malformed UTF8?

No, it's not legal. As of Unicode 3.2, it's not permissible to encode a non-BMP (that is, code point > 0xFFFF) character in UTF-8 via two 3-byte UTF-8 sequences. There is another encoding which does this, called CESU-8, but I don't think it's really ever used.

JEff
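The two encodings of the same code point can be made concrete. A short Python sketch builds both forms of U+10000 (the first non-BMP character) by hand; the byte values follow directly from the UTF-8 and UTF-16 bit layouts, and a strict UTF-8 decoder rejects the surrogate form:

```python
# U+10000 in UTF-16 is the surrogate pair D800 DC00.  CESU-8 encodes
# each surrogate as its own 3-byte sequence; real UTF-8 uses a single
# 4-byte sequence for the code point itself.
def three_byte(cp):
    # Standard 3-byte UTF-8 layout: 1110xxxx 10xxxxxx 10xxxxxx
    return bytes([0xE0 | (cp >> 12),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])

utf8 = "\U00010000".encode("utf-8")              # b'\xf0\x90\x80\x80'
cesu8 = three_byte(0xD800) + three_byte(0xDC00)  # b'\xed\xa0\x80\xed\xb0\x80'

# A strict UTF-8 decoder must reject the surrogate sequences:
try:
    cesu8.decode("utf-8")
    rejected = False
except UnicodeDecodeError:
    rejected = True
```

So the same character costs 4 bytes in UTF-8 but 6 in CESU-8, and the two byte strings are not interchangeable.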
Re: reducing size of the Devel::Cover html report
--- Michael Carman <[EMAIL PROTECTED]> wrote:
> Cosmetically, everything should look almost identical. Behind the
> scenes I've pretty much gutted and rewritten everything. Most
> significantly, it no longer requires (uses) the Template Toolkit.

Out of curiosity, why did you remove Template Toolkit?

Cheers,
Ovid

=====
Silence is Evil http://users.easystreet.com/ovid/philosophy/indexdecency.htm
Ovid http://www.perlmonks.org/index.pl?node_id=17000
Web Programming with Perl http://users.easystreet.com/ovid/cgi_course/
Re: reducing size of the Devel::Cover html report
On 11/13/2003 5:54 PM, Ovid wrote:
> --- Michael Carman <[EMAIL PROTECTED]> wrote:
>> Cosmetically, everything should look almost identical. Behind the
>> scenes I've pretty much gutted and rewritten everything. Most
>> significantly, it no longer requires (uses) the Template Toolkit.
>
> Out of curiosity, why did you remove Template Toolkit?

The overriding reason was that it gave me much better control over the output. I felt like I was fighting with the template to get the results I wanted.

Another reason is that I had to read the entire source file into memory and build up a data structure (with lots of preformatted data) before filling in the template. Splitting up the formatting is awkward. The new structure doesn't do that; it allows me to read the source file line by line.

I think templates are the most useful when you want to allow multiple ways of presenting the same data, or to have a non-programmer define the presentation. The latter certainly doesn't apply here. If there were a way to make a single backend (that organized the data to be reported) that could be used to fill in multiple templates (e.g. both the text and HTML formats) I'd be more positive on using them. Honestly, though, I don't see how it's possible. The data organization is very simple; the bulk of the work is in the formatting (particularly the markup for HTML). Getting rid of a dependency on a non-standard module was just a pleasant side-effect.

I don't say any of this to criticize the Template Toolkit. I've never been too excited over any templating system. I always find it much easier to do it all in Perl. I'm not sure what Paul will think of this -- the use of the templates was part of his original design that I kept when I wrote the current HTML report. He may not be happy with me for abandoning it now.

-mjc
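The "single backend, multiple output formats" idea Michael is skeptical of can at least be sketched. This is a hypothetical illustration, not Devel::Cover's real structures: one pass organizes per-line coverage records, and independent formatters consume the same records for text and HTML.

```python
# Backend: one record per source line with its execution count.
# Formatters: render the same records as plain text or as HTML rows,
# keeping all markup concerns out of the data-gathering pass.
import html

def coverage_records(lines, counts):
    # counts maps 1-based line numbers to execution counts
    for n, text in enumerate(lines, start=1):
        yield {"line": n, "count": counts.get(n, 0), "text": text}

def format_text(records):
    return "\n".join(f"{r['count']:4d}  {r['text']}" for r in records)

def format_html(records):
    rows = "".join(
        f"<tr><td>{r['count']}</td><td>{html.escape(r['text'])}</td></tr>"
        for r in records)
    return f"<table>{rows}</table>"
```

Whether this split pays off in practice is exactly Michael's point: here the formatters still hold most of the logic, so factoring out the backend buys relatively little.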