wip-rtl return location

Andy Wingo Thu, 02 Aug 2012 07:30:10 -0700

Hi,

Some brief thoughts on the wip-rtl branch.  Currently it has this
strange "return location" thing, where it specifies the register(s) to
which to return value(s), and the number of expected values and whether
it expects a rest list or not.  Problem is, this return location is like
a little program that needs to be interpreted at runtime.  Worse, it
seems to assume that return values will have to be passed in memory.


Instead I'd rather just use Dybvig's suggestion: every call instruction
is preceded by an MV return address.  For e.g. (values (f)), calling `f'
would be:

    ...
    goto CALL
MVRA:    
    truncate-and-jump RA
CALL:
    call f
RA:
    return

So the overhead of multiple values in the normal single-value case is
one jump per call.  When we do native compilation, this cost will be
negligible.  OTOH for MV returns, we return to a different address than
the one on the stack, which will cause a branch misprediction (google
"return stack buffers" for more info).  Of course this is not relevant
to the interpreter, because all of these branches are indirect, but it
will be in the future, so it's a good idea to think about these things
now.  With this design, the caller is responsible for handling MV
returns, not the callee.

Anyway, MV return will cause a branch misprediction.  Oh well.  I think
we can live with it.  Single-valued returns are the common case, and
they will be predicted correctly.

So, another thing.  The reason for the previous "return location" design
was because I wanted to have just two registers reserved by the
implementation: the instruction pointer and the frame pointer.  Wanting
an IP is obvious.  It's important to locate frame pointers so that
various pieces of code can walk the stack frames: for example the
delimited continuation code, the backtrace printer, the debugger, etc.
It's possible to just using a stack pointer and use dynamic tables to
find where the frame pointer is, like the x86-64 architecture does (or
-fomit-frame-pointer), but that requires more sophistication on the part
of the runtime, and I don't think we're really ready for that right now.

As I said, I wanted just the IP and the FP.  I didn't want an SP because
it causes so much performance noise in the current VM.  But then I
realized that in the RTL VM, it doesn't need to be accessed frequently,
because more values are addressed against the FP, and we're not pushing
and popping temporaries.  So we can actually keep it around, and it
might not need to be in a register.  It retains its useful
characteristics of allowing variable-sized data to be (temporarily)
allocated on the stack, as in procedure calls or MV returns, and as a
stack delimiter for GC.

In summary:

  - I will remove the "return location" stuff from wip-rtl;

  - All calls will be mv-calls

  - MV returns will return to 1 instruction before the RA

  - All calls will be preceded by a jump over the MVRA

  - Eventually we can remove the MVRA slot from stack frames, because it
    is computable from the RA

  - The stack pointer is back in town!

Andy
-- 
http://wingolog.org/

wip-rtl return location

Reply via email to