On 01/26/2011 06:00 PM, Alexander Graf wrote:
Richard Henderson wrote:
>  On 01/26/2011 01:23 AM, Alexander Graf wrote:
>
>>  agraf@toonie:/studio/s390/qemu-s390>  grep deposit target-s390x/translate.c
>>      tcg_gen_deposit_i64(regs[reg], regs[reg], tmp, 0, 32);
>>      tcg_gen_deposit_i64(regs[reg], regs[reg], v, 0, 32);
>>      tcg_gen_deposit_i64(regs[reg], regs[reg], tmp, 0, 16);
>>      tcg_gen_deposit_i64(regs[reg], regs[reg], v, 0, 8);
>>          tcg_gen_deposit_i64(regs[r1], regs[r1], tmp, 48, 16);
>>          tcg_gen_deposit_i64(regs[r1], regs[r1], tmp, 32, 16);
>>
>>  The 0, 32 and 0, 16 versions should get accelerated pretty well while
>>  the 32, 16 and 48, 16 are not I assume?
>>
>
>  No, only the 0,16 and 0,8 deposits correspond to a hardware insn on x86.
>
>  Given that the 0,32 lowpart writeback is almost certainly the most common
>  operation for s390x, I doubt the deposit patch will help with an x86 host.
>
>  Have you thought about buffering the lowpart writeback in the translator?
>  I.e. when a 32-bit insn writes to a register, remember that value without
>  writing it back.  If the next insn in the TB is also 32-bit, reuse the
>  saved value, etc.  Only perform the writeback for 64-bit insns using the
>  register as a source, end of TB, and places that can take an exception.
>

That's basically what uli's tcg_sync op did which got rejected upstream :).

Keeping it only inside of the translator would break on page faults, as
the lower 32 bits of the register would lie around in a temporary which
is invisible for the page fault resolver.

It should be possible to have the exception injector look up instruction pointer in a bitmap in the TB, and if set, execute a completion which writes back the temporary to the correct register. Not trivial, but I imagine useful for many optimizations which combine several target instructions into one host instruction (probably most useful for risc->cisc translations).

--
error compiling committee.c: too many arguments to function


Reply via email to