On Wed, Apr 26, 2017 at 10:40:45 +0200, Richard Henderson wrote: > On 04/26/2017 08:23 AM, Emilio G. Cota wrote: > >This paves the way for upcoming work. > > > >Reviewed-by: Richard Henderson <r...@twiddle.net> > >Signed-off-by: Emilio G. Cota <c...@braap.org> > >--- > > tcg-runtime.c | 21 +++++++++++++++++++++ > > tcg/tcg-runtime.h | 2 ++ > > tcg/tcg.h | 1 + > > 3 files changed, 24 insertions(+) > > > >diff --git a/tcg-runtime.c b/tcg-runtime.c > >index 4c60c96..90d2d4b 100644 > >--- a/tcg-runtime.c > >+++ b/tcg-runtime.c > >@@ -27,6 +27,7 @@ > > #include "exec/helper-proto.h" > > #include "exec/cpu_ldst.h" > > #include "exec/exec-all.h" > >+#include "exec/tb-hash.h" > > /* 32-bit helpers */ > >@@ -141,6 +142,26 @@ uint64_t HELPER(ctpop_i64)(uint64_t arg) > > return ctpop64(arg); > > } > >+ cpu_get_tb_cpu_state(env, &pc, &cs_base, &flags); > >+ tb = atomic_rcu_read(&cpu->tb_jmp_cache[tb_jmp_cache_hash_func(addr)]); > >+ if (likely(tb && tb->pc == addr && tb->cs_base == cs_base && > >+ tb->flags == flags)) { > > This comparison is wrong. It will incorrectly reject a TB for i386 guest > when CS_BASE != 0. You really want > > tb = atomic_rcu_read(&cpu->tb_jmp_cache[tb_jmp_cache_hash_func(addr)]); > if (tb) { > cpu_get_tb_cpu_state(env, &pc, &cs_base, &flags); > if (tb->pc == pc && tb->cs_base == cs_base && tb->flags == flags) { > return tb->tc_ptr; > } > } > return tcg_ctx.code_gen_epilogue; > > where you don't even load the cpu state if there isn't a preliminary hit in > the cache.
Yes, I like this. > (Note to self: That minor optimization would also apply to tb_find.) FWIW I looked at tb_find -- you need the pc though, which comes from loading the CPU state: cpu_get_tb_cpu_state(env, &pc, &cs_base, &flags); ^^ tb = atomic_rcu_read(&cpu->tb_jmp_cache[tb_jmp_cache_hash_func(pc)]); ^^ If we wanted to really avoid getting all the state I guess we'd have to add another function that returned just the pc. E.