On 6/28/23 15:00, BALATON Zoltan wrote:
On Wed, 28 Jun 2023, Mark Cave-Ayland wrote:
From: Richard Henderson <richard.hender...@linaro.org>

Reviewed-by: Philippe Mathieu-Daudé <phi...@linaro.org>
Signed-off-by: Richard Henderson <richard.hender...@linaro.org>
Message-Id: <20230628071202.230991-2-richard.hender...@linaro.org>
Signed-off-by: Mark Cave-Ayland <mark.cave-ayl...@ilande.co.uk>
---
target/sparc/translate.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/target/sparc/translate.c b/target/sparc/translate.c
index bad2ec90a0..28d4cdb8b4 100644
--- a/target/sparc/translate.c
+++ b/target/sparc/translate.c
@@ -318,10 +318,10 @@ static void gen_goto_tb(DisasContext *s, int tb_num,
        tcg_gen_movi_tl(cpu_npc, npc);
        tcg_gen_exit_tb(s->base.tb, tb_num);
    } else {
-        /* jump to another page: currently not optimized */
+        /* jump to another page: we can use an indirect jump */
        tcg_gen_movi_tl(cpu_pc, pc);
        tcg_gen_movi_tl(cpu_npc, npc);
-        tcg_gen_exit_tb(NULL, 0);
+        tcg_gen_lookup_and_goto_ptr();

Out of curiosity, did you test this is actually faster? The reason I ask is because I've tried to optimise similar case in target/ppc by using lookup_and_goto_ptr but found it was slower than without that. I think this may depend on the usage but I wonder if that could be a generic issue with lookup_and_goto_ptr or only specific for the case I've tried.

It is faster.

It should be *always* faster, because returning to the main cpu loop will always do more work than merely checking to see if we already have built the required TB.

If you see slowdowns, then *probably* you are using lookup_and_goto_ptr incorrectly in some instance, such that an interrupt has gotten overly delayed. (One must always return to the main loop after anything that might re-enable interrupts. Otherwise the interrupt handler may be delayed indefinitely. This was in fact the problem with v1 of this patch set.)


r~

Reply via email to