On Tue, 2011-10-25 at 17:23 +0530, Suzuki K. Poulose wrote: 
> The following patch implements the dynamic relocation processing for
> PPC32 kernel. relocate() accepts the target virtual address and relocates
>  the kernel image to the same.

Hi Suzuki,

Thanks for the patches.  I've been testing them on a 440-based card, and
encountered TLB error exceptions because the BSS section wasn't getting
properly cleared in early_init().

It turns out that some of the instructions which were modified in
relocate() weren't then getting flushed out of the d-cache into memory.
After that, early_init() executed the stale (non-modified) instructions
for the BSS area.  Those instructions just accessed offset 0 instead of
the actual BSS-related offsets.  That resulted in BSS not getting`
zeroed.

I was able to verify this on my 440 by comparing the d-cache and i-cache
entries for the BSS-accessing instructions in early_init() using a
RISCWatch.  As I suspected, the instructions in the d-cache showed the
corrected offsets, but the i-cache showed the old, non-relocated
offsets.

To fix the issue, I wrote the following patch, applied on top of your
patches.  Suggestions and comments are welcome.



>From c88ae39da0c0352f411aca8d9636990a442d47da Mon Sep 17 00:00:00 2001
From: Josh Poimboeuf <jpoim...@linux.vnet.ibm.com>
Date: Wed, 2 Nov 2011 16:41:24 -0500
Subject: [PATCH] Flush relocated instructions from data cache

After updating instructions with relocated addresses, flush them from
the data cache and invalidate the icache line so we don't execute stale
instructions.

Signed-off-by: Josh Poimboeuf <jpoim...@linux.vnet.ibm.com>
---
 arch/powerpc/kernel/reloc_32.S |   11 ++++++++++-
 1 files changed, 10 insertions(+), 1 deletions(-)

diff --git a/arch/powerpc/kernel/reloc_32.S
b/arch/powerpc/kernel/reloc_32.S
index 045d61e..a92857d 100644
--- a/arch/powerpc/kernel/reloc_32.S
+++ b/arch/powerpc/kernel/reloc_32.S
@@ -137,6 +137,9 @@ get_type:
        lwz     r0, 8(r9)       /* r_addend */
        add     r0, r0, r3      /* final addend */
        stwx    r0, r4, r7      /* memory[r4+r7]) = (u32)r0 */
+       dcbst   r4,r7           /* flush dcache line to memory */
+       sync                    /* wait for flush to complete */
+       icbi    r4,r7           /* invalidate icache line */
        b       nxtrela         /* continue */
 
        /* R_PPC_ADDR16_HI */
@@ -177,6 +180,9 @@ lo16:
        /* Store half word */
 store_half:
        sthx    r0, r4, r7      /* memory[r4+r7] = (u16)r0 */
+       dcbst   r4,r7           /* flush dcache line to memory */
+       sync                    /* wait for flush to complete */
+       icbi    r4,r7           /* invalidate icache line */
 
 nxtrela:
        cmpwi   r8, 0           /* relasz = 0 ? */
@@ -185,7 +191,10 @@ nxtrela:
        subf    r8, r6, r8      /* relasz -= relaent */
        b       applyrela
 
-done:  blr
+done:
+       sync                    /* wait for icache invalidates to complete */
+       isync                   /* discard any prefetched instructions */
+       blr
 
 
 p_dyn:         .long   __dynamic_start - 0b
-- 
1.7.4.1



_______________________________________________
Linuxppc-dev mailing list
Linuxppc-dev@lists.ozlabs.org
https://lists.ozlabs.org/listinfo/linuxppc-dev

Reply via email to