Gabriel Paubert wrote:
On Wed, Jun 25, 2008 at 10:34:32AM -0500, Scott Wood wrote:
Kumar Gala wrote:
+/* Macros to workout the correct index for the FPR in the thread struct */
+#define FPRNUMBER(i) (((i) - PT_FPR0) >> 1)
+#define FPRHALF(i) (((i) - PT_FPR0) % 2)
Have you looked at what the compiler spits out here to make sure we aren't getting a divide? Seems like we could use '& 0x1'.
GCC's not *that* dumb. However, you may get some unnecessary sign-twiddling if "i" is signed.

Not for modulo 2, it's only an even/odd choice and GCC implements that efficiently IIRC. For other powers of 2,
making the left hand side unsigned helps the compiler.

From this:

int foo(int x)
{
        return x % 2;
}

I get this with -O3:

foo:
        mr 0,3
        srawi 3,3,1
        addze 3,3
        slwi 3,3,1
        subf 3,3,0
        blr
        .size   foo, .-foo
        .ident  "GCC: (GNU) 4.1.2"

Changing it to "x & 1", or to unsigned, gives this:

foo:
        rlwinm 3,3,0,31,31
        blr
        .size   foo, .-foo
        .ident  "GCC: (GNU) 4.1.2"

Maybe newer GCCs are better?

-Scott
_______________________________________________
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev

Reply via email to