Re: MMX built-ins performance oddities

2005-02-22 Thread Richard Henderson
On Mon, Feb 21, 2005 at 05:13:14PM +0100, Prakash Punnoor wrote: > ... but gcc 3.4.3 has serious issues. It even miscompiles SSE code. > Are plans to fix this ... No. r~

Re: MMX built-ins performance oddities

2005-02-21 Thread Prakash Punnoor
Paolo Bonzini schrieb: - vector version is about 3% faster than above instead of 10% slower - wow! So why is gcc 4.0 producing worse code when using intel style intrinsics and why isn't the union version using builtins as fast as using the vector version? I can answer why unions are slower: that's

Re: MMX built-ins performance oddities

2005-02-20 Thread Paolo Bonzini
- vector version is about 3% faster than above instead of 10% slower - wow! So why is gcc 4.0 producing worse code when using intel style intrinsics and why isn't the union version using builtins as fast as using the vector version? I can answer why unions are slower: that's because they are spil

Re: MMX built-ins performance oddities

2005-02-20 Thread Prakash Punnoor
Andrew Pinski schrieb: On Feb 19, 2005, at 8:21 AM, Prakash Punnoor wrote: Is this a known issue with gcc-3.4.3? I compiled the code using -O2 -march=athlon-xp -g3. If you want a smaller test case, I could try to do so. Right now I just didn't want to waste my time in case this is a know issue or I

Re: MMX built-ins performance oddities

2005-02-19 Thread Prakash Punnoor
Andrew Pinski schrieb: On Feb 19, 2005, at 8:21 AM, Prakash Punnoor wrote: Is this a known issue with gcc-3.4.3? I compiled the code using -O2 -march=athlon-xp -g3. If you want a smaller test case, I could try to do so. Right now I just didn't want to waste my time in case this is a know issue or I

Re: MMX built-ins performance oddities

2005-02-19 Thread Andrew Pinski
On Feb 19, 2005, at 8:21 AM, Prakash Punnoor wrote: Is this a known issue with gcc-3.4.3? I compiled the code using -O2 -march=athlon-xp -g3. If you want a smaller test case, I could try to do so. Right now I just didn't want to waste my time in case this is a know issue or I did something stupid

MMX built-ins performance oddities

2005-02-19 Thread Prakash Punnoor
Hi, I noticed something strange when I use GCC's builtins for MMX: I defined some unions: typedef int v4hi __attribute__ ((__mode__(__V4HI__))); typedef int v2si __attribute__ ((__mode__(__V2SI__))); typedef int di __attribute__ ((__mode__(__DI__))); typedef union { v4hi v; short s[4]; int i[