4.0 regression] Floating point computation far slower in 3.2 than in 2.95

uros at kss-loka dot si Thu, 03 Feb 2005 08:17:47 -0800

------- Additional Comments From uros at kss-loka dot si  2005-02-03 16:17 
-------
Promised timings, all on P4-3.2, 800MHz FSB:


gcc296 -O3:
real    0m0.978s
user    0m0.979s
sys     0m0.000s

gcc296 -O3 -march=i686:
real    0m1.061s
user    0m1.029s
sys     0m0.016s

gcc344-pre -03
real    0m0.962s
user    0m0.961s
sys     0m0.002s

gcc344-pre -03 -march=pentium4:
real    0m0.962s
user    0m0.961s
sys     0m0.000s

gcc-40 -O3 -march=i686:       <- scheduler is enabled here!
real    0m0.605s
user    0m0.605s
sys     0m0.000s

gcc-40 -O3 -march=pentium4:   <- scheduler disabled
real    0m0.599s
user    0m0.600s
sys     0m0.000s

gcc-40 -O3 -march=pentium4 -mfpmath=sse:
real    0m0.726s
user    0m0.727s
sys     0m0.000s

I didn't test 3.3 but I wouldn't call the results of 3.4 and 4.0 _FAR_SLOWER_ 
:) 

gcc -O3 -march=i686 -msse2 -mfpmath=sse
real    0m0.726s
user    0m0.725s
sys     0m0.002s

BTW: It looks there is a problem with -mfpmath=sse. I'll open another PR for
this problem: "Floating point computation far slower for -mfpmath=sse" :).

Steven, I guess this PR should be marked as fixed.


-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |NEW
   Last reconfirmed|2004-08-01 00:24:18         |2005-02-03 16:17:27
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8126

[Bug rtl-optimization/8126] [3.3/3.4/4.0 regression] Floating point computation far slower in 3.2 than in 2.95

Reply via email to