------- Comment #2 from tim at klingt dot org 2009-01-13 16:22 -------
(In reply to comment #1)
> I don't see how this changes could cause more branch misses. If you do the
> same .palign for the 4.4 code does the regression vanish? I would suspect
> that the loop-stream detector catches one but not the other form for some
> reason. Maybe the Intel folks can properly analyze this - HJ?

after doing some more tests, i wouldn't think too much about the branch misses.
they seem to be quite dependent on the binary, even on linked libraries. i am
more concerned about the inner loop ...

-- 
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38824