------- Comment #7 from stuart at apple dot com 2006-08-23 21:54 ------- Time has passed, and GCC has improved on this testcase. Here is what we generate today (trunk, 23aug2006) for the original testcase:
movss b(%rip), %xmm0 movss a(%rip), %xmm1 unpcklps %xmm0, %xmm1 movaps %xmm1, %xmm0 xorps %xmm1, %xmm1 movlhps %xmm1, %xmm0 ret This isn't perfect, but it's much better than before. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=24073