Issue 142221
Summary LLVM doesn't know that psrad xmm,xmm ignores the high part of rhs
Labels new issue
Assignees
Reporter Alcaro
```c++
#include <immintrin.h>

__m128i a(__m128i __ix, __m128i __iy) {
	return _mm_sra_epi32(__ix, _mm_unpacklo_epi32(__iy, _mm_setzero_si128()));
}
__m128i b(__m128i __ix, __m128i __iy) {
	return _mm_sra_epi32(__ix, __iy);
}

__m128i c(__m128i __ix, __m128i __iy) {
	return _mm_sra_epi32(__ix, _mm_unpackhi_epi32(__iy, _mm_setzero_si128()));
}
__m128i d(__m128i __ix, __m128i __iy) {
	return _mm_sra_epi32(__ix, _mm_unpackhi_epi32(__iy, __iy));
}
```
Expected:
```
a(long long vector[2], long long vector[2]):
        psrad   xmm0, xmm1
        ret

b(long long vector[2], long long vector[2]):
        psrad   xmm0, xmm1
        ret

c(long long vector[2], long long vector[2]):
        pshufd  xmm1, xmm1, 250
        psrad   xmm0, xmm1
        ret

d(long long vector[2], long long vector[2]):
        pshufd  xmm1, xmm1, 250
        psrad   xmm0, xmm1
        ret
```
Actual:
```
a(long long vector[2], long long vector[2]):
        xorps   xmm2, xmm2
        movss   xmm2, xmm1
        psrad   xmm0, xmm2
        ret

b(long long vector[2], long long vector[2]):
        psrad   xmm0, xmm1
        ret

c(long long vector[2], long long vector[2]):
        pxor    xmm2, xmm2
        punpckhdq       xmm1, xmm2
        psrad   xmm0, xmm1
        ret

d(long long vector[2], long long vector[2]):
        pshufd  xmm1, xmm1, 250
        psrad   xmm0, xmm1
        ret
```
https://godbolt.org/z/1j5nWoW6q
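
For reference, here is a scalar sketch of how `psrad xmm, xmm` picks its shift count. It assumes the Intel SDM description: the count is the low 64 bits of the second operand, and counts above 31 fill each lane with its sign bit. `Vec4` and `model_psrad` are hypothetical names for illustration, not part of any header.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Four 32-bit lanes, lane 0 being the lowest (hypothetical helper type).
using Vec4 = std::array<std::int32_t, 4>;

// Scalar model of `psrad xmm, xmm` (assumption: follows the Intel SDM text).
Vec4 model_psrad(const Vec4& value, const Vec4& count_vec) {
    // The count is the low 64 bits of the second operand, i.e. lanes 0 and 1.
    // Lanes 2 and 3 are never read.
    const std::uint64_t count =
        static_cast<std::uint64_t>(static_cast<std::uint32_t>(count_vec[0])) |
        (static_cast<std::uint64_t>(static_cast<std::uint32_t>(count_vec[1])) << 32);

    // A count above 31 fills each lane with its sign bit, which is the same
    // result as shifting by 31.
    const int shift = count > 31 ? 31 : static_cast<int>(count);

    Vec4 out{};
    for (std::size_t i = 0; i < out.size(); ++i) {
        // `>>` on a negative int32_t is an arithmetic shift in C++20, and in
        // practice on mainstream compilers before that.
        out[i] = value[i] >> shift;
    }
    return out;
}
```

If that reading of the SDM is right, only lanes 2 and 3 of the count operand are dead. Lane 1 still feeds the 64-bit count, so any fold of the zeroing shuffles in `a` and `c` would have to account for it.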

Originally found in libstdc++'s std::experimental::simd: https://godbolt.org/z/d5Gdnsqch (libstdc++ should probably add corresponding fixes to its header)

(See also #141475, though that one's about the shift instructions' lhs/ret, not rhs)