On Sun, 7 Apr 2013, Marc Glisse wrote:

 extern __inline __m128i __attribute__((__gnu_inline__, __always_inline__,
__artificial__))
 _mm_slli_epi16 (__m128i __A, int __B)
 {
-  return (__m128i)__builtin_ia32_psllwi128 ((__v8hi)__A, __B);
+  return (__m128i) ((__v8hi)__A << __B);
 }

Actually, I believe I have to keep using the builtins for shifts, because the intrinsics have well defined behavior for large __B whereas << and >> don't.

--
Marc Glisse

Reply via email to