Paolo Bonzini wrote:
The primary icc/icl use of SSE/SSE2 masking operations, of course, is in the auto-vectorization of fabs[f] and conditional operations:I'm curious, does ICC support vector arithmetic like this?
sum = 0.f; i__2 = *n; for (i__ = 1; i__ <= i__2; ++i__) if (a[i__] > 0.f) sum += a[i__]; .... (Windows/intel asm syntax) pxor xmm2, xmm2 cmpltps xmm2, xmm3 andps xmm3, xmm2 addps xmm0, xmm3 ...