On 04/07/2019 15:22, Jan Beulich wrote:
> On 04.07.2019 16:10, Andrew Cooper wrote:
>> On 01/07/2019 12:18, Jan Beulich wrote:
>>> --- a/xen/arch/x86/x86_emulate/x86_emulate.c
>>> +++ b/xen/arch/x86/x86_emulate/x86_emulate.c
>>> @@ -9100,6 +9100,133 @@ x86_emulate(
>>>          put_stub(stub);
>>>
>>>          if ( rc != X86EMUL_OKAY )
>>> +            goto done;
>>> +
>>> +        state->simd_size = simd_none;
>>> +        break;
>>> +    }
>>> +
>>> +    case X86EMUL_OPC_EVEX_66(0x0f38, 0x90): /* vpgatherd{d,q} mem,[xyz]mm{k} */
>>> +    case X86EMUL_OPC_EVEX_66(0x0f38, 0x91): /* vpgatherq{d,q} mem,[xyz]mm{k} */
>>> +    case X86EMUL_OPC_EVEX_66(0x0f38, 0x92): /* vgatherdp{s,d} mem,[xyz]mm{k} */
>>> +    case X86EMUL_OPC_EVEX_66(0x0f38, 0x93): /* vgatherqp{s,d} mem,[xyz]mm{k} */
>>> +    {
>>> +        typeof(evex) *pevex;
>>> +        union {
>>> +            int32_t dw[16];
>>> +            int64_t qw[8];
>>> +        } index;
>>> +        bool done = false;
>>> +
>>> +        ASSERT(ea.type == OP_MEM);
>>> +        generate_exception_if((!evex.opmsk || evex.brs || evex.z ||
>>> +                               evex.reg != 0xf ||
>>> +                               modrm_reg == state->sib_index),
>>> +                              EXC_UD);
>>> +        avx512_vlen_check(false);
>>> +        host_and_vcpu_must_have(avx512f);
>>> +        get_fpu(X86EMUL_FPU_zmm);
>>> +
>>> +        /* Read destination and index registers. */
>>> +        opc = init_evex(stub);
>>> +        pevex = copy_EVEX(opc, evex);
>>> +        pevex->opcx = vex_0f;
>>> +        opc[0] = 0x7f; /* vmovdqa{32,64} */
>>> +        /*
>>> +         * The register writeback below has to retain masked-off elements, but
>>> +         * needs to clear upper portions in the index-wider-than-data cases.
>>> +         * Therefore read (and write below) the full register. The alternative
>>> +         * would have been to fiddle with the mask register used.
>>> +         */
>>> +        pevex->opmsk = 0;
>>> +        /* Use (%rax) as destination and modrm_reg as source. */
>>> +        pevex->b = 1;
>>> +        opc[1] = (modrm_reg & 7) << 3;
>>> +        pevex->RX = 1;
>>> +        opc[2] = 0xc3;
>>> +
>>> +        invoke_stub("", "", "=m" (*mmvalp) : "a" (mmvalp));
>>> +
>>> +        pevex->pfx = vex_f3; /* vmovdqu{32,64} */
>>> +        pevex->w = b & 1;
>>> +        /* Switch to sib_index as source. */
>>> +        pevex->r = !mode_64bit() || !(state->sib_index & 0x08);
>>> +        pevex->R = !mode_64bit() || !(state->sib_index & 0x10);
>>> +        opc[1] = (state->sib_index & 7) << 3;
>>> +
>>> +        invoke_stub("", "", "=m" (index) : "a" (&index));
>>> +        put_stub(stub);
>>> +
>>> +        /* Clear untouched parts of the destination and mask values. */
>>> +        n = 1 << (2 + evex.lr - ((b & 1) | evex.w));
>>> +        op_bytes = 4 << evex.w;
>>> +        memset((void *)mmvalp + n * op_bytes, 0, 64 - n * op_bytes);
>>> +        op_mask &= (1 << n) - 1;
>>> +
>>> +        for ( i = 0; op_mask; ++i )
>>> +        {
>>> +            signed long idx = b & 1 ? index.qw[i] : index.dw[i];
>> No signed.
> Hmm - would you mind this remaining consistent with the AVX
> counterpart code? (As an aside I continue to think it is a bad
> thing to not have explicit "signed" when we actually mean signed
> quantities, seeing the still large amount of plain short/int/long
> uses that actually should be unsigned.)
That was conclusively objected to by multiple other committers, for a
number of reasons. It is unfortunate that some examples slipped in, but
as the coding style is not changing, they should be taken out.

~Andrew

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xenproject.org
https://lists.xenproject.org/mailman/listinfo/xen-devel
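
For reference, a minimal standalone sketch of the point under discussion: in C, "long" and "signed long" name the same type, so dropping the keyword from the quoted declaration should not change behaviour. The union tag gather_index and the helper pick_index() are hypothetical names introduced only for this illustration; they are not part of the patch or of Xen.

```c
#include <assert.h>
#include <stdint.h>

/* "long" and "signed long" are the same type; checked at compile time (C11). */
static_assert(_Generic((signed long)0, long: 1, default: 0),
              "signed long and long must be the same type");

/* Hypothetical stand-in for the anonymous index union in the quoted patch. */
union gather_index {
    int32_t dw[16];
    int64_t qw[8];
};

/*
 * Hypothetical helper mirroring the quoted loop body: select element i,
 * with 32-bit indices sign-extended by the usual arithmetic conversions.
 * Assumes an LP64 target, where long can hold any int64_t value.
 */
static long pick_index(const union gather_index *index, unsigned int i,
                       int qword_indices)
{
    long idx = qword_indices ? index->qw[i] : index->dw[i];

    return idx;
}
```

With this spelling a negative 32-bit index still arrives as a negative long, which is the property the explicit keyword was meant to document.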