Intel introduces Advanced Matrix Extensions (AMX) [1] feature that consists of configurable two-dimensional "TILE" registers and new accelerator instructions that operate on them. TMUL (Tile matrix MULtiply) is the first accelerator instruction set to use the new registers.
Since AMX KVM patches have been merged into Linux release, this series is based on latest Linux release(5.17-rc4). According to the KVM design, the userspace VMM (e.g. Qemu) is expected to request guest permission for the dynamically-enabled XSAVE features only once when the first vCPU is created, and KVM checks guest permission in KVM_SET_CPUID2. Intel AMX is XSAVE supported and XSAVE enabled. Those extended features has large state while current kvm_xsave only allows 4KB. The AMX KVM has extended struct kvm_xsave to meet this requirenment and added one extra KVM_GET_XSAVE2 ioctl to handle extended features. From our test, the AMX live migration work well. Notice: This version still includes some definitions in the linux-headers, once Qemu sync those linux-headers, please directly remove patch 8. So ignore patch 8 changes. [1] Intel Architecture Instruction Set Extension Programming Reference https://software.intel.com/content/dam/develop/external/us/en/documents/\ architecture-instruction-set-extensions-programming-reference.pdf Thanks, Yang ---- Change history -------------- v2->v3: - Patch 4, misc cleanup(David). - Patch 3, updated the kvm_request_xsave_components()(Paolo). - Patch 8, rebased the Linux-header on latest Qemu. v1->v2: - Patch 1 moved "esa->ecx" into the "if{}"(Paolo). - Patch 3, the requiremnets from Paoalo, - Moved "esa->ecx" into the "if{}". - Used the "mask" as parameter to replace xtiledata bits in kvm_request_xsave_components() - Used the new defined KVM_X86_XCOMP_GUEST_SUPP from KVM to get supported_xcr0 from kvm_arch_get_supported_cpuid(). - Updated the kvm_request_xsave_components() for future usage. - Patch 5 added "case 0x1e:" in kvm_arch_init_vcpu()(Paolo). - Patch 6 replaced "if (e->size && e->offset)" with "if (e->size && e->offset && buflen >= e->size + e->offset)" for xsave and xrstor(Paolo). - Patch 8, which is new added patch and is only for linux-headers. This patch can be directly dropped once Qemu sync linux-headers. rfc v1->v1: - Patch 1 changed commit message(Kevin and Paolo). - Patch 2 changed commit message(Kevin and Paolo). - Patch 3, below requirements from Paolo, - Called ARCH_REQ_XCOMP_GUEST_PERM from x86_cpu_enable_xsave_components. Used kvm_request_xsave_components() to replace x86_xsave_req_perm(). Replaced syscall(ARCH_GET_XCOMP_GUEST_PERM) with kvm_arch_get_supported_cpuid() in kvm_request_xsave_components(). - Changed kvm_cpu_xsave_init() to use host_cpuid() instead of kvm_arch_get_supported_cpuid(). - Added the "function == 0xd" handle in kvm_arch_get_supported_cpuid(). - Patch 4, used "uint32_t ecx" to replace "uint32_t need_align, support_xfd". - Patch 6, below changes, - Changed the commit message(Kevin) and Used the new function - kvm_init_xsave() to replace some pieces of code(Wei). - Moved KVM_CAP_XSAVE2 extension check to kvm_arch_init_vcpu() to make the request permission before KVM_CAP_XSAVE2 extension check(Paolo). - Removed RFC prefix. Jing Liu (5): x86: Fix the 64-byte boundary enumeration for extended state x86: Add AMX XTILECFG and XTILEDATA components x86: Add XFD faulting bit for state components x86: Add AMX CPUIDs enumeration x86: Add support for KVM_CAP_XSAVE2 and AMX state migration Yang Zhong (2): x86: Grant AMX permission for guest linux-header: Sync the linux headers Zeng Guang (1): x86: Support XFD and AMX xsave data migration linux-headers/asm-x86/kvm.h | 3 + linux-headers/linux/kvm.h | 1 + target/i386/cpu.h | 43 ++++++++++++- target/i386/kvm/kvm_i386.h | 1 + target/i386/cpu.c | 72 ++++++++++++++++++++- target/i386/kvm/kvm-cpu.c | 11 ++-- target/i386/kvm/kvm.c | 121 +++++++++++++++++++++++++++++++----- target/i386/machine.c | 42 +++++++++++++ target/i386/xsave_helper.c | 33 ++++++++++ 9 files changed, 302 insertions(+), 25 deletions(-)