On 4/15/2025 11:07 AM, mhkelle...@gmail.com wrote: > From: Michael Kelley <mhkli...@outlook.com> > > Current code allocates the "hyperv_pcpu_input_arg", and in > some configurations, the "hyperv_pcpu_output_arg". Each is a 4 KiB > page of memory allocated per-vCPU. A hypercall call site disables > interrupts, then uses this memory to set up the input parameters for > the hypercall, read the output results after hypercall execution, and > re-enable interrupts. The open coding of these steps leads to > inconsistencies, and in some cases, violation of the generic > requirements for the hypercall input and output as described in the > Hyper-V Top Level Functional Spec (TLFS)[1]. > > To reduce these kinds of problems, introduce a family of inline > functions to replace the open coding. The functions provide a new way > to manage the use of this per-vCPU memory that is usually the input and > output arguments to Hyper-V hypercalls. The functions encapsulate > key aspects of the usage and ensure that the TLFS requirements are > met (max size of 1 page each for input and output, no overlap of > input and output, aligned to 8 bytes, etc.). Conceptually, there > is no longer a difference between the "per-vCPU input page" and > "per-vCPU output page". Only a single per-vCPU page is allocated, and > it provides both hypercall input and output memory. All current > hypercalls can fit their input and output within that single page, > though the new code allows easy changing to two pages should a future > hypercall require a full page for each of the input and output. > > The new functions always zero the fixed-size portion of the hypercall > input area so that uninitialized memory is not inadvertently passed > to the hypercall. Current open-coded hypercall call sites are > inconsistent on this point, and use of the new functions addresses > that inconsistency. The output area is not zero'ed by the new code > as it is Hyper-V's responsibility to provide legal output. > > When the input or output (or both) contain an array, the new functions > calculate and return how many array entries fit within the per-vCPU > memory page, which is effectively the "batch size" for the hypercall > processing multiple entries. This batch size can then be used in the > hypercall control word to specify the repetition count. This > calculation of the batch size replaces current open coding of the > batch size, which is prone to errors. Note that the array portion of > the input area is *not* zero'ed. The arrays are almost always 64-bit > GPAs or something similar, and zero'ing that much memory seems > wasteful at runtime when it will all be overwritten. The hypercall > call site is responsible for ensuring that no part of the array is > left uninitialized (just as with current code). > > The new functions are realized as a single inline function that > handles the most complex case, which is a hypercall with input > and output, both of which contain arrays. Simpler cases are mapped to > this most complex case with #define wrappers that provide zero or NULL > for some arguments. Several of the arguments to this new function > must be compile-time constants generated by "sizeof()" > expressions. As such, most of the code in the new function can be > evaluated by the compiler, with the result that the code paths are > no longer than with the current open coding. The one exception is > new code generated to zero the fixed-size portion of the input area > in cases where it is not currently done. > > [1] > https://learn.microsoft.com/en-us/virtualization/hyper-v-on-windows/tlfs/tlfs > > Signed-off-by: Michael Kelley <mhkli...@outlook.com> > Reviewed-by: Nuno Das Neves <nunodasne...@linux.microsoft.com> > --- > > Notes: > Changes in v3: > * Added wrapper #define hv_hvcall_in_batch_size() to get the batch size > without setting up hypercall input/output parameters. This call can be > used when the batch size is needed for validation checks or memory > allocations prior to disabling interrupts. > > Changes in v2: > * Added comment that hv_hvcall_inout_array() should always be called with > interrupts disabled because it is returning pointers to per-cpu memory > [Nuno Das Neves] > > include/asm-generic/mshyperv.h | 106 +++++++++++++++++++++++++++++++++ > 1 file changed, 106 insertions(+) >
This is very cool, thanks for taking the time! I think the function naming could be more intuitive, e.g. hv_setup_*_args(). I'd not block it for that reason, but would be super happy if you would update it. What do you think? Thanks, Easwar (he/him)