> On 13 Aug 2020, at 18:28, Julien Grall <jul...@xen.org> wrote:
> 
> Hi Bertrand,
> 
> On 31/07/2020 14:16, Bertrand Marquis wrote:
>>> On 30 Jul 2020, at 22:50, Julien Grall <jul...@xen.org> wrote:
>>> On 30/07/2020 11:24, Bertrand Marquis wrote:
>>>> At the moment on Arm, a Linux guest running with KTPI enabled will
>>>> cause the following error when a context switch happens in user mode:
>>>> (XEN) p2m.c:1890: d1v0: Failed to walk page-table va 0xffffff837ebe0cd0
>>>> The error is caused by the virtual address for the runstate area
>>>> registered by the guest only being accessible when the guest is running
>>>> in kernel space when KPTI is enabled.
>>>> To solve this issue, this patch is doing the translation from virtual
>>>> address to physical address during the hypercall and mapping the
>>>> required pages using vmap. This is removing the conversion from virtual
>>>> to physical address during the context switch which is solving the
>>>> problem with KPTI.
>>> 
>>> To echo what Jan said on the previous version, this is a change in a stable 
>>> ABI and therefore may break existing guest. FAOD, I agree in principle with 
>>> the idea. However, we want to explain why breaking the ABI is the *only* 
>>> viable solution.
>>> 
>>> From my understanding, it is not possible to fix without an ABI breakage 
>>> because the hypervisor doesn't know when the guest will switch back from 
>>> userspace to kernel space. The risk is the information provided by the 
>>> runstate wouldn't contain accurate information and could affect how the 
>>> guest handle stolen time.
>>> 
>>> Additionally there are a few issues with the current interface:
>>>   1) It is assuming the virtual address cannot be re-used by the userspace. 
>>> Thanksfully Linux have a split address space. But this may change with KPTI 
>>> in place.
>>>   2) When update the page-tables, the guest has to go through an invalid 
>>> mapping. So the translation may fail at any point.
>>> 
>>> IOW, the existing interface can lead to random memory corruption and 
>>> inacurracy of the stolen time.
>> I agree but i am not sure what you want me to do here.
>> Should i add more details in the commit message ?
>>> 
>>>> This is done only on arm architecture, the behaviour on x86 is not
>>>> modified by this patch and the address conversion is done as before
>>>> during each context switch.
>>>> This is introducing several limitations in comparison to the previous
>>>> behaviour (on arm only):
>>>> - if the guest is remapping the area at a different physical address Xen
>>>> will continue to update the area at the previous physical address. As
>>>> the area is in kernel space and usually defined as a global variable this
>>>> is something which is believed not to happen. If this is required by a
>>>> guest, it will have to call the hypercall with the new area (even if it
>>>> is at the same virtual address).
>>>> - the area needs to be mapped during the hypercall. For the same reasons
>>>> as for the previous case, even if the area is registered for a different
>>>> vcpu. It is believed that registering an area using a virtual address
>>>> unmapped is not something done.
>>> 
>>> This is not clear whether the virtual address refer to the current vCPU or 
>>> the vCPU you register the runstate for. From the past discussion, I think 
>>> you refer to the former. It would be good to clarify.
>> Ok i will try to clarify.
>>> 
>>> Additionally, all the new restrictions should be documented in the public 
>>> interface. So an OS developper can find the differences between the 
>>> architectures.
>>> 
>>> To answer Jan's concern, we certainly don't know all the guest OSes 
>>> existing, however we also need to balance the benefit for a large majority 
>>> of the users.
>>> 
>>> From previous discussion, the current approach was deemed to be acceptable 
>>> on Arm and, AFAICT, also x86 (see [1]).
>>> 
>>> TBH, I would rather see the approach to be common. For that, we would an 
>>> agreement from Andrew and Jan in the approach here. Meanwhile, I think this 
>>> is the best approach to address the concern from Arm users.
>> From this I get that you want me to document the specific behaviour on Arm 
>> on the public header describing the hypercall, right ?
> 
> Yes please. The public header is usually where an OS developper will look for 
> details. Although, at the moment, the documentation is not very great as you 
> often have to dig in Xen code to understand how it is meant to work :(. But 
> we are trying to improve that.

Ok i will add some comments in the header.

Cheers
Bertrand

> 
> Cheers,
> 
> -- 
> Julien Grall


Reply via email to