[Xen-devel] RFC: HVM de-privileged mode scheduling considerations

Ben Catterall Mon, 03 Aug 2015 06:37:17 -0700

Hi all,

I am working on an x86 proof-of-concept to evaluate if it is feasible tomove device models and x86 emulation code for HVM guests into ade-privileged context.

I was hoping to get feedback from relevant maintainers on schedulingconsiderations for this system to mitigate potential DoS attacks.


Many thanks in advance,
Ben

This is intended as a proof-of-concept, with the aim of determining ifthis idea is feasible within performance constraints.


Motivation
----------

The motivation for moving the device models and x86 emulation code intoring 3 is to mitigate a system compromise due a bug in any of thesesystems. These systems are currently part of the hypervisor and,consequently, a bug in any of these could allow an attacker to gaincontrol (or perform a DOS) of

Xen and/or guests.

Migrating between PCPUs
-----------------------

There is a need to support migration between pcpus so that the schedulercan still perform this operation. However, there is an issue to resolve.Currently, I have a per-vcpu copy of the Xen ring 0 stack up to thepoint of entering the de-privileged mode. This allows us to restore thisstack and then continue from the entry point when we have finished inde-privileged mode. There will be per-pcpu data on these per-vcpu stackssuch as saved stack frame pointers for the per-pcpu stack,smp_processor_id() responses etc.

Therefore, it will be necessary to lock the vcpu to the current pcpuwhen it enters this user mode so that it does not wake up on a differentpcpu where such pointers and other data are invalid. We can do this bysetting a hard affinity to the pcpu that the vcpu is executing on. Seecommon/wait.c which does something similar to what I am doing.

However, needing to have hard affinity to a pcpu leads to the followingproblem:- An attacker could lock multiple vcpus to a single pcpu, leading to aDoS. This could be achieved by spinning in a loop in Xen de-privilegedmode (assuming a bug in this mode) and performing this operation onmultiple vcpus at once. The attacker could wait until all of their vcpuswere on the same pcpu and then execute this attack. This could cause thepcpu to, effectively, lock up, as it will be under heavy load, and wewould be unable to move work elsewhere.

A solution to the DoS would be to force migration to another pcpu, ifafter, say, 100 quanta have passed where the vcpu has remained inde-privileged mode. This forcing of migration would require us toforcibly complete the de-privileged operation, and then, just beforereturning into the guest, force a cpu change. We could not just force amigration at the schedule call point as the Xen stack needs to unwind tofree up resources. We would reset this count each time we completed ade-privileged mode operation.

A legitimate long-running de-privileged operation would trigger thisforced migration mechanism. However, it is unlikely that such operationswill be needed and the count can be adjusted appropriately to mitigate this.


Any suggestions or feedback would be appreciated!

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
http://lists.xen.org/xen-devel

[Xen-devel] RFC: HVM de-privileged mode scheduling considerations

Reply via email to