On 29/08/18 07:33, Steven Haigh wrote:
> When playing with NUMA support recently, I noticed a host would always hang 
> when trying to create a cpupool for the second NUMA node in the system.
> 
> I was using the following commands:
> # xl cpupool-create name=\"Pool-1\" sched=\"credit2\
> # xl cpupool-cpu-remove Pool-0 node:1
> # xl cpupool-cpu-add Pool-1 node:1
> 
> After the last command, the system would hang - requiring a hard reset of the 
> machine to fix.
> 
> I tried a different variation with the same result:
> # xl cpupool-create name=\"Pool-1\" sched=\"credit2\
> # xl cpupool-cpu-remove Pool-0 node:1
> # xl cpupool-cpu-add Pool-1 12
> 
> It turns out that the RAM was installed sub-optimally in this machine. A 
> partial output from 'xl info -n' shows:
> numa_info              :
> node:    memsize    memfree    distances
>   0:     67584      62608      10,21
>   1:             0              0      21,10
> 
> A machine where we could get this working every time shows:
> node:    memsize    memfree    distances
>   0:     34816      30483      10,21
>   1:     32768      32125      21,10
> 
> As we can deduce RAM misconfigurations in this scenario, I believe we should 
> check to ensure that RAM configuration / layout is sane *before* attempting 
> to 
> split the system and print a warning.
> 
> This would prevent a hard system freeze in this scenario.

RAM placement should not matter here. As the name already suggests
cpupools do assignment of cpus. RAM allocated will be preferred taken
from a local node, but this shouldn't be mandatory for success.

Would it be possible to use a debug hypervisor (e.g. 4.12-unstable) for
generating a verbose log (hypervisor boot parameter "loglvl=all") and
sending the complete hypervisor log?


Juergen

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xenproject.org
https://lists.xenproject.org/mailman/listinfo/xen-devel

Reply via email to