On Tue, May 23, 2017 at 04:38:48PM +0200, Igor Mammedov wrote: > It fixes/add missing _PXM object for non mapped CPU (x86) > and missing fdt node (virt-arm). > > It ensures that possible_cpus contains complete mapping if > numa is enabled by the time machine_init() is executed. > > As result non completely mapped CPUs: > 1) appear in ACPI/fdt blobs > 2) QMP query-hotpluggable-cpus command shows bound nodes for such CPUs > 3) allows to drop checks for has_node_id in numa only code, > reducing number of invariants incomplete mapping could produce > 4) moves fixup/implicit node init from runtime numa_cpu_pre_plug() > (when CPU object is created) to machine_numa_finish_init() which > helps to fix [1, 2] and make possible_cpus complete source > of numa mapping available even before CPUs are created. > > Signed-off-by: Igor Mammedov <imamm...@redhat.com>
Code that will be affected, but will keep the same behavior: * pre_plug handler when node-id is omitted on device_add: it already sets node_id=0 if !has_node_id * However, pre_plug behavior is being changed when node-id is set. See below. * build_srat() (both x86 and arm): already sets node_id = 0 if !has_node_id * bochs_bios_init(): already uses g_new0(), so already had omitted entries set to 0. Affected code that will change behavior with this patch: * pre_plug handler when node-id is set: * See my reply to patch 1/5. This patch restores the old behavior that rejected node-id != 0 on omitted CPUs. * ACPI _PXM initialization * arm/virt.c: numa-node-id will now be set for all CPUs * QMP/HMP query-hotpluggable-cpus output * Desirable change, as now the QMP command will reflect what the guest is really seeing Due to the _PXM and FDT changes, I have the same objection I have for patch 4/5: if we are going to break user expectations when using incomplete NUMA mappings, let's do that only once (when we start rejecting incomplete NUMA mappings). A few extra comments about the has_node_id checks below: > --- > hw/arm/virt-acpi-build.c | 4 +--- > hw/core/machine.c | 16 ++++++++++------ > hw/i386/acpi-build.c | 3 +-- > hw/i386/pc.c | 4 +--- > numa.c | 7 +------ > 5 files changed, 14 insertions(+), 20 deletions(-) > > diff --git a/hw/arm/virt-acpi-build.c b/hw/arm/virt-acpi-build.c > index e585206..977a794 100644 > --- a/hw/arm/virt-acpi-build.c > +++ b/hw/arm/virt-acpi-build.c > @@ -496,12 +496,10 @@ build_srat(GArray *table_data, BIOSLinker *linker, > VirtMachineState *vms) > srat->reserved1 = cpu_to_le32(1); > > for (i = 0; i < cpu_list->len; ++i) { > - int node_id = cpu_list->cpus[i].props.has_node_id ? > - cpu_list->cpus[i].props.node_id : 0; > core = acpi_data_push(table_data, sizeof(*core)); > core->type = ACPI_SRAT_PROCESSOR_GICC; > core->length = sizeof(*core); > - core->proximity = cpu_to_le32(node_id); > + core->proximity = cpu_to_le32(cpu_list->cpus[i].props.node_id); > core->acpi_processor_uid = cpu_to_le32(i); > core->flags = cpu_to_le32(1); > } > diff --git a/hw/core/machine.c b/hw/core/machine.c > index aaf3cff..964b75d 100644 > --- a/hw/core/machine.c > +++ b/hw/core/machine.c > @@ -720,19 +720,23 @@ static void machine_numa_finish_init(MachineState > *machine) > const CPUArchId *cpu_slot = &possible_cpus->cpus[i]; > > if (!cpu_slot->props.has_node_id) { > - if (default_mapping) { > - /* fetch default mapping from board and enable it */ > - CpuInstanceProperties props = cpu_slot->props; > - props.has_node_id = true; > - machine_set_cpu_numa_node(machine, &props, &error_fatal); > - } else { > + /* fetch default mapping from board and enable it */ > + CpuInstanceProperties props = cpu_slot->props; > + > + if (!default_mapping) { > /* record slots with not set mapping, > * TODO: make it hard error in future */ > char *cpu_str = cpu_slot_to_string(cpu_slot); > g_string_append_printf(s, "%sCPU %d [%s]", > s->len ? ", " : "", i, cpu_str); > g_free(cpu_str); > + > + /* non mapped cpus used to fallback to node 0 */ > + props.node_id = 0; > } > + > + props.has_node_id = true; > + machine_set_cpu_numa_node(machine, &props, &error_fatal); > } > } > if (s->len) { > diff --git a/hw/i386/acpi-build.c b/hw/i386/acpi-build.c > index afcadac..873880d 100644 > --- a/hw/i386/acpi-build.c > +++ b/hw/i386/acpi-build.c > @@ -2335,8 +2335,7 @@ build_srat(GArray *table_data, BIOSLinker *linker, > MachineState *machine) > srat->reserved1 = cpu_to_le32(1); > > for (i = 0; i < apic_ids->len; i++) { > - int node_id = apic_ids->cpus[i].props.has_node_id ? > - apic_ids->cpus[i].props.node_id : 0; > + int node_id = apic_ids->cpus[i].props.node_id; What about an assert(props.has_node_id) here? > uint32_t apic_id = apic_ids->cpus[i].arch_id; > > if (apic_id < 255) { > diff --git a/hw/i386/pc.c b/hw/i386/pc.c > index cf09949..84f0a6f 100644 > --- a/hw/i386/pc.c > +++ b/hw/i386/pc.c > @@ -788,9 +788,7 @@ static FWCfgState *bochs_bios_init(AddressSpace *as, > PCMachineState *pcms) > for (i = 0; i < cpus->len; i++) { > unsigned int apic_id = cpus->cpus[i].arch_id; > assert(apic_id < pcms->apic_id_limit); > - if (cpus->cpus[i].props.has_node_id) { > - numa_fw_cfg[apic_id + 1] = > cpu_to_le64(cpus->cpus[i].props.node_id); > - } > + numa_fw_cfg[apic_id + 1] = cpu_to_le64(cpus->cpus[i].props.node_id); Ditto. > } > for (i = 0; i < nb_numa_nodes; i++) { > numa_fw_cfg[pcms->apic_id_limit + 1 + i] = > diff --git a/numa.c b/numa.c > index e30702e..4ef6dea 100644 > --- a/numa.c > +++ b/numa.c > @@ -513,17 +513,12 @@ void numa_cpu_pre_plug(const CPUArchId *slot, > DeviceState *dev, Error **errp) > int node_id = object_property_get_int(OBJECT(dev), "node-id", > &error_abort); > > if (node_id == CPU_UNSET_NUMA_NODE_ID) { > - /* by default CPUState::numa_node was 0 if it wasn't set explicitly > - * TODO: make it error when incomplete numa mapping support is > removed > - */ > - node_id = 0; > - > /* due to bug in libvirt, it doesn't pass node-id from props on > * device_add as expected, so we have to fix it up here */ > if (slot->props.has_node_id) { > node_id = slot->props.node_id; > + object_property_set_int(OBJECT(dev), node_id, "node-id", errp); > } > - object_property_set_int(OBJECT(dev), node_id, "node-id", errp); Why not removing the has_node_id check completely, here? In case this patch get respun, I suggest also removing the has_node_id check at: build_cpus_aml() _PXM code. > } else if (node_id != slot->props.node_id) { > error_setg(errp, "node-id=%d must match numa node specified " > "with -numa option", node_id); > -- > 2.7.4 > > -- Eduardo