On 10/19/2015 11:04 AM, Prarit Bhargava wrote:
> The information in /sys/devices/system/cpu/cpuX/topology
> directory is useful for userspace monitoring applications and in-tree
> utilities like cpupower & turbostat.
> 
> When down'ing a CPU the /sys/devices/system/cpu/cpuX/topology directory is
> removed during the CPU_DEAD hotplug callback in the kernel.  The problem
> with this model is that the CPU has not been physically removed and the
> data in the topology directory is still valid.
> 
> This patch adds CONFIG_PERMANENT_CPU_TOPOLOGY, and is Y by default for
> x86, an N for all other arches.  When enabled the kernel is modified so
> that the topology directory is added to the core cpu sysfs files so that
> the topology directory exists for the lifetime of the CPU.  When
> disabled, the behavior of the current kernel is maintained (that is, the
> topology directory is removed on a down and added on an up).  Adding
> CONFIG_PERMANENT_CPU_TOPOLOGY may require additional architecture so that
> the cpumask data the CPU's topology is not cleared during a CPU down.
> 
> This patch combines drivers/base/topology.c and drivers/base/cpu.c to
> implement CONFIG_PERMANENT_CPU_TOPOLOGY, and leaves all arches except
> x86 with the current behavior.
> 
> Before patch:
> 
> [root@prarit cpu143]# ls
> cache    crash_notes       firmware_node  online     thermal_throttle
> cpufreq  crash_notes_size  microcode      power      topology
> cpuidle  driver            node3          subsystem  uevent
> 
> Down a cpu
> 
> [root@prarit cpu143]# echo 0 > online
> 
> [root@intel-brickland-05 cpu143]# ls
> cpuidle      crash_notes_size  firmware_node  online  subsystem
> crash_notes  driver            node3          power   uevent
> [root@intel-brickland-05 cpu143]# ls -l topology
> ls: cannot access topology: No such file or directory
> 
> After patch:
> 
> [root@prarit cpu143]# ls
> cache    crash_notes       firmware_node  online     thermal_throttle
> cpufreq  crash_notes_size  microcode      power      topology
> cpuidle  driver            node3          subsystem  uevent
> [root@prarit cpu143]# cat topology/*
> 27
> ffff,c0000000,000000ff,ffc00000,00000000
> 54-71,126-143
> 3
> 8000,00000000,00000080,00000000,00000000
> 71,143
> 
> Down a cpu
> 
> [root@prarit cpu143]# echo 0 > online
> [root@intel-brickland-05 cpu143]# ls
> cpuidle      crash_notes_size  firmware_node  online  subsystem  uevent
> crash_notes  driver          node3          power   topology
> [root@prarit cpu143]# cat topology/*
> 27
> ffff,c0000000,000000ff,ffc00000,00000000
> 54-71,126-143
> 3
> 8000,00000000,00000080,00000000,00000000
> 71,143
> 
> I did some light testing with and without BOOTPARAM_HOTPLUG_CPU0 enabled,
> and up'd and down'd CPUs in sequence, randomly, by thread group, by
> socket group and didn't see any issues.
> 
> Cc: Thomas Gleixner <t...@linutronix.de>
> Cc: Ingo Molnar <mi...@redhat.com>
> Cc: "H. Peter Anvin" <h...@zytor.com>
> Cc: x...@kernel.org
> Cc: Greg Kroah-Hartman <gre...@linuxfoundation.org>
> Cc: Borislav Petkov <b...@alien8.de>
> Cc: Len Brown <len.br...@intel.com>
> Cc: Andy Lutomirski <l...@amacapital.net>
> Cc: Zhu Guihua <zhugh.f...@cn.fujitsu.com>
> Cc: Denys Vlasenko <dvlas...@redhat.com>
> Cc: "Jan H. Schönherr" <jscho...@amazon.de>
> Cc: Boris Ostrovsky <boris.ostrov...@oracle.com>
> Cc: Prarit Bhargava <pra...@redhat.com>
> Cc: "Paul E. McKenney" <paul...@linux.vnet.ibm.com>
> Cc: Thomas Renninger <tr...@suse.de>
> Signed-off-by: Prarit Bhargava <pra...@redhat.com>
> ---
>  arch/x86/kernel/smpboot.c |   28 --------
>  drivers/base/Kconfig      |   13 ++++
>  drivers/base/Makefile     |    2 +-
>  drivers/base/cpu.c        |  135 +++++++++++++++++++++++++++++++++++++++
>  drivers/base/topology.c   |  155 
> ---------------------------------------------
>  5 files changed, 149 insertions(+), 184 deletions(-)
>  delete mode 100644 drivers/base/topology.c
> 
> diff --git a/arch/x86/kernel/smpboot.c b/arch/x86/kernel/smpboot.c
> index e0c198e..19082c7 100644
> --- a/arch/x86/kernel/smpboot.c
> +++ b/arch/x86/kernel/smpboot.c
> @@ -1322,32 +1322,6 @@ __init void prefill_possible_map(void)
>  
>  #ifdef CONFIG_HOTPLUG_CPU
>  
> -static void remove_siblinginfo(int cpu)
> -{
> -     int sibling;
> -     struct cpuinfo_x86 *c = &cpu_data(cpu);
> -
> -     for_each_cpu(sibling, topology_core_cpumask(cpu)) {
> -             cpumask_clear_cpu(cpu, topology_core_cpumask(sibling));
> -             /*/
> -              * last thread sibling in this cpu core going down
> -              */
> -             if (cpumask_weight(topology_sibling_cpumask(cpu)) == 1)
> -                     cpu_data(sibling).booted_cores--;
> -     }
> -
> -     for_each_cpu(sibling, topology_sibling_cpumask(cpu))
> -             cpumask_clear_cpu(cpu, topology_sibling_cpumask(sibling));
> -     for_each_cpu(sibling, cpu_llc_shared_mask(cpu))
> -             cpumask_clear_cpu(cpu, cpu_llc_shared_mask(sibling));
> -     cpumask_clear(cpu_llc_shared_mask(cpu));
> -     cpumask_clear(topology_sibling_cpumask(cpu));
> -     cpumask_clear(topology_core_cpumask(cpu));


It was just pointed out to me that the these masks tend to be used in
the kernel as "sibling & online mask", and "core & online_mask".  The other
values are indeed static and are safe to use.

I'll "self-nak" this and come up with a slightly better patchset.

P.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to