On 01/27/2018 02:58 AM, Michael Ellerman wrote: > Nathan Fontenot <nf...@linux.vnet.ibm.com> writes: > >> When DLPAR removing a CPU, the unmapping of the cpu from a node in >> unmap_cpu_from_node() should also invalidate the CPUs entry in the >> numa_cpu_lookup_table. There is not a guarantee that on a subsequent >> DLPAR add of the CPU the associativity will be the same and thus >> could be in a different node. Invalidating the entry in the >> numa_cpu_lookup_table causes the associativity to be read from the >> device tree at the time of the add. > > This last part seems to contradict the change log of commit d4edc5b6c480 > ("powerpc: Fix the setup of CPU-to-Node mappings during CPU online"), > which seems to say that we shouldn't be looking at the device tree. > > Can you explain to me what I'm missing?
The commit you refer to addresses CPU online/offline behavior and is correct that we shouldn't reference the device tree. The cpu-to-node mapping shouldn't change across a offline/online operation since the CPU remains assigned to the partition the entire time. This patch addresses CPUs that have been DLPAR removed, and as such the CPU is no longer assigned to the partition. Given this we don't have a guarantee that the CPU will have the same node-to-cpu mapping when it is assigned back to the partition on a subsequent DLPAR add operation. Without this patch, the CPU is put back in the node it was in previously which may not match the node firmware states it belongs to. > > Also when did this break, always? Which commit should I mark this as > fixing? As far as I know this has always been broken. I've looked the the git logs for the numa and pseries cpu hotplug code and don't see a specific commit I can point at for breaking this. -Nathan