Re: arm64: dropping prevent_bootmem_remove_notifier

Sudarshan Rajagopalan Thu, 29 Oct 2020 14:04:25 -0700



Hi Anshuman, David,

Thanks for all the detailed explanations for the reasoning to havebootmem protected from being removed. Also, I do agree drivers beingable to mark memory sections isn't the right thing to do.

We went ahead with the approach of using "mem=" as you suggested tolimit the bootmem and add remaining blocks usingadd_memory_driver_managed() so that driver has ownership of theseblocks.

We do have some follow-up questions regarding this - will initiate adiscussion soon.



On 2020-10-18 22:37, Anshuman Khandual wrote:

Hello Sudarshan,

On 10/17/2020 04:41 AM, Sudarshan Rajagopalan wrote:
Hello Anshuman,
In the patch that enables memory hot-remove (commit bbd6ec605c0f("arm64/mm: Enable memory hot remove")) for arm64, there’s a notifierput in place that prevents boot memory from being offlined andremoved. Also commit text mentions that boot memory on arm64 cannot beremoved. We wanted to understand more about the reasoning for this.X86 and other archs doesn’t seem to do this prevention. There’s alsocomment in the code that this notifier could be dropped in future ifand when boot memory can be removed.
Right and till then the notifier cannot be dropped. There was a lot of
discussions
around this topic during multiple iterations of memory hot remove
series. Hence, I
would just request you to please go through them first. This list here
is from one
such series (https://lwn.net/Articles/809179/) but might not beexhaustive.
-----------------
On arm64 platform, it is essential to ensure that the boot timediscovered
memory couldn't be hot-removed so that,

1. FW data structures used across kexec are idempotent
   e.g. the EFI memory map.
2. linear map or vmemmap would not have to be dynamically split, andcan
   map boot memory at a large granularity
3. Avoid penalizing paths that have to walk page tables, where we canbe
   certain that the memory is not hot-removable
-----------------
The primary reason being kexec which would need substantial reworkotherwise.
The current logic is that only “new” memory blocks which are hot-addedcan later be offlined and removed. The memory that system booted upwith cannot be offlined and removed. But there could be many usercasessuch as inter-VM memory sharing where a primary VM could offline andhot-remove a block/section of memory and lend it to secondary VM whereit could hot-add it. And after usecase is done, the reverse happenswhere secondary VM hot-removes and gives it back to primary which canhot-add it back. In such cases, the present logic for arm64 doesn’tallow this hot-remove in primary to happen.
That is not true. Each VM could just boot with a minimum boot memorywhich cannot be offlined or removed but then a possible larger portion of memorycan behot added during the boot process itself, making them available for anyfutureinter VM sharing purpose. Hence this problem could easily be solved inthe user
space itself.
Also, on systems with movable zone that sort of guarantees pages to bemigrated and isolated so that blocks can be offlined, this logic alsodefeats the purpose of having a movable zone which system can rely onmemory hot-plugging, which say virt-io mem also relies on for fullyplugged memory blocks.
ZONE_MOVABLE does not really guarantee migration, isolation andremoval. Thereare reasons an offline request might just fail. I agree that thosereasons arenormally not platform related but core memory gives platform anopportunity todecline an offlining request via a notifier. Hence ZONE_MOVABLE offlinecan be
denied. Semantics wise we are still okay.
This might look bit inconsistent thatmovablecore/kernelcore/movable_node withfirmware sending in 'hot pluggable' memory (IIRC arm64 does not reallysupportthis yet), the system might end up with ZONE_MOVABLE marked boot memorywhichcannot be offlined or removed. But an offline notifier action isorthogonal.Hence did not block those kernel command line paths that createsZONE_MOVABLE
during boot to preserve existing behavior.
I understand that some region of boot RAM shouldn’t be allowed to beremoved, but such regions won’t be allowed to be offlined in firstplace since pages cannot be migrated and isolated, example reservedpages.
So we’re trying to understand the reasoning for such a prevention putin place for arm64 arch alone.
Primary reason being kexec. During kexec on arm64, next kernel's memorymap isderived from firmware and not from current running kernel. So the nextkernelwill crash if it would access memory that might have been removed inrunningkernel. Until kexec on arm64 changes substantially and takes intoaccount thereal available memory on the current kernel, boot memory cannot beremoved.
One possible way to solve this is by marking the required sections as“non-early” by removing the SECTION_IS_EARLY bit in itssection_mem_map.
That is too intrusive from core memory perspective.

 This puts these sections in the context of “memory hotpluggable”
which can be offlined-removed and added-onlined which are part of boot
RAM itself and doesn’t need any extra blocks to be hot added. This way
of marking certain sections as “non-early” could be exported so that
module drivers can set the required number of sections as “memory
hotpluggable”. This could have certain checks put in place to see
which sections are allowed, example only movable zone sections can be
marked as “non-early”.
Giving modules the right to mark memory hotpluggable ? That is toointrusive
and would still not solve the problem with kexec.
Your thoughts on this? We are also looking for different ways to solvethe problem without having to completely dropping this notifier, butjust putting out the concern here about the notifier logic that isbreaking our usecase which is a generic memory sharing usecase usingmemory hotplug feature.
Completely preventing boot memory offline and removal is essential forkexecto work as expected afterwards. As suggested previously, splitting theVMmemory into boot and non boot chunks during init can help work aroundthisrestriction effectively in userspace itself and would not require anykernel
changes.

- Anshuman



Sudarshan

--

Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, aLinux Foundation Collaborative Project

Re: arm64: dropping prevent_bootmem_remove_notifier

Reply via email to