Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 13/01/2025 12:00, Usama Arif wrote: > > > On 13/01/2025 11:27, Usama Arif wrote: >> >> >> On 13/01/2025 02:33, Dave Young wrote: >>> On Fri, 10 Jan 2025 at 18:54, Usama Arif wrote: On 10/01/2025 07:21, Ard Biesheuvel wrote: > On Thu, 9 Jan 2025 at 17:36, Usama Arif wrote: >> >> >> >> On 09/01/2025 15:45, Ard Biesheuvel wrote: >>> On Wed, 8 Jan 2025 at 23:00, Usama Arif wrote: The commit in [1] introduced a check to see if EFI memory attributes table was corrupted. It assumed that efi.memmap.nr_map remains constant, but it changes during late boot. Hence, the check is valid during cold boot, but not in the subsequent kexec boot. This is best explained with an exampled. At cold boot, for a test machine: efi.memmap.nr_map=91, memory_attributes_table->num_entries=48, desc_size = 48 Hence, the check introduced in [1] where 3x the size of the entire EFI memory map is a reasonable upper bound for the size of this table is valid. In late boot __efi_enter_virtual_mode calls 2 functions that updates efi.memmap.nr_map: - efi_map_regions which reduces the `count` of map entries (for e.g. if should_map_region returns false) and which is reflected in efi.memmap by __efi_memmap_init. At this point efi.memmap.nr_map becomes 46 in the test machine. - efi_free_boot_services which also reduces the number of memory regions available (for e.g. if md->type or md->attribute is not the right value). At this point efi.memmap.nr_map becomes 9 in the test machine. Hence when you kexec into a new kernel and pass efi.memmap, the paramaters that are compared are: efi.memmap.nr_map=9, memory_attributes_table->num_entries=48, desc_size = 48 where the check in [1] is no longer valid with such a low efi.memmap.nr_map as it was reduced due to efi_map_regions and efi_free_boot_services. A more appropriate check is to see if the description size reported by efi and memory attributes table is the same. [1] https://lore.kernel.org/all/20241031175822.2952471-2-ardb+...@google.com/ Fixes: 8fbe4c49c0cc ("efi/memattr: Ignore table if the size is clearly bogus") Reported-by: Breno Leitao Signed-off-by: Usama Arif --- drivers/firmware/efi/memattr.c | 16 ++-- 1 file changed, 6 insertions(+), 10 deletions(-) >>> >>> The more I think about this, the more I feel that kexec on x86 should >>> simply discard this table, and run with the firmware code RWX (which >>> is not the end of the world). >> >> >> By discard this table, do you mean kexec not use e820_table_firmware? > > No, I mean kexec ignores the memory attributes table. > >> Also a very basic question, what do you mean by run with the firmware >> RWX? >> > > The memory attributes table is an overlay for the EFI memory map that > describes which runtime code regions may be mapped with restricted > permissions. Without this table, everything will be mapped writable as > well as executable, but only in the EFI page tables, which are only > active when an EFI call is in progress. > Thanks for explaining! So basically get rid of memattr.c :) Do you mean get rid of it only for kexec, or not do it for any boot (including cold boot)? I do like this idea! I couldn't find this in the git history, but do you know if this was added in the linux kernel just because EFI spec added support for it, or if there was a specific security problem? >>> >>> Usama, can you try the patch below? >>> [ format is wrong due to webmail corruption. But if it works I can >>> send a formal patch later ] >>> >>> $ git diff arch/x86 >>> diff --git a/arch/x86/platform/efi/quirks.c b/arch/x86/platform/efi/quirks.c >>> index 846bf49f2508..58dc77c5210e 100644 >>> --- a/arch/x86/platform/efi/quirks.c >>> +++ b/arch/x86/platform/efi/quirks.c >>> @@ -561,6 +561,11 @@ int __init efi_reuse_config(u64 tables, int nr_tables) >>> >>> if (!efi_guidcmp(guid, SMBIOS_TABLE_GUID)) >>> ((efi_config_table_64_t *)p)->table = data->smbios; >>> + >>> + /* Not bother to play with mem attr table across kexec */ >>> + if (!efi_guidcmp(guid, EFI_MEMORY_ATTRIBUTES_TABLE_GUID)) >>> + ((efi_config_table_64_t *)p)->table = >>> EFI_INVALID_TABLE_ADDR; >>> + >>> p += sz; >>> } >>> >> >> This would work, I am guessing it will have a similar effect to what I sent >> last week in >> https:
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 20/01/2025 10:32, Ard Biesheuvel wrote: > On Mon, 20 Jan 2025 at 11:27, Usama Arif wrote: >> >> > ... >> Hi Ard, >> >> Just wanted to check how should we proceed forward? Should we try and fix >> the warning >> and corruption during kexec as done in this series or not initialize memory >> attributes >> table at all in kexec boot? I would prefer fixing the issues as in this >> series. >> > > I would prefer kexec boot on x86 to disregard the memory attributes > table entirely. Would you like Dave to send something like https://lore.kernel.org/all/CALu+AoS8tb=hgaybdw5og4a1qboxhvuidpu0ynmz-ne1nbq...@mail.gmail.com/ on the mailing list (wrapped in ifdef CONFIG_X86_64) or we could do https://lore.kernel.org/all/fd63613c-fd26-42de-b5ed-cc734f72e...@gmail.com/ as well. Or if there is any other way you prefer, I am happy to implement and test. Thanks, Usama -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121054): https://edk2.groups.io/g/devel/message/121054 Mute This Topic: https://groups.io/mt/110517813/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 20/01/2025 11:29, Ard Biesheuvel wrote: > On Mon, 20 Jan 2025 at 11:50, Usama Arif wrote: >> >> >> >> On 20/01/2025 10:32, Ard Biesheuvel wrote: >>> On Mon, 20 Jan 2025 at 11:27, Usama Arif wrote: >>> ... Hi Ard, Just wanted to check how should we proceed forward? Should we try and fix the warning and corruption during kexec as done in this series or not initialize memory attributes table at all in kexec boot? I would prefer fixing the issues as in this series. >>> >>> I would prefer kexec boot on x86 to disregard the memory attributes >>> table entirely. >> >> Would you like Dave to send something like >> https://lore.kernel.org/all/CALu+AoS8tb=hgaybdw5og4a1qboxhvuidpu0ynmz-ne1nbq...@mail.gmail.com/ >> on the mailing list (wrapped in ifdef CONFIG_X86_64) >> > > I prefer this approach. and no need for the ifdef, this is x86 > specific code, and the memory attributes table is already ignored > entirely on 32-bit x86 iirc ah yes, I ignored the file name when reviewing it and just focused on the function :) Will wait for Dave to send it. Thanks, Usama -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121055): https://edk2.groups.io/g/devel/message/121055 Mute This Topic: https://groups.io/mt/110517813/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 22/01/2025 05:36, Dave Young wrote: > Hi, > On Mon, 20 Jan 2025 at 19:48, Usama Arif wrote: >> >> >> >> On 20/01/2025 11:29, Ard Biesheuvel wrote: >>> On Mon, 20 Jan 2025 at 11:50, Usama Arif wrote: On 20/01/2025 10:32, Ard Biesheuvel wrote: > On Mon, 20 Jan 2025 at 11:27, Usama Arif wrote: >> >> > ... >> Hi Ard, >> >> Just wanted to check how should we proceed forward? Should we try and >> fix the warning >> and corruption during kexec as done in this series or not initialize >> memory attributes >> table at all in kexec boot? I would prefer fixing the issues as in this >> series. >> > > I would prefer kexec boot on x86 to disregard the memory attributes > table entirely. Would you like Dave to send something like https://lore.kernel.org/all/CALu+AoS8tb=hgaybdw5og4a1qboxhvuidpu0ynmz-ne1nbq...@mail.gmail.com/ on the mailing list (wrapped in ifdef CONFIG_X86_64) >>> >>> I prefer this approach. and no need for the ifdef, this is x86 >>> specific code, and the memory attributes table is already ignored >>> entirely on 32-bit x86 iirc >> >> ah yes, I ignored the file name when reviewing it and just focused on the >> function :) >> >> Will wait for Dave to send it. > > Ok, I will add reported-by from you and suggested-by from Ard. But I > can not test the efi mem attr, I'd prefer to have your test results > first. Could you confirm that? > Please also add Reported-by: Breno Leitao who originally discovered and reported the issue. I have tested it and it works, efi_memattr_init will just return 0 as efi_mem_attr_table = EFI_INVALID_TABLE_ADDR, so it won't be initialized. Feel free to add Tested-by tag from me. Thanks, Usama -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121057): https://edk2.groups.io/g/devel/message/121057 Mute This Topic: https://groups.io/mt/110517813/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 09/01/2025 15:45, Ard Biesheuvel wrote: > On Wed, 8 Jan 2025 at 23:00, Usama Arif wrote: >> >> The commit in [1] introduced a check to see if EFI memory attributes >> table was corrupted. It assumed that efi.memmap.nr_map remains >> constant, but it changes during late boot. >> Hence, the check is valid during cold boot, but not in the subsequent >> kexec boot. >> >> This is best explained with an exampled. At cold boot, for a test >> machine: >> efi.memmap.nr_map=91, >> memory_attributes_table->num_entries=48, >> desc_size = 48 >> Hence, the check introduced in [1] where 3x the size of the >> entire EFI memory map is a reasonable upper bound for the size of this >> table is valid. >> >> In late boot __efi_enter_virtual_mode calls 2 functions that updates >> efi.memmap.nr_map: >> - efi_map_regions which reduces the `count` of map entries >> (for e.g. if should_map_region returns false) and which is reflected >> in efi.memmap by __efi_memmap_init. >> At this point efi.memmap.nr_map becomes 46 in the test machine. >> - efi_free_boot_services which also reduces the number of memory regions >> available (for e.g. if md->type or md->attribute is not the right value). >> At this point efi.memmap.nr_map becomes 9 in the test machine. >> Hence when you kexec into a new kernel and pass efi.memmap, the >> paramaters that are compared are: >> efi.memmap.nr_map=9, >> memory_attributes_table->num_entries=48, >> desc_size = 48 >> where the check in [1] is no longer valid with such a low efi.memmap.nr_map >> as it was reduced due to efi_map_regions and efi_free_boot_services. >> >> A more appropriate check is to see if the description size reported by >> efi and memory attributes table is the same. >> >> [1] https://lore.kernel.org/all/20241031175822.2952471-2-ardb+...@google.com/ >> >> Fixes: 8fbe4c49c0cc ("efi/memattr: Ignore table if the size is clearly >> bogus") >> Reported-by: Breno Leitao >> Signed-off-by: Usama Arif >> --- >> drivers/firmware/efi/memattr.c | 16 ++-- >> 1 file changed, 6 insertions(+), 10 deletions(-) >> > > The more I think about this, the more I feel that kexec on x86 should > simply discard this table, and run with the firmware code RWX (which > is not the end of the world). By discard this table, do you mean kexec not use e820_table_firmware? Also a very basic question, what do you mean by run with the firmware RWX? Sorry for the very basic questions above! > > The main reason is that the EFI memory map and the EFI memory > attributes table are supposed to be a matched pair, where each RTcode > entry in the former is broken down into multiple code and data > segments in the latter. The amount of mangling that the x86 arch code > does of the EFI memory map makes it intractable to ensure that they > remain in sync, and so it is better not to bother. > > >> diff --git a/drivers/firmware/efi/memattr.c b/drivers/firmware/efi/memattr.c >> index c38b1a335590..d3bc161361fb 100644 >> --- a/drivers/firmware/efi/memattr.c >> +++ b/drivers/firmware/efi/memattr.c >> @@ -40,21 +40,17 @@ int __init efi_memattr_init(void) >> goto unmap; >> } >> >> - >> /* >> -* Sanity check: the Memory Attributes Table contains up to 3 entries >> -* for each entry of type EfiRuntimeServicesCode in the EFI memory >> map. >> -* So if the size of the table exceeds 3x the size of the entire EFI >> -* memory map, there is clearly something wrong, and the table should >> -* just be ignored altogether. >> +* Sanity check: the Memory Attributes Table desc_size and >> +* efi.memmap.desc_size should match. >> */ >> - size = tbl->num_entries * tbl->desc_size; >> - if (size > 3 * efi.memmap.nr_map * efi.memmap.desc_size) { >> - pr_warn(FW_BUG "Corrupted EFI Memory Attributes Table >> detected! (version == %u, desc_size == %u, num_entries == %u)\n", >> - tbl->version, tbl->desc_size, tbl->num_entries); >> + if (efi.memmap.desc_size != tbl->desc_size) { >> + pr_warn(FW_BUG "Corrupted EFI Memory Attributes Table >> detected! (version == %u, table desc_size == %u, efi.memmap.desc_size == >> %lu, table num_entries == %u)\n", >> + tbl->version, tbl->desc_size, efi.memmap.desc_size, >> tbl->num_entries); >> goto unmap; >> } >> >> + size = tbl->num_entries * tbl->desc_size; >> tbl_size = sizeof(*tbl) + size; >> memblock_reserve(efi_mem_attr_table, tbl_size); >> set_bit(EFI_MEM_ATTR, &efi.flags); >> -- >> 2.43.5 >> -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121005): https://edk2.groups.io/g/devel/message/121005 Mute This Topic: https://groups.io/mt/110517813/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 2/2] efi/memattr: add efi_mem_attr_table as a reserved region in 820_table_firmware
On 09/01/2025 16:15, Ard Biesheuvel wrote: > On Wed, 8 Jan 2025 at 23:00, Usama Arif wrote: >> >> When this area is not reserved, it comes up as usable in >> /sys/firmware/memmap. This means that kexec, which uses that memmap >> to find usable memory regions, can select the region where >> efi_mem_attr_table is and overwrite it and relocate_kernel. >> >> Since the patch in [1] was merged, all boots after kexec >> are producing the warning that it introduced. >> >> Having a fix in firmware can be difficult to get through. > > I don't follow. I don't think there is anything wrong with the > firmware here. Could you elaborate? > So the problem is, kexec sees this memory as System RAM, and decides it can be used to place an image here. I guess the question is (and I actually don't know the answer here), whose responsibility is it to mark this region as reserved so that its not touched by anyone else. I would have thought it should be firmware? Maybe its not the firmwares' job to mark it as reserved, but just pass it to kernel and the kernel is supposed to make sure it gets reserved in a proper way, even across kexecs. I think in the end whoevers' responsibility it is, the easiest path forward seems to be in kernel? (and not firmware or libstub) > >> The next ideal place would be in libstub. However, it looks like >> InstallMemoryAttributesTable [2] is not available as a boot service >> call option [3], [4], and install_configuration_table does not >> seem to work as a valid substitute. >> > > To do what, exactly? > To change the memory type from System RAM to either reserved or something more appropriate, i.e. any type that is not touched by kexec or any other userspace. Basically the example code I attached at the end of the cover letter in https://lore.kernel.org/all/20250108215957.3437660-1-usamaarif...@gmail.com/ It could be EFI_ACPI_RECLAIM_MEMORY or EFI_RESERVED_TYPE, both of which aren't touched by kexec. >> As a last option for a fix, this patch marks that region as reserved in >> e820_table_firmware if it is currently E820_TYPE_RAM so that kexec doesn't >> use it for kernel segments. >> >> [1] https://lore.kernel.org/all/20241031175822.2952471-2-ardb+...@google.com/ >> [2] >> https://github.com/tianocore/edk2/blob/master/MdeModulePkg/Core/Dxe/Misc/MemoryAttributesTable.c#L100 >> [3] >> https://github.com/tianocore/edk2/blob/42a141800c0c26a09d2344e84a89ce4097a263ae/MdeModulePkg/Core/Dxe/DxeMain/DxeMain.c#L41 >> [4] >> https://elixir.bootlin.com/linux/v6.12.6/source/drivers/firmware/efi/libstub/efistub.h#L327 >> >> Reported-by: Breno Leitao >> Signed-off-by: Usama Arif >> --- >> arch/x86/include/asm/e820/api.h | 2 ++ >> arch/x86/kernel/e820.c | 6 ++ >> arch/x86/platform/efi/efi.c | 9 + >> drivers/firmware/efi/memattr.c | 1 + >> include/linux/efi.h | 7 +++ >> 5 files changed, 25 insertions(+) >> >> diff --git a/arch/x86/include/asm/e820/api.h >> b/arch/x86/include/asm/e820/api.h >> index 2e74a7f0e935..4e9aa24f03bd 100644 >> --- a/arch/x86/include/asm/e820/api.h >> +++ b/arch/x86/include/asm/e820/api.h >> @@ -16,6 +16,8 @@ extern bool e820__mapped_all(u64 start, u64 end, enum >> e820_type type); >> >> extern void e820__range_add (u64 start, u64 size, enum e820_type type); >> extern u64 e820__range_update(u64 start, u64 size, enum e820_type >> old_type, enum e820_type new_type); >> +extern u64 e820__range_update_firmware(u64 start, u64 size, enum e820_type >> old_type, >> + enum e820_type new_type); >> extern u64 e820__range_remove(u64 start, u64 size, enum e820_type >> old_type, bool check_type); >> extern u64 e820__range_update_table(struct e820_table *t, u64 start, u64 >> size, enum e820_type old_type, enum e820_type new_type); >> >> diff --git a/arch/x86/kernel/e820.c b/arch/x86/kernel/e820.c >> index 82b96ed9890a..01d7d3c0d299 100644 >> --- a/arch/x86/kernel/e820.c >> +++ b/arch/x86/kernel/e820.c >> @@ -538,6 +538,12 @@ u64 __init e820__range_update_table(struct e820_table >> *t, u64 start, u64 size, >> return __e820__range_update(t, start, size, old_type, new_type); >> } >> >> +u64 __init e820__range_update_firmware(u64 start, u64 size, enum e820_type >> old_type, >> + enum e820_type new_type) >> +{ >> + return __e820__range_update(e820_table_firmware, start, size, >> old_type, new_type); >> +} >> + >> /* Remove a range of memory from the E820 table: */ >> u64 __init e820__range_remove(u64 start, u64 size, enum e820_type old_type, >> bool check_type) >> { >> diff --git a/arch/x86/platform/efi/efi.c b/arch/x86/platform/efi/efi.c >> index a7ff189421c3..13684c5d7c05 100644 >> --- a/arch/x86/platform/efi/efi.c >> +++ b/arch/x86/platform/efi/efi.c >> @@ -168,6 +168,15 @@ static void __init do_add_efi_memmap(void) >> e820__update_table(e820_table); >> } >> >> +/* Reserve firmware area if it was marked as RAM */ >> +
Re: [edk2-devel] [RFC 2/2] efi/memattr: add efi_mem_attr_table as a reserved region in 820_table_firmware
On 10/01/2025 02:50, Dave Young wrote: > Hi Usama, > > On Thu, 9 Jan 2025 at 06:00, Usama Arif wrote: >> >> When this area is not reserved, it comes up as usable in >> /sys/firmware/memmap. This means that kexec, which uses that memmap >> to find usable memory regions, can select the region where >> efi_mem_attr_table is and overwrite it and relocate_kernel. > > Is the attr table BOOT SERVICE DATA? If so, does efi_mem_reserve() > work for you? > Just refer to esrt.c. > Hi Dave, Its a bit difficult to reproduce the problem and therefore test the fix, but we are seeing it a lot in production. Ard proposed the same thing in https://lore.kernel.org/all/6b4780a5-ada0-405e-9f0a-4d2186177...@gmail.com/ but as I mentioned there, I dont think that efi_mem_reserve would help, as efi_mem_reserve changes e820_table, while kexec looks at /sys/firmware/memmap which uses e820_table_firmware. Thanks, Usama > Thanks > Dave > -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121008): https://edk2.groups.io/g/devel/message/121008 Mute This Topic: https://groups.io/mt/110518541/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 2/2] efi/memattr: add efi_mem_attr_table as a reserved region in 820_table_firmware
On 10/01/2025 07:32, Ard Biesheuvel wrote: > On Thu, 9 Jan 2025 at 17:32, Usama Arif wrote: >> >> >> >> On 09/01/2025 16:15, Ard Biesheuvel wrote: >> I think in the end whoevers' responsibility it is, the easiest path forward >> seems to be in kernel? (and not firmware or libstub) >> > > Agreed. But as I pointed out in the other thread, the memory > attributes table only augments the memory map with permission > information, and can be disregarded, and given how badly we mangle the > memory map on x86, maybe this is the right choice here. > >>> The next ideal place would be in libstub. However, it looks like InstallMemoryAttributesTable [2] is not available as a boot service call option [3], [4], and install_configuration_table does not seem to work as a valid substitute. >>> >>> To do what, exactly? >>> >> >> To change the memory type from System RAM to either reserved or >> something more appropriate, i.e. any type that is not touched by >> kexec or any other userspace. >> >> Basically the example code I attached at the end of the cover letter in >> https://lore.kernel.org/all/20250108215957.3437660-1-usamaarif...@gmail.com/ >> It could be EFI_ACPI_RECLAIM_MEMORY or EFI_RESERVED_TYPE, both of which >> aren't >> touched by kexec. >> > > This is a kexec problem (on x86 only) so let's fix it there. I don't believe we can accurately tell if we are booting from a cold boot or kexec. There is bootloader_type available for x86, but not sure if we should rely on that. I think a way forward would be to move it behind a Kconfig option, something like below, which defaults to n for x86. Anyone who needs it can enable it. What do you think? diff --git a/arch/x86/platform/efi/efi.c b/arch/x86/platform/efi/efi.c index aa95f77d7a30..31deb0a5371e 100644 --- a/arch/x86/platform/efi/efi.c +++ b/arch/x86/platform/efi/efi.c @@ -83,7 +83,9 @@ static const unsigned long * const efi_tables[] = { &efi_config_table, &efi.esrt, &prop_phys, +#ifdef CONFIG_EFI_MEMATTR &efi_mem_attr_table, +#endif #ifdef CONFIG_EFI_RCI2_TABLE &rci2_table_phys, #endif diff --git a/drivers/firmware/efi/Kconfig b/drivers/firmware/efi/Kconfig index 72f2537d90ca..b8ecb318768c 100644 --- a/drivers/firmware/efi/Kconfig +++ b/drivers/firmware/efi/Kconfig @@ -287,6 +287,13 @@ config EFI_EMBEDDED_FIRMWARE bool select CRYPTO_LIB_SHA256 +config EFI_MEMATTR + bool "EFI Memory attributes table" + default n if X86_64 + help + EFI Memory Attributes table describes memory protections that may + be applied to the EFI Runtime code and data regions by the kernel. + endmenu config UEFI_CPER diff --git a/drivers/firmware/efi/Makefile b/drivers/firmware/efi/Makefile index a2d0009560d0..c593ec0d9940 100644 --- a/drivers/firmware/efi/Makefile +++ b/drivers/firmware/efi/Makefile @@ -11,7 +11,9 @@ KASAN_SANITIZE_runtime-wrappers.o := n obj-$(CONFIG_ACPI_BGRT)+= efi-bgrt.o -obj-$(CONFIG_EFI) += efi.o vars.o reboot.o memattr.o tpm.o +obj-$(CONFIG_EFI) += efi.o vars.o reboot.o tpm.o +obj-$(CONFIG_EFI_MEMATTR) += memattr.o + obj-$(CONFIG_EFI) += memmap.o ifneq ($(CONFIG_EFI_CAPSULE_LOADER),) obj-$(CONFIG_EFI) += capsule.o diff --git a/drivers/firmware/efi/efi.c b/drivers/firmware/efi/efi.c index fdf07dd6f459..f359179083d5 100644 --- a/drivers/firmware/efi/efi.c +++ b/drivers/firmware/efi/efi.c @@ -596,7 +596,9 @@ static const efi_config_table_type_t common_tables[] __initconst = { {SMBIOS_TABLE_GUID, &efi.smbios, "SMBIOS"}, {SMBIOS3_TABLE_GUID,&efi.smbios3, "SMBIOS 3.0"}, {EFI_SYSTEM_RESOURCE_TABLE_GUID,&efi.esrt, "ESRT" }, +#ifdef CONFIG_EFI_MEMATTR {EFI_MEMORY_ATTRIBUTES_TABLE_GUID, &efi_mem_attr_table, "MEMATTR" }, +#endif {LINUX_EFI_RANDOM_SEED_TABLE_GUID, &efi_rng_seed, "RNG" }, {LINUX_EFI_TPM_EVENT_LOG_GUID, &efi.tpm_log, "TPMEventLog" }, {EFI_TCG2_FINAL_EVENTS_TABLE_GUID, &efi.tpm_final_log, "TPMFinalLog" }, diff --git a/include/linux/efi.h b/include/linux/efi.h index 9c239cdff771..4cf5ebe014e2 100644 --- a/include/linux/efi.h +++ b/include/linux/efi.h @@ -783,9 +783,21 @@ extern unsigned long efi_mem_attr_table; */ typedef int (*efi_memattr_perm_setter)(struct mm_struct *, efi_memory_desc_t *, bool); +#ifdef CONFIG_EFI_MEMATTR extern int efi_memattr_init(void); extern int efi_memattr_apply_permissions(struct mm_struct *mm, efi_memattr_perm_setter fn); +#else +static inline int efi_memattr_init(void) +{ + return 0; +} +static inline int efi_memattr_apply_permissions(struct mm_struct *mm, +
Re: [edk2-devel] [RFC 2/2] efi/memattr: add efi_mem_attr_table as a reserved region in 820_table_firmware
On 10/01/2025 14:31, Usama Arif wrote: > > > On 10/01/2025 07:32, Ard Biesheuvel wrote: >> On Thu, 9 Jan 2025 at 17:32, Usama Arif wrote: >>> >>> >>> >>> On 09/01/2025 16:15, Ard Biesheuvel wrote: > >>> I think in the end whoevers' responsibility it is, the easiest path forward >>> seems to be in kernel? (and not firmware or libstub) >>> >> >> Agreed. But as I pointed out in the other thread, the memory >> attributes table only augments the memory map with permission >> information, and can be disregarded, and given how badly we mangle the >> memory map on x86, maybe this is the right choice here. >> > The next ideal place would be in libstub. However, it looks like > InstallMemoryAttributesTable [2] is not available as a boot service > call option [3], [4], and install_configuration_table does not > seem to work as a valid substitute. > To do what, exactly? >>> >>> To change the memory type from System RAM to either reserved or >>> something more appropriate, i.e. any type that is not touched by >>> kexec or any other userspace. >>> >>> Basically the example code I attached at the end of the cover letter in >>> https://lore.kernel.org/all/20250108215957.3437660-1-usamaarif...@gmail.com/ >>> It could be EFI_ACPI_RECLAIM_MEMORY or EFI_RESERVED_TYPE, both of which >>> aren't >>> touched by kexec. >>> >> >> This is a kexec problem (on x86 only) so let's fix it there. > > > I don't believe we can accurately tell if we are booting from a cold boot or > kexec. > There is bootloader_type available for x86, but not sure if we should rely on > that. I think a way forward would be to move it behind a Kconfig option, > something like > below, which defaults to n for x86. Anyone who needs it can enable it. What > do you think? > Or we can do something like below? diff --git a/drivers/firmware/efi/memattr.c b/drivers/firmware/efi/memattr.c index d131781e2d7b..4add694b18d0 100644 --- a/drivers/firmware/efi/memattr.c +++ b/drivers/firmware/efi/memattr.c @@ -24,6 +24,15 @@ int __init efi_memattr_init(void) efi_memory_attributes_table_t *tbl; unsigned long size; +#ifdef CONFIG_X86_64 + /* +* On x86_64, do not initialize memory attributes table +* if booting from kexec +*/ + if (bootloader_type >> 4 == 0xd) + return 0; +#endif + if (efi_mem_attr_table == EFI_INVALID_TABLE_ADDR) return 0; -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121014): https://edk2.groups.io/g/devel/message/121014 Mute This Topic: https://groups.io/mt/110518541/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 13/01/2025 11:27, Usama Arif wrote: > > > On 13/01/2025 02:33, Dave Young wrote: >> On Fri, 10 Jan 2025 at 18:54, Usama Arif wrote: >>> >>> >>> >>> On 10/01/2025 07:21, Ard Biesheuvel wrote: On Thu, 9 Jan 2025 at 17:36, Usama Arif wrote: > > > > On 09/01/2025 15:45, Ard Biesheuvel wrote: >> On Wed, 8 Jan 2025 at 23:00, Usama Arif wrote: >>> >>> The commit in [1] introduced a check to see if EFI memory attributes >>> table was corrupted. It assumed that efi.memmap.nr_map remains >>> constant, but it changes during late boot. >>> Hence, the check is valid during cold boot, but not in the subsequent >>> kexec boot. >>> >>> This is best explained with an exampled. At cold boot, for a test >>> machine: >>> efi.memmap.nr_map=91, >>> memory_attributes_table->num_entries=48, >>> desc_size = 48 >>> Hence, the check introduced in [1] where 3x the size of the >>> entire EFI memory map is a reasonable upper bound for the size of this >>> table is valid. >>> >>> In late boot __efi_enter_virtual_mode calls 2 functions that updates >>> efi.memmap.nr_map: >>> - efi_map_regions which reduces the `count` of map entries >>> (for e.g. if should_map_region returns false) and which is reflected >>> in efi.memmap by __efi_memmap_init. >>> At this point efi.memmap.nr_map becomes 46 in the test machine. >>> - efi_free_boot_services which also reduces the number of memory regions >>> available (for e.g. if md->type or md->attribute is not the right >>> value). >>> At this point efi.memmap.nr_map becomes 9 in the test machine. >>> Hence when you kexec into a new kernel and pass efi.memmap, the >>> paramaters that are compared are: >>> efi.memmap.nr_map=9, >>> memory_attributes_table->num_entries=48, >>> desc_size = 48 >>> where the check in [1] is no longer valid with such a low >>> efi.memmap.nr_map >>> as it was reduced due to efi_map_regions and efi_free_boot_services. >>> >>> A more appropriate check is to see if the description size reported by >>> efi and memory attributes table is the same. >>> >>> [1] >>> https://lore.kernel.org/all/20241031175822.2952471-2-ardb+...@google.com/ >>> >>> Fixes: 8fbe4c49c0cc ("efi/memattr: Ignore table if the size is clearly >>> bogus") >>> Reported-by: Breno Leitao >>> Signed-off-by: Usama Arif >>> --- >>> drivers/firmware/efi/memattr.c | 16 ++-- >>> 1 file changed, 6 insertions(+), 10 deletions(-) >>> >> >> The more I think about this, the more I feel that kexec on x86 should >> simply discard this table, and run with the firmware code RWX (which >> is not the end of the world). > > > By discard this table, do you mean kexec not use e820_table_firmware? No, I mean kexec ignores the memory attributes table. > Also a very basic question, what do you mean by run with the firmware RWX? > The memory attributes table is an overlay for the EFI memory map that describes which runtime code regions may be mapped with restricted permissions. Without this table, everything will be mapped writable as well as executable, but only in the EFI page tables, which are only active when an EFI call is in progress. >>> >>> Thanks for explaining! >>> >>> So basically get rid of memattr.c :) >>> >>> Do you mean get rid of it only for kexec, or not do it for any >>> boot (including cold boot)? >>> I do like this idea! I couldn't find this in the git history, >>> but do you know if this was added in the linux kernel just >>> because EFI spec added support for it, or if there was a >>> specific security problem? >>> >> >> Usama, can you try the patch below? >> [ format is wrong due to webmail corruption. But if it works I can >> send a formal patch later ] >> >> $ git diff arch/x86 >> diff --git a/arch/x86/platform/efi/quirks.c b/arch/x86/platform/efi/quirks.c >> index 846bf49f2508..58dc77c5210e 100644 >> --- a/arch/x86/platform/efi/quirks.c >> +++ b/arch/x86/platform/efi/quirks.c >> @@ -561,6 +561,11 @@ int __init efi_reuse_config(u64 tables, int nr_tables) >> >> if (!efi_guidcmp(guid, SMBIOS_TABLE_GUID)) >> ((efi_config_table_64_t *)p)->table = data->smbios; >> + >> + /* Not bother to play with mem attr table across kexec */ >> + if (!efi_guidcmp(guid, EFI_MEMORY_ATTRIBUTES_TABLE_GUID)) >> + ((efi_config_table_64_t *)p)->table = >> EFI_INVALID_TABLE_ADDR; >> + >> p += sz; >> } >> > > This would work, I am guessing it will have a similar effect to what I sent > last week in > https://lore.kernel.org/all/fd63613c-fd26-42de-b5ed-cc734f72e...@gmail.com/ > > I think it needs to be wrapped in ifdef CONFIG_X86_64. > IMO we should consider the 2 patches in this seri
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 13/01/2025 02:33, Dave Young wrote: > On Fri, 10 Jan 2025 at 18:54, Usama Arif wrote: >> >> >> >> On 10/01/2025 07:21, Ard Biesheuvel wrote: >>> On Thu, 9 Jan 2025 at 17:36, Usama Arif wrote: On 09/01/2025 15:45, Ard Biesheuvel wrote: > On Wed, 8 Jan 2025 at 23:00, Usama Arif wrote: >> >> The commit in [1] introduced a check to see if EFI memory attributes >> table was corrupted. It assumed that efi.memmap.nr_map remains >> constant, but it changes during late boot. >> Hence, the check is valid during cold boot, but not in the subsequent >> kexec boot. >> >> This is best explained with an exampled. At cold boot, for a test >> machine: >> efi.memmap.nr_map=91, >> memory_attributes_table->num_entries=48, >> desc_size = 48 >> Hence, the check introduced in [1] where 3x the size of the >> entire EFI memory map is a reasonable upper bound for the size of this >> table is valid. >> >> In late boot __efi_enter_virtual_mode calls 2 functions that updates >> efi.memmap.nr_map: >> - efi_map_regions which reduces the `count` of map entries >> (for e.g. if should_map_region returns false) and which is reflected >> in efi.memmap by __efi_memmap_init. >> At this point efi.memmap.nr_map becomes 46 in the test machine. >> - efi_free_boot_services which also reduces the number of memory regions >> available (for e.g. if md->type or md->attribute is not the right >> value). >> At this point efi.memmap.nr_map becomes 9 in the test machine. >> Hence when you kexec into a new kernel and pass efi.memmap, the >> paramaters that are compared are: >> efi.memmap.nr_map=9, >> memory_attributes_table->num_entries=48, >> desc_size = 48 >> where the check in [1] is no longer valid with such a low >> efi.memmap.nr_map >> as it was reduced due to efi_map_regions and efi_free_boot_services. >> >> A more appropriate check is to see if the description size reported by >> efi and memory attributes table is the same. >> >> [1] >> https://lore.kernel.org/all/20241031175822.2952471-2-ardb+...@google.com/ >> >> Fixes: 8fbe4c49c0cc ("efi/memattr: Ignore table if the size is clearly >> bogus") >> Reported-by: Breno Leitao >> Signed-off-by: Usama Arif >> --- >> drivers/firmware/efi/memattr.c | 16 ++-- >> 1 file changed, 6 insertions(+), 10 deletions(-) >> > > The more I think about this, the more I feel that kexec on x86 should > simply discard this table, and run with the firmware code RWX (which > is not the end of the world). By discard this table, do you mean kexec not use e820_table_firmware? >>> >>> No, I mean kexec ignores the memory attributes table. >>> Also a very basic question, what do you mean by run with the firmware RWX? >>> >>> The memory attributes table is an overlay for the EFI memory map that >>> describes which runtime code regions may be mapped with restricted >>> permissions. Without this table, everything will be mapped writable as >>> well as executable, but only in the EFI page tables, which are only >>> active when an EFI call is in progress. >>> >> >> Thanks for explaining! >> >> So basically get rid of memattr.c :) >> >> Do you mean get rid of it only for kexec, or not do it for any >> boot (including cold boot)? >> I do like this idea! I couldn't find this in the git history, >> but do you know if this was added in the linux kernel just >> because EFI spec added support for it, or if there was a >> specific security problem? >> > > Usama, can you try the patch below? > [ format is wrong due to webmail corruption. But if it works I can > send a formal patch later ] > > $ git diff arch/x86 > diff --git a/arch/x86/platform/efi/quirks.c b/arch/x86/platform/efi/quirks.c > index 846bf49f2508..58dc77c5210e 100644 > --- a/arch/x86/platform/efi/quirks.c > +++ b/arch/x86/platform/efi/quirks.c > @@ -561,6 +561,11 @@ int __init efi_reuse_config(u64 tables, int nr_tables) > > if (!efi_guidcmp(guid, SMBIOS_TABLE_GUID)) > ((efi_config_table_64_t *)p)->table = data->smbios; > + > + /* Not bother to play with mem attr table across kexec */ > + if (!efi_guidcmp(guid, EFI_MEMORY_ATTRIBUTES_TABLE_GUID)) > + ((efi_config_table_64_t *)p)->table = > EFI_INVALID_TABLE_ADDR; > + > p += sz; > } > This would work, I am guessing it will have a similar effect to what I sent last week in https://lore.kernel.org/all/fd63613c-fd26-42de-b5ed-cc734f72e...@gmail.com/ I think it needs to be wrapped in ifdef CONFIG_X86_64. -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121017): https://edk2.groups.io/g/devel/message/121017 Mute This Topic: https://groups.io/mt/110517813/21656 Grou
Re: [edk2-devel] [RFC 2/2] efi/memattr: add efi_mem_attr_table as a reserved region in 820_table_firmware
On 10/01/2025 11:20, Dave Young wrote: > On Fri, 10 Jan 2025 at 19:18, Dave Young wrote: >> >> On Fri, 10 Jan 2025 at 19:12, Usama Arif wrote: >>> >>> >>> >>> On 10/01/2025 02:50, Dave Young wrote: Hi Usama, On Thu, 9 Jan 2025 at 06:00, Usama Arif wrote: > > When this area is not reserved, it comes up as usable in > /sys/firmware/memmap. This means that kexec, which uses that memmap > to find usable memory regions, can select the region where > efi_mem_attr_table is and overwrite it and relocate_kernel. Is the attr table BOOT SERVICE DATA? If so, does efi_mem_reserve() work for you? Just refer to esrt.c. >>> >>> Hi Dave, >>> >>> Its a bit difficult to reproduce the problem and therefore test the fix, but >>> we are seeing it a lot in production. Ard proposed the same thing in >>> https://lore.kernel.org/all/6b4780a5-ada0-405e-9f0a-4d2186177...@gmail.com/ >>> but as I mentioned there, I dont think that efi_mem_reserve would help, >>> as efi_mem_reserve changes e820_table, while kexec looks at >>> /sys/firmware/memmap which uses e820_table_firmware. >> >> I sent a question to pm people, if the sysfs memmap comes from >> e820_table then it will be fine. Let's see: > s/e820_table/e820_table_kexec > Do you mean change /sys/firmware/memmap to point to e820_table_kexec instead of e820_table_firmware [1]? I thought of doing this when the first bug was encountered last year, but didn't do it as I thought it would be frowned upon to change what sysfs file exposes to userspace. [1] https://elixir.bootlin.com/linux/v6.12.6/source/arch/x86/kernel/e820.c#L31 >> https://lore.kernel.org/all/CALu+AoS-nk4u=9UYP7BLS=diOxjJRf+vfv7KHXG=uxozoya...@mail.gmail.com/ >> >>> >>> Thanks, >>> Usama >>> Thanks Dave >>> > -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121012): https://edk2.groups.io/g/devel/message/121012 Mute This Topic: https://groups.io/mt/110518541/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-
Re: [edk2-devel] [RFC 1/2] efi/memattr: Use desc_size instead of total size to check for corruption
On 10/01/2025 07:21, Ard Biesheuvel wrote: > On Thu, 9 Jan 2025 at 17:36, Usama Arif wrote: >> >> >> >> On 09/01/2025 15:45, Ard Biesheuvel wrote: >>> On Wed, 8 Jan 2025 at 23:00, Usama Arif wrote: The commit in [1] introduced a check to see if EFI memory attributes table was corrupted. It assumed that efi.memmap.nr_map remains constant, but it changes during late boot. Hence, the check is valid during cold boot, but not in the subsequent kexec boot. This is best explained with an exampled. At cold boot, for a test machine: efi.memmap.nr_map=91, memory_attributes_table->num_entries=48, desc_size = 48 Hence, the check introduced in [1] where 3x the size of the entire EFI memory map is a reasonable upper bound for the size of this table is valid. In late boot __efi_enter_virtual_mode calls 2 functions that updates efi.memmap.nr_map: - efi_map_regions which reduces the `count` of map entries (for e.g. if should_map_region returns false) and which is reflected in efi.memmap by __efi_memmap_init. At this point efi.memmap.nr_map becomes 46 in the test machine. - efi_free_boot_services which also reduces the number of memory regions available (for e.g. if md->type or md->attribute is not the right value). At this point efi.memmap.nr_map becomes 9 in the test machine. Hence when you kexec into a new kernel and pass efi.memmap, the paramaters that are compared are: efi.memmap.nr_map=9, memory_attributes_table->num_entries=48, desc_size = 48 where the check in [1] is no longer valid with such a low efi.memmap.nr_map as it was reduced due to efi_map_regions and efi_free_boot_services. A more appropriate check is to see if the description size reported by efi and memory attributes table is the same. [1] https://lore.kernel.org/all/20241031175822.2952471-2-ardb+...@google.com/ Fixes: 8fbe4c49c0cc ("efi/memattr: Ignore table if the size is clearly bogus") Reported-by: Breno Leitao Signed-off-by: Usama Arif --- drivers/firmware/efi/memattr.c | 16 ++-- 1 file changed, 6 insertions(+), 10 deletions(-) >>> >>> The more I think about this, the more I feel that kexec on x86 should >>> simply discard this table, and run with the firmware code RWX (which >>> is not the end of the world). >> >> >> By discard this table, do you mean kexec not use e820_table_firmware? > > No, I mean kexec ignores the memory attributes table. > >> Also a very basic question, what do you mean by run with the firmware RWX? >> > > The memory attributes table is an overlay for the EFI memory map that > describes which runtime code regions may be mapped with restricted > permissions. Without this table, everything will be mapped writable as > well as executable, but only in the EFI page tables, which are only > active when an EFI call is in progress. > Thanks for explaining! So basically get rid of memattr.c :) Do you mean get rid of it only for kexec, or not do it for any boot (including cold boot)? I do like this idea! I couldn't find this in the git history, but do you know if this was added in the linux kernel just because EFI spec added support for it, or if there was a specific security problem? Thanks! -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#121007): https://edk2.groups.io/g/devel/message/121007 Mute This Topic: https://groups.io/mt/110517813/21656 Group Owner: devel+ow...@edk2.groups.io Unsubscribe: https://edk2.groups.io/g/devel/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-