On Fri, Sep 13, 2024 at 10:30:56AM +0200, Christophe Leroy wrote:
>
>
> Le 13/09/2024 à 09:38, Luming Yu a écrit :
> > On Fri, Sep 13, 2024 at 08:54:12AM +0200, Christophe Leroy wrote:
> > >
> > >
> > > Le 13/09/2024 à 03:40, Luming Yu a écrit :
> > > > On Thu, Sep 12, 2024 at 12:23:29PM +0200, Christophe Leroy wrote:
> > > > >
> > > > >
> > > > > Le 12/09/2024 à 10:24, Luming Yu a écrit :
> > > > > > From: Yu Luming <luming...@gmail.com>
> > > > > >
> > > > > > convert powerpc entry code in syscall and fault to use syscall_work
> > > > > > and irqentry_state as well as common calls from generic entry
> > > > > > infrastructure.
> > > > > >
> > > > > > Signed-off-by: Luming Yu <luming...@shingroup.cn>
> > > > > > ---
> > > > > > arch/powerpc/Kconfig | 1 +
> > > > > > arch/powerpc/include/asm/hw_irq.h | 5 +++++
> > > > > > arch/powerpc/include/asm/processor.h | 6 ++++++
> > > > > > arch/powerpc/include/asm/syscall.h | 5 +++++
> > > > > > arch/powerpc/include/asm/thread_info.h | 1 +
> > > > > > arch/powerpc/kernel/syscall.c | 6 +++++-
> > > > > > arch/powerpc/mm/fault.c | 5 +++++
> > > > > > 7 files changed, 28 insertions(+), 1 deletion(-)
> > > > >
> > > > > There is another build problem:
> > > > >
> > > > > CC kernel/entry/common.o
> > > > > kernel/entry/common.c: In function 'irqentry_exit':
> > > > > kernel/entry/common.c:335:21: error: implicit declaration of function
> > > > > 'regs_irqs_disabled'; did you mean 'raw_irqs_disabled'?
> > > > > [-Werror=implicit-function-declaration]
> > > > > 335 | } else if (!regs_irqs_disabled(regs)) {
> > > > > | ^~~~~~~~~~~~~~~~~~
> > > > > | raw_irqs_disabled
> > > > >
> > > > >
> > > > > You have put regs_irqs_disabled() in a section dedicated to PPC64, so
> > > > > it
> > > > > fails on PPC32.
> > > > >
> > > > >
> > > > > After fixing this problem and providing an empty asm/entry-common.h
> > > > > it is
> > > > > now possible to build the kernel. But that's not enough, the board is
> > > > > stuck
> > > > > after:
> > > > >
> > > > > ...
> > > > > [ 2.871391] Freeing unused kernel image (initmem) memory: 1228K
> > > > > [ 2.877990] Run /init as init process
> > > >
> > > > Thanks for these questions. :-)
> > > > I haven't gotten chance to run it in ppc32 qemu.
> > > > the common syscall trace enter lost this hunk
> > > > - if (!is_32bit_task())
> > > > - audit_syscall_entry(regs->gpr[0], regs->gpr[3],
> > > > regs->gpr[4],
> > > > - regs->gpr[5], regs->gpr[6]);
> > > > - else
> > > > - audit_syscall_entry(regs->gpr[0],
> > > > - regs->gpr[3] & 0xffffffff,
> > > > - regs->gpr[4] & 0xffffffff,
> > > > - regs->gpr[5] & 0xffffffff,
> > > > - regs->gpr[6] & 0xffffffff);
> > > > which I don't understand whether we need a arch callbacks for it.
> > >
> > > I don't thing so.
> > >
> > > As far as I can see, audit_syscall_entry() is called by
> > > syscall_enter_audit() in kernel/entry/common.c
> > >
> > > And the masking of arguments based on is_32bit_task() is done in
> > > syscall_get_arguments() with is called by
> > > syscall_enter_audit() just before calling audit_syscall_entry() and which
> > > is
> > > an arch callback that does the same as the removed hunk.
> > so, syscall_get_arguments is the ppc arch callback. thanks. :-)
> > > >
> > > > Before I sent out the RFC patch set, the very limited compile and boot
> > > > test goes well with a ppc64 qemu VM. Surely, there will be a lot of
> > > > test, debug and following up patch set update that is necessary to make
> > > > it a complete convert.
> > >
> > > Even on ppc64 it doesn't build, at the first place because
> > > arch/powerpc/include/asm/entry-common.h is missing in your patch. Did you
> > > forget to 'git add' it ?
> > oh, I forget that I was testing this patch on top of the early user
> > notifier patch:
> > https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Flinuxppc%2Fissues%2Fissues%2F477&data=05%7C02%7Cchristophe.leroy%40csgroup.eu%7C35a08ca9a81f4c6ff8ce08dcd3c73555%7C8b87af7d86474dc78df45f69a2011bb5%7C0%7C0%7C638618099770810941%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=yCQWLIAXL%2BNHnzrh0e91WIBvF0c5WfF6pY6ZMHstocA%3D&reserved=0,
> >
> > https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fpatchwork.ozlabs.org%2Fproject%2Flinuxppc-dev%2Fpatch%2F1FD36D52828D2506%2B20231218031309.2063-1-luming.yu%40shingroup.cn%2F&data=05%7C02%7Cchristophe.leroy%40csgroup.eu%7C35a08ca9a81f4c6ff8ce08dcd3c73555%7C8b87af7d86474dc78df45f69a2011bb5%7C0%7C0%7C638618099770819779%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=0WBSUlpAbL6EMdPEAtJv1HUHkbeUjjUcP98wYf9IxM4%3D&reserved=0
> > and the entry-common.h is as follows:
> > [root@localhost linux]# cat arch/powerpc/include/asm/entry-common.h
> > /* SPDX-License-Identifier: GPL-2.0 */
> > #ifndef ARCH_POWERPC_ENTRY_COMMON_H
> > #define ARCH_POWERPC_ENTRY_COMMON_H
> >
> > #include <linux/user-return-notifier.h>
> >
> > static inline void arch_exit_to_user_mode_prepare(struct pt_regs *regs,
> > unsigned long ti_work)
> > {
> > if (ti_work & _TIF_USER_RETURN_NOTIFY)
> > fire_user_return_notifiers();
> > }
> >
> > #define arch_exit_to_user_mode_prepare arch_exit_to_user_mode_prepare
> >
> > #endif
> >
> > As you could see , it looks irrelevant.
> > >
> > > And same as with PPC32, when I build PPC64 with an empty
> > > asm/entry-common.h,
> > > it doesn't work. So, I guess you had some needed code in that file and you
> > > have to send it.
> >
> > please send me your kernel config and let me reproduce and double check
> > if there could be another bit of code make the difference. My test p8 64bit
> > VM boots
> > an unmodified fedora 38 user space just fine with the patch.
> > After boot, the only difference is make install of kernel could hang for a
> > while, that
> > I'm not sure how that could happen yet. and I plan to do some unit test for
> > many common
> > features like : lockdep, ptrace, rcu, audit...
> > >
> >
>
>
> I tested with both ppc64_defconfig and ppc64le_defconfig. With those two
> configs QEMU remains stuck after the following lines. I use QEMU with
> machine pseries:
>
> [ 2.701874][ T1] Running code patching self-tests ...
> [ 2.734007][ T1] registered taskstats version 1
> [ 2.735891][ T1] Loading compiled-in X.509 certificates
> [ 2.756472][ T1] Loaded X.509 cert 'Build time autogenerated kernel
> key: 003f25410c2285abf44b4685b6d63682a2dc9f11'
> [ 2.784474][ T1] Demotion targets for Node 0: null
> [ 2.784748][ T1] page_owner is disabled
> [ 2.786795][ T1] Secure boot mode disabled
> [ 2.787436][ T1] ima: No TPM chip found, activating TPM-bypass!
> [ 2.787568][ T1] Loading compiled-in module X.509 certificates
> [ 2.790854][ T1] Loaded X.509 cert 'Build time autogenerated kernel
> key: 003f25410c2285abf44b4685b6d63682a2dc9f11'
> [ 2.791289][ T1] ima: Allocated hash algorithm: sha256
> [ 2.796260][ T1] Secure boot mode disabled
> [ 2.796448][ T1] Trusted boot mode disabled
> [ 2.796560][ T1] ima: No architecture policies found
> [ 2.799580][ T1] printk: legacy console [netcon0] enabled
> [ 2.799696][ T1] netconsole: network logging started
> [ 2.880282][ T1] Freeing unused kernel image (initmem) memory: 6528K
> [ 2.884618][ T1] Checked W+X mappings: passed, no W+X pages found
> [ 2.885366][ T1] rodata_test: all tests were successful
> [ 2.885749][ T1] Run /init as init process
> [ 3.214730][ T58] mount (58) used greatest stack depth: 28672 bytes
> left
> ...
> nothing happens after that.
reproduced with ppc64_defconfig
[ 0.818972][ T1] Run /init as init process
[ 5.851684][ T240] request_module: modprobe binfmt-4c46 cannot be
processed, kmod busy with 50 threads for more than 5 seconds now
[ 5.851742][ T240] kworker/u33:18 (240) used greatest stack depth: 13584
bytes left
[ 5.860081][ T232] kworker/u33:16 (232) used greatest stack depth: 13072
bytes left
[ 5.863145][ T210] kworker/u35:13 (210) used greatest stack depth: 12928
bytes left
[ 5.865000][ T1] Failed to execute /init (error -8)
[ 5.868897][ T1] Run /sbin/init as init process
[ 10.891673][ T315] request_module: modprobe binfmt-4c46 cannot be
processed, kmod busy with 50 threads for more than 5 seconds now
[ 10.894036][ T1] Starting init: /sbin/init exists but couldn't execute it
(error -8)
[ 10.901455][ T1] Run /etc/init as init process
[ 10.903154][ T1] Run /bin/init as init process
[ 10.904747][ T1] Run /bin/sh as init process
[ 15.931679][ T367] request_module: modprobe binfmt-4c46 cannot be
processed, kmod busy with 50 threads for more than 5 seconds now
[ 15.934689][ T1] Starting init: /bin/sh exists but couldn't execute it
(error -8)
[ 15.940766][ T1] Kernel panic - not syncing: No working init found. Try
passing init= option to kernel. See Linux Documentation/admin-guide/init.rst
for guidance.
[ 15.946981][ T1] CPU: 13 PID: 1 Comm: swapper/0 Not tainted 6.7.0-rc1 #24
[ 15.950050][ T1] Hardware name: IBM pSeries (emulated by qemu) POWER8
(raw) 0x4d0200 0xf000004 of:SLOF,git-3a259d hv:linux,kvm pSeries
[ 15.955633][ T1] Call Trace:
[ 15.956714][ T1] [c0000000080fbd20] [c0000000011811e8]
dump_stack_lvl+0x6c/0x9c (unreliable)
[ 15.959302][ T1] [c0000000080fbd50] [c00000000016b654] panic+0x178/0x424
[ 15.961929][ T1] [c0000000080fbdf0] [c00000000001048c]
kernel_init+0x19c/0x1a0
[ 15.965310][ T1] [c0000000080fbe50] [c00000000000d394]
ret_from_kernel_user_thread+0x14/0x1c
[ 15.968494][ T1] --- interrupt: 0 at 0x0
[ 15.989760][ T1] pstore: backend (nvram) writing error (-1)
qemu-system-ppc64: OS terminated: `
[root@localhost home]#
It would take some time to debug out from the issue. Stay tuned.
Thanks very much!
>
>
> For PPC32 I use pmac32_defconfig and QEMU machine mac99
>
> Christophe
>