https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=282994
Bug ID: 282994
Summary: Repeated kernel panics
Product: Base System
Version: 14.2-RELEASE
Hardware: amd64
OS: Any
Status: New
Severity: Affects Some People
Priority: ---
Component: kern
Assignee: b...@freebsd.org
Reporter: jsml4thwwbid6...@protonmail.com

Hello,

I've run into an issue where multiple FreeBSD 14.1-p6 and 14.2-RC1 servers are crashing with a kernel panic and rebooting. This happens multiple times a day on each server. Sometimes it takes a few hours; sometimes it crashes within minutes of startup. Please see the 14.2-RC1 dump below.

--------------------------------------------------------------------------------
Reading symbols from /boot/kernel/kernel...
Reading symbols from /usr/lib/debug//boot/kernel/kernel.debug...

Unread portion of the kernel message buffer:

Fatal trap 12: page fault while in kernel mode
cpuid = 6; apic id = 12
fault virtual address = 0xd0
fault code = supervisor write data, page not present
instruction pointer = 0x20:0xffffffff80bdc2f2
stack pointer = 0x28:0xfffffe03505a1d30
frame pointer = 0x28:0xfffffe03505a1d60
code segment = base 0x0, limit 0xfffff, type 0x1b
 = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 81514 (php)
rdi: fffff8010000d900 rsi: fffff810d3d97210 rdx: fffff8150afe4c00
rcx: fffff8010000dd90 r8: 000000000000001a r9: 0000000000000000
rax: 0000000000000000 rbx: fffff80cc151c500 rbp: fffffe03505a1d60
r10: 0000000000000000 r11: 0000000000000001 r12: 0000000000000180
r13: 00000000f4cf7018 r14: fffff802fc69b800 r15: 0000000000000000
trap number = 12
panic: page fault
cpuid = 6
time = 1732641372
KDB: stack backtrace:
#0 0xffffffff80b8b89d at kdb_backtrace+0x5d
#1 0xffffffff80b3dc01 at vpanic+0x131
#2 0xffffffff80b3dac3 at panic+0x43
#3 0xffffffff81025a0b at trap_fatal+0x40b
#4 0xffffffff81025a56 at trap_pfault+0x46
#5 0xffffffff80ffc398 at calltrap+0x8
#6 0xffffffff80bdcbe3 at kern_shm_open2+0x443
#7 0xffffffff80bddc31 at sys_shm_open2+0x21
#8 0xffffffff810262c5 at amd64_syscall+0x115
#9 0xffffffff80ffccab at fast_syscall_common+0xf8
Uptime: 1h33m33s
Dumping 8536 out of 163689 MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91%

Reading symbols from /boot/kernel/accf_http.ko...
Reading symbols from /usr/lib/debug//boot/kernel/accf_http.ko.debug...
Reading symbols from /boot/kernel/accf_data.ko...
Reading symbols from /usr/lib/debug//boot/kernel/accf_data.ko.debug...
Reading symbols from /boot/kernel/zfs.ko...
Reading symbols from /usr/lib/debug//boot/kernel/zfs.ko.debug...
Reading symbols from /boot/kernel/accf_dns.ko...
Reading symbols from /usr/lib/debug//boot/kernel/accf_dns.ko.debug...
Reading symbols from /boot/kernel/geom_mirror.ko...
Reading symbols from /usr/lib/debug//boot/kernel/geom_mirror.ko.debug...
Reading symbols from /boot/kernel/cryptodev.ko...
Reading symbols from /usr/lib/debug//boot/kernel/cryptodev.ko.debug...
Reading symbols from /boot/kernel/mac_seeotheruids.ko...
Reading symbols from /usr/lib/debug//boot/kernel/mac_seeotheruids.ko.debug...
Reading symbols from /boot/kernel/coretemp.ko...
Reading symbols from /usr/lib/debug//boot/kernel/coretemp.ko.debug...
Reading symbols from /boot/kernel/mlx4en.ko...
Reading symbols from /usr/lib/debug//boot/kernel/mlx4en.ko.debug...
Reading symbols from /boot/kernel/mlx4.ko...
Reading symbols from /usr/lib/debug//boot/kernel/mlx4.ko.debug...
Reading symbols from /boot/kernel/pf.ko...
Reading symbols from /usr/lib/debug//boot/kernel/pf.ko.debug...
Reading symbols from /boot/kernel/fusefs.ko...
Reading symbols from /usr/lib/debug//boot/kernel/fusefs.ko.debug...
Reading symbols from /boot/kernel/acpi_wmi.ko...
Reading symbols from /usr/lib/debug//boot/kernel/acpi_wmi.ko.debug...
Reading symbols from /boot/kernel/if_lagg.ko...
Reading symbols from /usr/lib/debug//boot/kernel/if_lagg.ko.debug...
Reading symbols from /boot/kernel/if_infiniband.ko...
Reading symbols from /usr/lib/debug//boot/kernel/if_infiniband.ko.debug...
Reading symbols from /boot/kernel/if_bridge.ko...
Reading symbols from /usr/lib/debug//boot/kernel/if_bridge.ko.debug...
Reading symbols from /boot/kernel/bridgestp.ko...
Reading symbols from /usr/lib/debug//boot/kernel/bridgestp.ko.debug...
Reading symbols from /boot/kernel/mac_ntpd.ko...
Reading symbols from /usr/lib/debug//boot/kernel/mac_ntpd.ko.debug...
Reading symbols from /boot/kernel/nullfs.ko...
--Type <RET> for more, q to quit, c to continue without paging--
Reading symbols from /usr/lib/debug//boot/kernel/nullfs.ko.debug...
Reading symbols from /boot/kernel/if_epair.ko...
Reading symbols from /usr/lib/debug//boot/kernel/if_epair.ko.debug...
__curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:57
57 __asm("movq %%gs:%P1,%0" : "=r" (td) : "n" (offsetof(struct pcpu,
(kgdb) frame 8
#8 shm_alloc (ucred=0xfffff80cc151c500, mode=mode@entry=384, largepage=<optimized out>) at /usr/src/sys/kern/uipc_shm.c:954
954 obj->un_pager.swp.swp_priv = shmfd;
(kgdb) p vfs_hash_tbl
$1 = (struct vfs_hash_head *) 0xfffffe0109b12000
(kgdb) p mp
No symbol "mp" in current context.
(kgdb) p hash
No symbol "hash" in current context.
(kgdb) p *mp
No symbol "mp" in current context.
(kgdb)
------------------

The servers this is happening on all have a few things in common.

1. They all access data storage over MooseFS.
2. They all use fusefs-libs3-3.16.2 and moosefs3-client-3.0.117_1.
3. They all have many nullfs mounts (5k+).
4. All user processes are run inside standard thick jails.
5. They all use RCTL to control resources used by users.
6. They all run many hundreds of processes owned by different users. All the processes access data over the network, including the data files, which are on distributed storage.

Please let me know if there is anything I can do to help.

--
You are receiving this mail because:
You are the assignee for the bug.
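
Since several servers are affected, the "KDB: stack backtrace" listing in the dump above can be reduced to a short signature, which makes it easier to check whether every crash goes through the same code path. The following is only a minimal sketch, assuming the message buffer text is available as a string. The helper names `parse_kdb_backtrace` and `panic_signature` are hypothetical and not part of any FreeBSD tool, and the list of frames treated as generic panic/trap plumbing is an assumption based on this one dump.

```python
import re

# One "#N 0xADDR at symbol+0xOFF" entry from a "KDB: stack backtrace:" listing.
FRAME_RE = re.compile(r"#(\d+)\s+(0x[0-9a-f]+)\s+at\s+(\w+)\+(0x[0-9a-f]+)")

# Frames assumed to be generic panic/trap plumbing (taken from this dump only).
PLUMBING = ("kdb_backtrace", "vpanic", "panic",
            "trap_fatal", "trap_pfault", "calltrap")


def parse_kdb_backtrace(text):
    """Return a list of (frame_no, address, symbol, offset) tuples."""
    return [
        (int(n), int(addr, 16), sym, int(off, 16))
        for n, addr, sym, off in FRAME_RE.findall(text)
    ]


def panic_signature(text, skip=PLUMBING):
    """Return the first frame outside `skip` as 'symbol+0xoff', or None."""
    for _, _, sym, off in parse_kdb_backtrace(text):
        if sym not in skip:
            return f"{sym}+{off:#x}"
    return None


# A few frames copied from the backtrace in this report.
SAMPLE = """\
#5 0xffffffff80ffc398 at calltrap+0x8
#6 0xffffffff80bdcbe3 at kern_shm_open2+0x443
#7 0xffffffff80bddc31 at sys_shm_open2+0x21
"""

if __name__ == "__main__":
    print(panic_signature(SAMPLE))
```

Running the same helper over the message buffer of each crash dump and comparing the results would show whether all panics share the `kern_shm_open2` frame seen here or whether more than one failure is involved.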