> should it fail how can I get the log when my system completely locks up and dies?
My understanding of the bug is that it occurs when the NVIDIA driver tries to bring up the DisplayPort link on certain DisplayPort monitors. You should be able to drive the system in text mode with a failing driver, to capture the log. One way to do that would be to add "systemd.unit=multi-user.target" to the kernel command line. (When the bootloader menu comes up, edit the boot options and add this argument to the line that starts with 'linux'.) If you happen to have another system on the same network you can even try SSHing into the system that reproduces the problem, then start the graphical session manually with `sudo systemctl isolate graphical`, and if the crash doesn't take down the SSH session, you ought to be able to generate the log file while the problem is occurring. (Log files taken while a bug is actively being reproduced tend to be the most useful.) Depending on the exact nature of the crash, you may even be able to SSH into a system which has already crashed, without having to go to the effort of starting it in text mode first. > In that eventuality I've only been able to recover by booting an older kernel with the older drivers. Will that not invalidate the log content on restart? It may, or may not. nvidia-bug-report.sh attempts to capture logs from the previous boot, if they have been retained. If you do find that you are unable to capture a log file using a driver version which reproduces the problem, just make sure to note that you captured the log file after rebooting to a known good driver. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-460 in Ubuntu. https://bugs.launchpad.net/bugs/1930733 Title: Kernel oops with the 460.80 and 465.27 drivers when using DP, but not with HDMI Status in nvidia-graphics-drivers-460 package in Ubuntu: Triaged Status in nvidia-graphics-drivers-465 package in Ubuntu: Triaged Bug description: I get a kernel oops with the 460.80 and 465.27 drivers on Hirsute Jun 03 16:26:57 willow kernel: Oops: 0000 [#1] SMP PTI Jun 03 16:26:57 willow kernel: CPU: 7 PID: 2004 Comm: Xorg Tainted: P OE 5.11.0-18-generic #19-Ubuntu Jun 03 16:26:57 willow kernel: Hardware name: System manufacturer System Product Name/PRIME H270M-PLUS, BIOS 1605 12/13/2019 Jun 03 16:26:57 willow kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia] Jun 03 16:26:57 willow kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 00 00 00 e8 bf eb 55 c8 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8> Jun 03 16:26:57 willow kernel: RSP: 0000:ffffaf4201893958 EFLAGS: 00010297 Jun 03 16:26:57 willow kernel: RAX: 0000000000000000 RBX: 0000000000000400 RCX: 0000000000000003 Jun 03 16:26:57 willow kernel: RDX: 0000000000000004 RSI: 0000000000000003 RDI: 0000000000000000 Jun 03 16:26:57 willow kernel: RBP: ffff8e318220add0 R08: 0000000000000001 R09: ffff8e318220acb8 Jun 03 16:26:57 willow kernel: R10: ffff8e3182204008 R11: 0000000010100000 R12: 0000000000000400 Jun 03 16:26:57 willow kernel: R13: 0000000000000003 R14: ffff8e3186ca8010 R15: 0000000000000800 Jun 03 16:26:57 willow kernel: FS: 00007f5807f38a40(0000) GS:ffff8e3466dc0000(0000) knlGS:0000000000000000 Jun 03 16:26:57 willow kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jun 03 16:26:57 willow kernel: CR2: 0000000000000170 CR3: 0000000140710005 CR4: 00000000003706e0 Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Jun 03 16:26:57 willow kernel: Call Trace: Jun 03 16:26:57 willow kernel: ? _nv015556rm+0x7fd/0x1020 [nvidia] Jun 03 16:26:57 willow kernel: ? _nv027155rm+0x22c/0x4f0 [nvidia] Jun 03 16:26:57 willow kernel: ? _nv017787rm+0x303/0x5e0 [nvidia] Jun 03 16:26:57 willow kernel: ? _nv017789rm+0xe1/0x220 [nvidia] Jun 03 16:26:57 willow kernel: ? _nv022829rm+0xed/0x220 [nvidia] Jun 03 16:26:57 willow kernel: ? _nv023065rm+0x30/0x60 [nvidia] Jun 03 16:26:57 willow kernel: ? _nv000704rm+0x16da/0x22b0 [nvidia] Jun 03 16:26:57 willow kernel: ? rm_init_adapter+0xc5/0xe0 [nvidia] Jun 03 16:26:57 willow kernel: ? nv_open_device+0x122/0x8e0 [nvidia] Jun 03 16:26:57 willow kernel: ? nvidia_open+0x2b7/0x560 [nvidia] Jun 03 16:26:57 willow kernel: ? nvidia_frontend_open+0x58/0xa0 [nvidia] Jun 03 16:26:57 willow kernel: ? chrdev_open+0xf7/0x220 Jun 03 16:26:57 willow kernel: ? cdev_device_add+0x90/0x90 Jun 03 16:26:57 willow kernel: ? do_dentry_open+0x156/0x370 Jun 03 16:26:57 willow kernel: ? vfs_open+0x2d/0x30 Jun 03 16:26:57 willow kernel: ? do_open+0x1c3/0x340 Jun 03 16:26:57 willow kernel: ? path_openat+0x10a/0x1d0 Jun 03 16:26:57 willow kernel: ? do_filp_open+0x8c/0x130 Jun 03 16:26:57 willow kernel: ? __check_object_size+0x1c/0x20 Jun 03 16:26:57 willow kernel: ? do_sys_openat2+0x9b/0x150 Jun 03 16:26:57 willow kernel: ? __x64_sys_openat+0x56/0x90 Jun 03 16:26:57 willow kernel: ? do_syscall_64+0x38/0x90 Jun 03 16:26:57 willow kernel: ? entry_SYSCALL_64_after_hwframe+0x44/0xa9 Jun 03 16:26:57 willow kernel: Modules linked in: snd_seq_dummy snd_hrtimer vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) binfmt_misc zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) > Jun 03 16:26:57 willow kernel: sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq libcrc32c hid_generic usbhid hid crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd c> Jun 03 16:26:57 willow kernel: CR2: 0000000000000170 Jun 03 16:26:57 willow kernel: ---[ end trace 0013b6989b267f32 ]--- Jun 03 16:26:57 willow kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia] Jun 03 16:26:57 willow kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 00 00 00 e8 bf eb 55 c8 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8> Jun 03 16:26:57 willow kernel: RSP: 0000:ffffaf4201893958 EFLAGS: 00010297 Jun 03 16:26:57 willow kernel: RAX: 0000000000000000 RBX: 0000000000000400 RCX: 0000000000000003 Jun 03 16:26:57 willow kernel: RDX: 0000000000000004 RSI: 0000000000000003 RDI: 0000000000000000 Jun 03 16:26:57 willow kernel: RBP: ffff8e318220add0 R08: 0000000000000001 R09: ffff8e318220acb8 Jun 03 16:26:57 willow kernel: R10: ffff8e3182204008 R11: 0000000010100000 R12: 0000000000000400 Jun 03 16:26:57 willow kernel: R13: 0000000000000003 R14: ffff8e3186ca8010 R15: 0000000000000800 Jun 03 16:26:57 willow kernel: FS: 00007f5807f38a40(0000) GS:ffff8e3466dc0000(0000) knlGS:0000000000000000 Jun 03 16:26:57 willow kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jun 03 16:26:57 willow kernel: CR2: 0000000000000170 CR3: 0000000140710005 CR4: 00000000003706e0 Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Jun 03 16:26:57 willow kernel: general protection fault, probably for non-canonical address 0xc483480f75000000: 0000 [#2] SMP PTI Jun 03 16:26:57 willow kernel: CPU: 3 PID: 2004 Comm: Xorg Tainted: P D OE 5.11.0-18-generic #19-Ubuntu Jun 03 16:26:57 willow kernel: Hardware name: System manufacturer System Product Name/PRIME H270M-PLUS, BIOS 1605 12/13/2019 Jun 03 16:26:57 willow kernel: RIP: 0010:_nv009368rm+0x3c/0x340 [nvidia] Jun 03 16:26:57 willow kernel: Code: 07 0f 1f 44 00 00 31 d2 48 8b 07 48 85 c0 75 1a e9 a1 02 00 00 66 0f 1f 84 00 00 00 00 00 48 8b 48 10 48 85 c9 74 17 48 89 c8 <48> 39 30 77 ef 0f 83 29 02 00 00 48 8b 48 18> Jun 03 16:26:57 willow kernel: RSP: 0018:ffffaf4201893d50 EFLAGS: 00010086 Jun 03 16:26:57 willow kernel: RAX: c483480f75000000 RBX: ffffaf4201893d98 RCX: c483480f75000000 Jun 03 16:26:57 willow kernel: RDX: ffffaf4201893de8 RSI: 00000000000007d4 RDI: ffffffffc2b9f6d8 Jun 03 16:26:57 willow kernel: RBP: ffff8e3143782ff0 R08: 0000000000000001 R09: 0000000000000000 Jun 03 16:26:57 willow kernel: R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000 Jun 03 16:26:57 willow kernel: R13: ffffffffc2b9fec0 R14: ffffaf4201893e80 R15: ffffffffc2b9cb00 Jun 03 16:26:57 willow kernel: FS: 0000000000000000(0000) GS:ffff8e3466cc0000(0000) knlGS:0000000000000000 Jun 03 16:26:57 willow kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jun 03 16:26:57 willow kernel: CR2: 00007fa1b0004038 CR3: 00000001105e6006 CR4: 00000000003706e0 Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Jun 03 16:26:57 willow kernel: Call Trace: Jun 03 16:26:57 willow kernel: ? _nv039616rm+0xdf/0x1e0 [nvidia] Jun 03 16:26:57 willow kernel: ? rm_cleanup_file_private+0x42/0x140 [nvidia] Jun 03 16:26:57 willow kernel: ? os_free_mem+0x22/0x30 [nvidia] Jun 03 16:26:57 willow kernel: ? nvidia_close+0x156/0x320 [nvidia] Jun 03 16:26:57 willow kernel: ? nvidia_frontend_close+0x2f/0x50 [nvidia] Jun 03 16:26:57 willow kernel: ? __fput+0x9f/0x250 Jun 03 16:26:57 willow kernel: ? ____fput+0xe/0x10 Jun 03 16:26:57 willow kernel: ? task_work_run+0x6d/0xa0 Jun 03 16:26:57 willow kernel: ? do_exit+0x233/0x3e0 Jun 03 16:26:57 willow kernel: ? rewind_stack_do_exit+0x17/0x20 Jun 03 16:26:57 willow kernel: Modules linked in: snd_seq_dummy snd_hrtimer vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) binfmt_misc zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) > Jun 03 16:26:57 willow kernel: sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq libcrc32c hid_generic usbhid hid crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd c> Jun 03 16:26:57 willow kernel: ---[ end trace 0013b6989b267f33 ]--- Jun 03 16:26:57 willow kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia] Jun 03 16:26:57 willow kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 00 00 00 e8 bf eb 55 c8 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8> Jun 03 16:26:57 willow kernel: RSP: 0000:ffffaf4201893958 EFLAGS: 00010297 Jun 03 16:26:57 willow kernel: RAX: 0000000000000000 RBX: 0000000000000400 RCX: 0000000000000003 Jun 03 16:26:57 willow kernel: RDX: 0000000000000004 RSI: 0000000000000003 RDI: 0000000000000000 Jun 03 16:26:57 willow kernel: RBP: ffff8e318220add0 R08: 0000000000000001 R09: ffff8e318220acb8 Jun 03 16:26:57 willow kernel: R10: ffff8e3182204008 R11: 0000000010100000 R12: 0000000000000400 Jun 03 16:26:57 willow kernel: R13: 0000000000000003 R14: ffff8e3186ca8010 R15: 0000000000000800 Jun 03 16:26:57 willow kernel: FS: 0000000000000000(0000) GS:ffff8e3466cc0000(0000) knlGS:0000000000000000 Jun 03 16:26:57 willow kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jun 03 16:26:57 willow kernel: CR2: 00007fa1b0004038 CR3: 00000001105e6006 CR4: 00000000003706e0 Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Jun 03 16:26:57 willow kernel: Fixing recursive fault but reboot is needed! switching to a tty or login with ssh does not work at that point. Installing the nvidia-driver-450-server from advanced mode got me a desktop back. I found this upstream bugreport: https://forums.developer.nvidia.com/t/465-24-02-page-fault/175782/62 where people on different distributions have this problem, and it might be related to DP. My monitor is connected with DP. From lspci: 01:00.0 VGA compatible controller: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] (rev a1) I also have a laptop with a 1050ti where I can use these drivers on 21.04 without problem, while only the inbuilt screen is connected. I will try the laptop with DP, and my desktop with HDMI later ProblemType: Bug DistroRelease: Ubuntu 21.04 Package: nvidia-driver-465 (not installed) ProcVersionSignature: Ubuntu 5.11.0-18.19-generic 5.11.17 Uname: Linux 5.11.0-18-generic x86_64 NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair nvidia_modeset nvidia ApportVersion: 2.20.11-0ubuntu65.1 Architecture: amd64 CasperMD5CheckResult: unknown CurrentDesktop: ubuntu:GNOME Date: Thu Jun 3 18:14:16 2021 EcryptfsInUse: Yes InstallationDate: Installed on 2013-02-22 (3022 days ago) InstallationMedia: Ubuntu 12.10 "Quantal Quetzal" - Release amd64 (20121017.5) SourcePackage: nvidia-graphics-drivers-465 UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/nvidia-graphics-drivers-460/+bug/1930733/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp