> should it fail how can I get the log when my system completely locks
up and dies?

My understanding of the bug is that it occurs when the NVIDIA driver
tries to bring up the DisplayPort link on certain DisplayPort monitors.
You should be able to drive the system in text mode with a failing
driver, to capture the log. One way to do that would be to add
"systemd.unit=multi-user.target" to the kernel command line. (When the
bootloader menu comes up, edit the boot options and add this argument to
the line that starts with 'linux'.) If you happen to have another system
on the same network you can even try SSHing into the system that
reproduces the problem, then start the graphical session manually with
`sudo systemctl isolate graphical`, and if the crash doesn't take down
the SSH session, you ought to be able to generate the log file while the
problem is occurring. (Log files taken while a bug is actively being
reproduced tend to be the most useful.) Depending on the exact nature of
the crash, you may even be able to SSH into a system which has already
crashed, without having to go to the effort of starting it in text mode
first.

> In that eventuality I've only been able to recover by booting an older
kernel with the older drivers. Will that not invalidate the log content
on restart?

It may, or may not. nvidia-bug-report.sh attempts to capture logs from
the previous boot, if they have been retained. If you do find that you
are unable to capture a log file using a driver version which reproduces
the problem, just make sure to note that you captured the log file after
rebooting to a known good driver.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-460 in Ubuntu.
https://bugs.launchpad.net/bugs/1930733

Title:
  Kernel oops with the 460.80 and 465.27 drivers when using DP, but not
  with HDMI

Status in nvidia-graphics-drivers-460 package in Ubuntu:
  Triaged
Status in nvidia-graphics-drivers-465 package in Ubuntu:
  Triaged

Bug description:
  I get a kernel oops with the 460.80 and 465.27 drivers on Hirsute

  Jun 03 16:26:57 willow kernel: Oops: 0000 [#1] SMP PTI
  Jun 03 16:26:57 willow kernel: CPU: 7 PID: 2004 Comm: Xorg Tainted: P         
  OE     5.11.0-18-generic #19-Ubuntu
  Jun 03 16:26:57 willow kernel: Hardware name: System manufacturer System 
Product Name/PRIME H270M-PLUS, BIOS 1605 12/13/2019
  Jun 03 16:26:57 willow kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia]
  Jun 03 16:26:57 willow kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 
00 00 00 e8 bf eb 55 c8 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 
44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8>
  Jun 03 16:26:57 willow kernel: RSP: 0000:ffffaf4201893958 EFLAGS: 00010297
  Jun 03 16:26:57 willow kernel: RAX: 0000000000000000 RBX: 0000000000000400 
RCX: 0000000000000003
  Jun 03 16:26:57 willow kernel: RDX: 0000000000000004 RSI: 0000000000000003 
RDI: 0000000000000000
  Jun 03 16:26:57 willow kernel: RBP: ffff8e318220add0 R08: 0000000000000001 
R09: ffff8e318220acb8
  Jun 03 16:26:57 willow kernel: R10: ffff8e3182204008 R11: 0000000010100000 
R12: 0000000000000400
  Jun 03 16:26:57 willow kernel: R13: 0000000000000003 R14: ffff8e3186ca8010 
R15: 0000000000000800
  Jun 03 16:26:57 willow kernel: FS:  00007f5807f38a40(0000) 
GS:ffff8e3466dc0000(0000) knlGS:0000000000000000
  Jun 03 16:26:57 willow kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 
0000000080050033
  Jun 03 16:26:57 willow kernel: CR2: 0000000000000170 CR3: 0000000140710005 
CR4: 00000000003706e0
  Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 
DR2: 0000000000000000
  Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 
DR7: 0000000000000400
  Jun 03 16:26:57 willow kernel: Call Trace:
  Jun 03 16:26:57 willow kernel:  ? _nv015556rm+0x7fd/0x1020 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? _nv027155rm+0x22c/0x4f0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? _nv017787rm+0x303/0x5e0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? _nv017789rm+0xe1/0x220 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? _nv022829rm+0xed/0x220 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? _nv023065rm+0x30/0x60 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? _nv000704rm+0x16da/0x22b0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? rm_init_adapter+0xc5/0xe0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? nv_open_device+0x122/0x8e0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? nvidia_open+0x2b7/0x560 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? nvidia_frontend_open+0x58/0xa0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? chrdev_open+0xf7/0x220
  Jun 03 16:26:57 willow kernel:  ? cdev_device_add+0x90/0x90
  Jun 03 16:26:57 willow kernel:  ? do_dentry_open+0x156/0x370
  Jun 03 16:26:57 willow kernel:  ? vfs_open+0x2d/0x30
  Jun 03 16:26:57 willow kernel:  ? do_open+0x1c3/0x340
  Jun 03 16:26:57 willow kernel:  ? path_openat+0x10a/0x1d0
  Jun 03 16:26:57 willow kernel:  ? do_filp_open+0x8c/0x130
  Jun 03 16:26:57 willow kernel:  ? __check_object_size+0x1c/0x20
  Jun 03 16:26:57 willow kernel:  ? do_sys_openat2+0x9b/0x150
  Jun 03 16:26:57 willow kernel:  ? __x64_sys_openat+0x56/0x90
  Jun 03 16:26:57 willow kernel:  ? do_syscall_64+0x38/0x90
  Jun 03 16:26:57 willow kernel:  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
  Jun 03 16:26:57 willow kernel: Modules linked in: snd_seq_dummy snd_hrtimer 
vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) binfmt_misc zfs(PO) zunicode(PO) 
zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) >
  Jun 03 16:26:57 willow kernel:  sunrpc ip_tables x_tables autofs4 btrfs 
blake2b_generic xor raid6_pq libcrc32c hid_generic usbhid hid crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd c>
  Jun 03 16:26:57 willow kernel: CR2: 0000000000000170
  Jun 03 16:26:57 willow kernel: ---[ end trace 0013b6989b267f32 ]---
  Jun 03 16:26:57 willow kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia]
  Jun 03 16:26:57 willow kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 
00 00 00 e8 bf eb 55 c8 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 
44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8>
  Jun 03 16:26:57 willow kernel: RSP: 0000:ffffaf4201893958 EFLAGS: 00010297
  Jun 03 16:26:57 willow kernel: RAX: 0000000000000000 RBX: 0000000000000400 
RCX: 0000000000000003
  Jun 03 16:26:57 willow kernel: RDX: 0000000000000004 RSI: 0000000000000003 
RDI: 0000000000000000
  Jun 03 16:26:57 willow kernel: RBP: ffff8e318220add0 R08: 0000000000000001 
R09: ffff8e318220acb8
  Jun 03 16:26:57 willow kernel: R10: ffff8e3182204008 R11: 0000000010100000 
R12: 0000000000000400
  Jun 03 16:26:57 willow kernel: R13: 0000000000000003 R14: ffff8e3186ca8010 
R15: 0000000000000800
  Jun 03 16:26:57 willow kernel: FS:  00007f5807f38a40(0000) 
GS:ffff8e3466dc0000(0000) knlGS:0000000000000000
  Jun 03 16:26:57 willow kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 
0000000080050033
  Jun 03 16:26:57 willow kernel: CR2: 0000000000000170 CR3: 0000000140710005 
CR4: 00000000003706e0
  Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 
DR2: 0000000000000000
  Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 
DR7: 0000000000000400
  Jun 03 16:26:57 willow kernel: general protection fault, probably for 
non-canonical address 0xc483480f75000000: 0000 [#2] SMP PTI
  Jun 03 16:26:57 willow kernel: CPU: 3 PID: 2004 Comm: Xorg Tainted: P      D  
  OE     5.11.0-18-generic #19-Ubuntu
  Jun 03 16:26:57 willow kernel: Hardware name: System manufacturer System 
Product Name/PRIME H270M-PLUS, BIOS 1605 12/13/2019
  Jun 03 16:26:57 willow kernel: RIP: 0010:_nv009368rm+0x3c/0x340 [nvidia]
  Jun 03 16:26:57 willow kernel: Code: 07 0f 1f 44 00 00 31 d2 48 8b 07 48 85 
c0 75 1a e9 a1 02 00 00 66 0f 1f 84 00 00 00 00 00 48 8b 48 10 48 85 c9 74 17 
48 89 c8 <48> 39 30 77 ef 0f 83 29 02 00 00 48 8b 48 18>
  Jun 03 16:26:57 willow kernel: RSP: 0018:ffffaf4201893d50 EFLAGS: 00010086
  Jun 03 16:26:57 willow kernel: RAX: c483480f75000000 RBX: ffffaf4201893d98 
RCX: c483480f75000000
  Jun 03 16:26:57 willow kernel: RDX: ffffaf4201893de8 RSI: 00000000000007d4 
RDI: ffffffffc2b9f6d8
  Jun 03 16:26:57 willow kernel: RBP: ffff8e3143782ff0 R08: 0000000000000001 
R09: 0000000000000000
  Jun 03 16:26:57 willow kernel: R10: 0000000000000001 R11: 0000000000000000 
R12: 0000000000000000
  Jun 03 16:26:57 willow kernel: R13: ffffffffc2b9fec0 R14: ffffaf4201893e80 
R15: ffffffffc2b9cb00
  Jun 03 16:26:57 willow kernel: FS:  0000000000000000(0000) 
GS:ffff8e3466cc0000(0000) knlGS:0000000000000000
  Jun 03 16:26:57 willow kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 
0000000080050033
  Jun 03 16:26:57 willow kernel: CR2: 00007fa1b0004038 CR3: 00000001105e6006 
CR4: 00000000003706e0
  Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 
DR2: 0000000000000000
  Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 
DR7: 0000000000000400
  Jun 03 16:26:57 willow kernel: Call Trace:
  Jun 03 16:26:57 willow kernel:  ? _nv039616rm+0xdf/0x1e0 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? rm_cleanup_file_private+0x42/0x140 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? os_free_mem+0x22/0x30 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? nvidia_close+0x156/0x320 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? nvidia_frontend_close+0x2f/0x50 [nvidia]
  Jun 03 16:26:57 willow kernel:  ? __fput+0x9f/0x250
  Jun 03 16:26:57 willow kernel:  ? ____fput+0xe/0x10
  Jun 03 16:26:57 willow kernel:  ? task_work_run+0x6d/0xa0
  Jun 03 16:26:57 willow kernel:  ? do_exit+0x233/0x3e0
  Jun 03 16:26:57 willow kernel:  ? rewind_stack_do_exit+0x17/0x20
  Jun 03 16:26:57 willow kernel: Modules linked in: snd_seq_dummy snd_hrtimer 
vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) binfmt_misc zfs(PO) zunicode(PO) 
zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) >
  Jun 03 16:26:57 willow kernel:  sunrpc ip_tables x_tables autofs4 btrfs 
blake2b_generic xor raid6_pq libcrc32c hid_generic usbhid hid crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd c>
  Jun 03 16:26:57 willow kernel: ---[ end trace 0013b6989b267f33 ]---
  Jun 03 16:26:57 willow kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia]
  Jun 03 16:26:57 willow kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 
00 00 00 e8 bf eb 55 c8 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 
44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8>
  Jun 03 16:26:57 willow kernel: RSP: 0000:ffffaf4201893958 EFLAGS: 00010297
  Jun 03 16:26:57 willow kernel: RAX: 0000000000000000 RBX: 0000000000000400 
RCX: 0000000000000003
  Jun 03 16:26:57 willow kernel: RDX: 0000000000000004 RSI: 0000000000000003 
RDI: 0000000000000000
  Jun 03 16:26:57 willow kernel: RBP: ffff8e318220add0 R08: 0000000000000001 
R09: ffff8e318220acb8
  Jun 03 16:26:57 willow kernel: R10: ffff8e3182204008 R11: 0000000010100000 
R12: 0000000000000400
  Jun 03 16:26:57 willow kernel: R13: 0000000000000003 R14: ffff8e3186ca8010 
R15: 0000000000000800
  Jun 03 16:26:57 willow kernel: FS:  0000000000000000(0000) 
GS:ffff8e3466cc0000(0000) knlGS:0000000000000000
  Jun 03 16:26:57 willow kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 
0000000080050033
  Jun 03 16:26:57 willow kernel: CR2: 00007fa1b0004038 CR3: 00000001105e6006 
CR4: 00000000003706e0
  Jun 03 16:26:57 willow kernel: DR0: 0000000000000000 DR1: 0000000000000000 
DR2: 0000000000000000
  Jun 03 16:26:57 willow kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 
DR7: 0000000000000400
  Jun 03 16:26:57 willow kernel: Fixing recursive fault but reboot is needed!

  
  switching to a tty or login with ssh does not work at that point.

  Installing the nvidia-driver-450-server from advanced mode got me a
  desktop back.

  I found this upstream bugreport:
  https://forums.developer.nvidia.com/t/465-24-02-page-fault/175782/62

  where people on different distributions have this problem, and it
  might be related to DP. My monitor is connected with DP.

  From lspci:

  01:00.0 VGA compatible controller: NVIDIA Corporation GP107 [GeForce
  GTX 1050 Ti] (rev a1)

  I also have a laptop with a 1050ti where I can use these drivers on
  21.04 without problem, while only the inbuilt screen is connected. I
  will try the laptop with DP, and my desktop with HDMI later

  ProblemType: Bug
  DistroRelease: Ubuntu 21.04
  Package: nvidia-driver-465 (not installed)
  ProcVersionSignature: Ubuntu 5.11.0-18.19-generic 5.11.17
  Uname: Linux 5.11.0-18-generic x86_64
  NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair nvidia_modeset 
nvidia
  ApportVersion: 2.20.11-0ubuntu65.1
  Architecture: amd64
  CasperMD5CheckResult: unknown
  CurrentDesktop: ubuntu:GNOME
  Date: Thu Jun  3 18:14:16 2021
  EcryptfsInUse: Yes
  InstallationDate: Installed on 2013-02-22 (3022 days ago)
  InstallationMedia: Ubuntu 12.10 "Quantal Quetzal" - Release amd64 (20121017.5)
  SourcePackage: nvidia-graphics-drivers-465
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/nvidia-graphics-drivers-460/+bug/1930733/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to