Just got this triggered with 5.4.0-15-generic #18-Ubuntu from focal-
proposed.

Logs below.

If you want me to run another kernel, or try patched kernels, I can do
that.


Kernel log:

----8<----
Feb 25 10:08:08 vesho kernel: i915 0000:00:02.0: GPU HANG: ecode 
9:1:0x00000000, hang on rcs0
Feb 25 10:08:08 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 25 10:08:08 vesho kernel: [drm:gen8_reset_engines [i915]] *ERROR* rcs0 
reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Feb 25 10:08:08 vesho kernel: i915 0000:00:02.0: Resetting chip for hang on rcs0
Feb 25 10:08:08 vesho kernel: [drm:gen8_reset_engines [i915]] *ERROR* rcs0 
reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Feb 25 10:08:08 vesho kernel: [drm:gen8_reset_engines [i915]] *ERROR* rcs0 
reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Feb 25 10:08:16 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 25 10:08:24 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 25 10:08:26 vesho kernel: GpuWatchdog[15034]: segfault at 0 ip 
000055af78399e32 sp 00007f8b359414c0 error 6 in chrome[55af74453000+7287000]
Feb 25 10:08:26 vesho kernel: Code: 83 c3 e8 75 e9 41 8b 85 00 01 00 00 85 c0 
0f 84 99 00 00 00 48 8d 3d 63 61 4b fb be 01 00 00 00 ba 03 00 00 00 e8 fe 17 
a6 fe <c7> 04 25 00>
Feb 25 10:08:28 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
...
Feb 25 10:09:18 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 25 10:09:20 vesho kernel: i915 0000:00:02.0: GPU recovery timed out, 
cancelling all in-flight rendering.
Feb 25 10:09:20 vesho kernel: i915 0000:00:02.0: Resetting chip for hang on rcs0
Feb 25 10:09:22 vesho kernel: i915 0000:00:02.0: GPU recovery timed out, 
cancelling all in-flight rendering.
Feb 25 10:09:22 vesho kernel: i915 0000:00:02.0: Resetting chip for hang on rcs0
Feb 25 10:09:22 vesho kernel: fbcon: Taking over console
Feb 25 10:09:23 vesho kernel: Console: switching to colour frame buffer device 
240x67
Feb 25 10:09:30 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 25 10:09:38 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
...
Feb 25 10:10:32 vesho kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 25 10:10:34 vesho kernel: i915 0000:00:02.0: GPU recovery timed out, 
cancelling all in-flight rendering.
Feb 25 10:10:34 vesho kernel: i915 0000:00:02.0: Resetting chip for hang on rcs0
Feb 25 10:10:36 vesho kernel: i915 0000:00:02.0: GPU recovery timed out, 
cancelling all in-flight rendering.
Feb 25 10:10:36 vesho kernel: i915 0000:00:02.0: Resetting chip for hang on rcs0
Feb 25 10:10:46 vesho kernel: GpuWatchdog[53827]: segfault at 0 ip 
000055f286b35e32 sp 00007f396a9a24c0 error 6 in chrome[55f282bef000+7287000]
Feb 25 10:10:46 vesho kernel: Code: 83 c3 e8 75 e9 41 8b 85 00 01 00 00 85 c0 
0f 84 99 00 00 00 48 8d 3d 63 61 4b fb be 01 00 00 00 ba 03 00 00 00 e8 fe 17 
a6 fe <c7> 04 25 00>
Feb 25 10:10:56 vesho kernel: GpuWatchdog[53920]: segfault at 0 ip 
0000555c042dee32 sp 00007f446a7b24c0 error 6 in chrome[555c00398000+7287000]
Feb 25 10:10:56 vesho kernel: Code: 83 c3 e8 75 e9 41 8b 85 00 01 00 00 85 c0 
0f 84 99 00 00 00 48 8d 3d 63 61 4b fb be 01 00 00 00 ba 03 00 00 00 e8 fe 17 
a6 fe <c7> 04 25 00>
----8<----

/sys/class/drm/card0/error:

----8<----
GPU HANG: ecode 9:1:0x00000000, hang on rcs0
Kernel: 5.4.0-15-generic x86_64
Driver: 20190822
Time: 1582618088 s 331762 us
Boottime: 57363 s 47980 us
Uptime: 834 s 211397 us
Epoch: 4297439752 jiffies (250 HZ)
Capture: 4297439752 jiffies; 8224 ms ago, 0 ms after epoch
Reset count: 0
Suspend count: 1
Platform: KABYLAKE
Subplatform: 0x0
PCI ID: 0x591b
PCI Revision: 0x04
PCI Subsystem: 1028:07bf
IOMMU enabled?: 1
DMC loaded: yes
DMC fw version: 1.4
GT awake: yes
RPM wakelock: yes
PM suspended: no
EIR: 0x00000000
IER: 0x08080000
GTIER[0]: 0x01010101
GTIER[1]: 0x01010101
GTIER[2]: 0x00000070
GTIER[3]: 0x00000101
PGTBL_ER: 0x00000000
FORCEWAKE: 0x00010001
DERRMR: 0x2077efef
CCID: 0x00000000
  fence[0] = 00000000
  fence[1] = 00000000
  fence[2] = 00000000
  fence[3] = 00000000
  fence[4] = 00000000
  fence[5] = a43f097074c0001
  fence[6] = 00000000
  fence[7] = 00000000
  fence[8] = 00000000
  fence[9] = 00000000
  fence[10] = 00000000
  fence[11] = 00000000
  fence[12] = 00000000
  fence[13] = 00000000
  fence[14] = 00000000
  fence[15] = 00000000
  fence[16] = 00000000
  fence[17] = 00000000
  fence[18] = 00000000
  fence[19] = 00000000
  fence[20] = 00000000
  fence[21] = 00000000
  fence[22] = 00000000
  fence[23] = 00000000
  fence[24] = 00000000
  fence[25] = 00000000
  fence[26] = 00000000
  fence[27] = 00000000
  fence[28] = 00000000
  fence[29] = 00000000
  fence[30] = 00000000
  fence[31] = 00000000
ERROR: 0x00000000
DONE_REG: 0xffffffff
FAULT_TLB_DATA: 0x00000019 0xf4bbf313
Num Pipes: 3
Pipe [0]:
  Power: on
  SRC: 077f0437
  STAT: 00000000
Plane [0]:
  CNTR: c4042400
  STRIDE: 00000026
  SURF: 08500000
  TILEOFF: 000d0240
Cursor [0]:
  CNTR: 00000000
  POS: 00000000
  BASE: 00000000
Pipe [1]:
  Power: on
  SRC: 09ff059f
  STAT: 00000000
Plane [1]:
  CNTR: c4043000
  STRIDE: 00000050
  SURF: 01700000
  TILEOFF: 00000000
Cursor [1]:
  CNTR: 00000000
  POS: 00000000
  BASE: 00000000
Pipe [2]:
  Power: on
  SRC: 09ff059f
  STAT: 00000000
Plane [2]:
  CNTR: c4043000
  STRIDE: 00000050
  SURF: 03700000
  TILEOFF: 00000000
Cursor [2]:
  CNTR: 00000000
  POS: 00000000
  BASE: 00000000
CPU transcoder: A
  Power: on
  CONF: 00000000
  HTOTAL: 00000000
  HBLANK: 00000000
  HSYNC: 00000000
  VTOTAL: 00000000
  VBLANK: 00000000
  VSYNC: 00000000
CPU transcoder: B
  Power: on
  CONF: c0000000
  HTOTAL: 0a9f09ff
  HBLANK: 0a9f09ff
  HSYNC: 0a4f0a2f
  VTOTAL: 05c8059f
  VBLANK: 05c8059f
  VSYNC: 05a705a2
CPU transcoder: C
  Power: on
  CONF: c0000000
  HTOTAL: 0a9f09ff
  HBLANK: 0a9f09ff
  HSYNC: 0a4f0a2f
  VTOTAL: 05c8059f
  VBLANK: 05c8059f
  VSYNC: 05a705a2
CPU transcoder: EDP
  Power: on
  CONF: c0000000
  HTOTAL: 081f077f
  HBLANK: 081f077f
  HSYNC: 07cf07af
  VTOTAL: 04560437
  VBLANK: 04560437
  VSYNC: 043f043a
is_mobile: no
is_lp: no
require_force_probe: no
has_64bit_reloc: yes
gpu_reset_clobbers_display: no
has_reset_engine: yes
has_fpga_dbg: yes
has_global_mocs: no
has_gt_uc: yes
has_l3_dpf: no
has_llc: yes
has_logical_ring_contexts: yes
has_logical_ring_elsq: no
has_logical_ring_preemption: yes
has_pooled_eu: no
has_rc6: yes
has_rc6p: no
has_rps: yes
has_runtime_pm: yes
has_snoop: no
has_coherent_ggtt: yes
unfenced_needs_alignment: no
hws_needs_physical: no
cursor_needs_physical: no
has_csr: yes
has_ddi: yes
has_dp_mst: yes
has_fbc: yes
has_gmch: no
has_hotplug: yes
has_ipc: yes
has_modular_fia: no
has_overlay: no
has_psr: yes
overlay_needs_physical: no
supports_tv: no
Has logical contexts? yes
scheduler: 1f
slice0: 3 subslice(s) (0x7):
        subslice0: 8 EUs (0xff)
        subslice1: 8 EUs (0xff)
        subslice2: 8 EUs (0xff)
        subslice3: 0 EUs (0x0)
slice1: 0 subslice(s) (0x0):
        subslice0: 0 EUs (0x0)
        subslice1: 0 EUs (0x0)
        subslice2: 0 EUs (0x0)
        subslice3: 0 EUs (0x0)
slice2: 0 subslice(s) (0x0):
        subslice0: 0 EUs (0x0)
        subslice1: 0 EUs (0x0)
        subslice2: 0 EUs (0x0)
        subslice3: 0 EUs (0x0)
i915.vbt_firmware=(null)
i915.modeset=-1
i915.lvds_channel_mode=0
i915.panel_use_ssc=-1
i915.vbt_sdvo_panel_type=-1
i915.enable_dc=-1
i915.enable_fbc=1
i915.enable_psr=0
i915.disable_power_well=1
i915.enable_ips=1
i915.invert_brightness=0
i915.enable_guc=0
i915.guc_log_level=-1
i915.guc_firmware_path=(null)
i915.huc_firmware_path=(null)
i915.dmc_firmware_path=(null)
i915.mmio_debug=0
i915.edp_vswing=0
i915.reset=2
i915.inject_load_failure=0
i915.fastboot=-1
i915.enable_dpcd_backlight=-1
i915.force_probe=
i915.alpha_support=no
i915.enable_hangcheck=yes
i915.prefault_disable=no
i915.load_detect_test=no
i915.force_reset_modeset_test=no
i915.error_capture=yes
i915.disable_display=no
i915.verbose_state_checks=yes
i915.nuclear_pageflip=no
i915.enable_dp_mst=yes
i915.enable_gvt=no
GuC firmware:
        status: DISABLED
        version: wanted 33.0, found 0.0
        uCode: 0 bytes
        RSA: 0 bytes
HuC firmware: (null)
        status: N/A
        version: wanted 0.0, found 0.0
        uCode: 0 bytes
        RSA: 0 bytes
----8<----

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1861395

Title:
  system hang: i915 Resetting rcs0 for hang on rcs0

Status in Linux:
  Fix Released
Status in linux package in Ubuntu:
  Incomplete
Status in linux source package in Eoan:
  Invalid
Status in linux source package in Focal:
  Incomplete

Bug description:
  System hangs, unknown cause, When this happens, the mouse pointer
  still moves, but I can't do anything else with the keys or clicking in
  the UI.  Only recover I have found is a hard power-off

  Last bit of kern.log below:

  Jan 30 12:43:51 aries kernel: [ 6649.263031] i915 0000:00:02.0: GPU HANG: 
ecode 9:1:0x00000000, hang on rcs0
  Jan 30 12:43:51 aries kernel: [ 6649.263032] GPU hangs can indicate a bug 
anywhere in the entire gfx stack, including userspace.
  Jan 30 12:43:51 aries kernel: [ 6649.263033] Please file a _new_ bug report 
on bugs.freedesktop.org against DRI -> DRM/Intel
  Jan 30 12:43:51 aries kernel: [ 6649.263033] drm/i915 developers can then 
reassign to the right component if it's not a kernel issue.
  Jan 30 12:43:51 aries kernel: [ 6649.263034] The GPU crash dump is required 
to analyze GPU hangs, so please always attach it.
  Jan 30 12:43:51 aries kernel: [ 6649.263034] GPU crash dump saved to 
/sys/class/drm/card0/error
  Jan 30 12:43:51 aries kernel: [ 6649.264039] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:43:51 aries kernel: [ 6649.264778] [drm:gen8_reset_engines [i915]] 
*ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
  Jan 30 12:43:51 aries kernel: [ 6649.265046] i915 0000:00:02.0: Resetting 
chip for hang on rcs0
  Jan 30 12:43:51 aries kernel: [ 6649.267018] [drm:gen8_reset_engines [i915]] 
*ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
  Jan 30 12:43:51 aries kernel: [ 6649.267764] [drm:gen8_reset_engines [i915]] 
*ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
  Jan 30 12:43:59 aries kernel: [ 6657.262680] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:01 aries kernel: [ 6659.246609] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:09 aries kernel: [ 6667.246324] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:09 aries kernel: [ 6667.494008] show_signal_msg: 20 callbacks 
suppressed
  Jan 30 12:44:09 aries kernel: [ 6667.494011] GpuWatchdog[6827]: segfault at 0 
ip 000055fd01917ded sp 00007f63043cc480 error 6 in chrome[55fcfd9dc000+7171000]
  Jan 30 12:44:09 aries kernel: [ 6667.494017] Code: 48 c1 c9 03 48 81 f9 af 00 
00 00 0f 87 c9 00 00 00 48 8d 15 a9 5a 9c fb f6 04 11 20 0f 84 b8 00 00 00 be 
01 00 00 00 ff 50 30 <c7> 04 25 00 00 00 00 37 13 00 00 c6 05 c1 6d a4 03 01 80 
7d 8f 00
  Jan 30 12:44:23 aries kernel: [ 6681.265885] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:25 aries kernel: [ 6683.245838] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:27 aries kernel: [ 6685.261749] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:29 aries kernel: [ 6687.245641] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:31 aries kernel: [ 6689.261618] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0
  Jan 30 12:44:51 aries kernel: [ 6709.260901] i915 0000:00:02.0: Resetting 
rcs0 for hang on rcs0

  ProblemType: Bug
  DistroRelease: Ubuntu 20.04
  Package: linux-image-5.4.0-12-generic 5.4.0-12.15
  ProcVersionSignature: Ubuntu 5.4.0-12.15-generic 5.4.8
  Uname: Linux 5.4.0-12-generic x86_64
  NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair
  ApportVersion: 2.20.11-0ubuntu15
  Architecture: amd64
  CurrentDesktop: ubuntu:GNOME
  Date: Thu Jan 30 12:51:24 2020
  InstallationDate: Installed on 2018-06-18 (591 days ago)
  InstallationMedia: Ubuntu 18.04 LTS "Bionic Beaver" - Release amd64 (20180426)
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-signed-5.4
  UpgradeStatus: Upgraded to focal on 2020-01-22 (8 days ago)
  --- 
  ProblemType: Bug
  ApportVersion: 2.20.11-0ubuntu16
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC0:  dpb       115653 F.... pulseaudio
  CurrentDesktop: ubuntu:GNOME
  DistroRelease: Ubuntu 20.04
  InstallationDate: Installed on 2018-06-18 (604 days ago)
  InstallationMedia: Ubuntu 18.04 LTS "Bionic Beaver" - Release amd64 (20180426)
  Lsusb:
   Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 001 Device 004: ID 138a:0097 Validity Sensors, Inc. 
   Bus 001 Device 003: ID 04f2:b5ce Chicony Electronics Co., Ltd Integrated 
Camera
   Bus 001 Device 002: ID 8087:0a2b Intel Corp. 
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  MachineType: LENOVO 20HRCTO1WW
  NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair
  Package: linux (not installed)
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 i915drmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.4.0-14-generic 
root=UUID=fa64d67d-26bf-4c42-a12f-c45b6ea5117c ro quiet splash vt.handoff=7
  ProcVersionSignature: Ubuntu 5.4.0-14.17-generic 5.4.18
  RelatedPackageVersions:
   linux-restricted-modules-5.4.0-14-generic N/A
   linux-backports-modules-5.4.0-14-generic  N/A
   linux-firmware                            1.186
  Tags:  focal
  Uname: Linux 5.4.0-14-generic x86_64
  UpgradeStatus: Upgraded to focal on 2020-01-22 (21 days ago)
  UserGroups: adm cdrom dip libvirt lpadmin lxd netdev plugdev sambashare sudo 
video
  _MarkForUpload: True
  dmi.bios.date: 11/25/2019
  dmi.bios.vendor: LENOVO
  dmi.bios.version: N1MET59W (1.44 )
  dmi.board.asset.tag: Not Available
  dmi.board.name: 20HRCTO1WW
  dmi.board.vendor: LENOVO
  dmi.board.version: Not Defined
  dmi.chassis.asset.tag: No Asset Information
  dmi.chassis.type: 10
  dmi.chassis.vendor: LENOVO
  dmi.chassis.version: None
  dmi.modalias: 
dmi:bvnLENOVO:bvrN1MET59W(1.44):bd11/25/2019:svnLENOVO:pn20HRCTO1WW:pvrThinkPadX1Carbon5th:rvnLENOVO:rn20HRCTO1WW:rvrNotDefined:cvnLENOVO:ct10:cvrNone:
  dmi.product.family: ThinkPad X1 Carbon 5th
  dmi.product.name: 20HRCTO1WW
  dmi.product.sku: LENOVO_MT_20HR_BU_Think_FM_ThinkPad X1 Carbon 5th
  dmi.product.version: ThinkPad X1 Carbon 5th
  dmi.sys.vendor: LENOVO

To manage notifications about this bug go to:
https://bugs.launchpad.net/linux/+bug/1861395/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to