Disco / Testing
=====

* Using linux-hwe-5.0 from "Disco" (EOL) on Bionic to test the 5.0
kernel.

modified
--------

$ uname -rv
5.0.0-57-generic #61~18.04.1+lp1867916.1 SMP Mon Jul 6 19:27:05 -03 2020

$ sudo make-bcache --bdev $DEV --block 8k
[  109.818171] bcache: bcache_device_init() bcache0: sb/logical block size 
(8192) greater than page size (4096) falling back to device logical block size 
(512)
[  109.822055] bcache: register_bdev() registered backing device loop0

original
--------

$ uname -rv
5.0.0-57-generic #61~18.04.1-Ubuntu SMP Mon Jul 6 09:40:52 UTC 2020

$ sudo make-bcache --bdev $DEV --block 8k
[  112.148300] bcache: register_bdev() registered backing device loop0
[  112.150575] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000008
[  112.153000] #PF error: [normal kernel read fault]
[  112.154474] PGD 800000012a5df067 P4D 800000012a5df067 PUD 137558067 PMD 0
[  112.156614] Oops: 0000 [#1] SMP PTI
[  112.157742] CPU: 1 PID: 1649 Comm: bcache-register Not tainted 
5.0.0-57-generic #61~18.04.1-Ubuntu
[  112.161386] RIP: 0010:create_empty_buffers+0x29/0x110
[  112.162321] Code: 00 0f 1f 44 00 00 55 48 89 e5 41 55 41 54 53 49 89 d5 ba 
01 00 00 00 48 89 fb e8 72 fe ff ff 49 89 c4 48 89 c2 eb 03 48 89 ca <48> 8b 4a 
08 4c 09 2a 48 85 c9 75 f1 4c 89 62 08 48 8b 43 18 48 8d
[  112.165333] RSP: 0018:ffff9fac40dd77f8 EFLAGS: 00010286
[  112.166198] RAX: 0000000000000000 RBX: ffffc8b744a73700 RCX: ffff9051b779d000
[  112.167430] RDX: 0000000000000000 RSI: ffff9051b779d000 RDI: ffffc8b744a73700
[  112.168767] RBP: ffff9fac40dd7810 R08: dead0000000000ff R09: 0000000000000003
[  112.169992] R10: 0000000000000000 R11: 00003748bb58c8ff R12: 0000000000000000
[  112.171515] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000200
[  112.172649] FS:  00007f4344d58700(0000) GS:ffff9051bba80000(0000) 
knlGS:0000000000000000
[  112.174052] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  112.175056] CR2: 0000000000000008 CR3: 0000000129e2a000 CR4: 00000000000006e0
[  112.176259] Call Trace:
[  112.176773]  create_page_buffers+0x52/0x60
[  112.177525]  block_read_full_page+0x4e/0x3c0
[  112.178304]  ? check_disk_change+0x70/0x70
[  112.179098]  ? count_shadow_nodes+0x130/0x130
[  112.179881]  blkdev_readpage+0x18/0x20
[  112.180571]  do_read_cache_page+0x37b/0x790
[  112.181324]  ? blkdev_writepages+0x10/0x10
[  112.182064]  ? get_page_from_freelist+0x154e/0x1560
[  112.182924]  ? update_load_avg+0x8b/0x5f0
[  112.183657]  read_cache_page+0x12/0x20
[  112.184354]  read_dev_sector+0x2d/0xe0
[  112.185041]  read_lba+0x130/0x220
[  112.185665]  efi_partition+0x131/0x770
[  112.186360]  ? string+0x60/0x90
[  112.186971]  ? vsnprintf+0xfb/0x510
[  112.187625]  ? snprintf+0x45/0x70
[  112.188252]  ? is_gpt_valid.part.7+0x420/0x420
[  112.189056]  check_partition+0x13f/0x250
[  112.189759]  ? is_gpt_valid.part.7+0x420/0x420
[  112.190542]  ? check_partition+0x13f/0x250
[  112.191292]  rescan_partitions+0xaf/0x360
[  112.192015]  bdev_disk_changed+0x5a/0x60
[  112.192723]  __blkdev_get+0x354/0x560
[  112.193440]  ? inode_init_always+0x131/0x1f0
[  112.194365]  blkdev_get+0x131/0x340
[  112.195106]  ? wake_up_bit+0x42/0x50
[  112.195867]  ? unlock_new_inode+0x4f/0x80
[  112.196744]  ? bdget+0x109/0x120
[  112.197436]  __device_add_disk+0x3d6/0x4f0
[  112.198315]  device_add_disk+0x13/0x20
[  112.199132]  bch_cached_dev_run.part.32+0x44/0x1b0 [bcache]
[  112.200248]  register_bcache+0xa28/0x1120 [bcache]
[  112.201214]  kobj_attr_store+0x12/0x20
[  112.201977]  ? kobj_attr_store+0x12/0x20
[  112.202802]  sysfs_kf_write+0x3c/0x50
[  112.203584]  kernfs_fop_write+0x13e/0x1c0
[  112.204458]  __vfs_write+0x1b/0x40
[  112.205164]  vfs_write+0xb1/0x1a0
[  112.205860]  ksys_write+0x5c/0xe0
[  112.206578]  __x64_sys_write+0x1a/0x20
[  112.207388]  do_syscall_64+0x5a/0x120
[  112.208157]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  112.209206] RIP: 0033:0x7f43448682c0
[  112.209956] Code: 73 01 c3 48 8b 0d d8 cb 2c 00 f7 d8 64 89 01 48 83 c8 ff 
c3 66 0f 1f 44 00 00 83 3d 89 24 2d 00 00 75 10 b8 01 00 00 00
 0f 05 <48> 3d 01 f0 ff ff 73 31 c3 48 83 ec 08 e8 fe dd 01 00 48 89 04 24
[  112.213375] RSP: 002b:00007ffe6225fcf8 EFLAGS: 00000246 ORIG_RAX: 
0000000000000001
[  112.214816] RAX: ffffffffffffffda RBX: 000000000000000b RCX: 00007f43448682c0
[  112.216142] RDX: 000000000000000b RSI: 0000000000dc9010 RDI: 0000000000000003
[  112.217477] RBP: 0000000000dc9010 R08: 0000000000000000 R09: 000000000000000b
[  112.218831] R10: 000000000000000a R11: 0000000000000246 R12: 000000000000000b
[  112.220143] R13: 0000000000000001 R14: 00007ffe6225fd70 R15: 0000000000000000
[  112.221450] Modules linked in: bcache crc64 isofs kvm_intel kvm irqbypass 
input_leds joydev serio_raw ib_iser rdma_cm iw_cm ib_cm ib_core
 iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs 
zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq
async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear psmouse 
virtio_net net_failover virtio_blk failover floppy
[  112.227818] CR2: 0000000000000008
[  112.228591] ---[ end trace c425c5c71d7443c3 ]---
[  112.229535] RIP: 0010:create_empty_buffers+0x29/0x110
[  112.230501] Code: 00 0f 1f 44 00 00 55 48 89 e5 41 55 41 54 53 49 89 d5 ba 
01 00 00 00 48 89 fb e8 72 fe ff ff 49 89 c4 48 89 c2 eb 03 48
 89 ca <48> 8b 4a 08 4c 09 2a 48 85 c9 75 f1 4c 89 62 08 48 8b 43 18 48 8d
[  112.233933] RSP: 0018:ffff9fac40dd77f8 EFLAGS: 00010286
[  112.234912] RAX: 0000000000000000 RBX: ffffc8b744a73700 RCX: ffff9051b779d000
[  112.236245] RDX: 0000000000000000 RSI: ffff9051b779d000 RDI: ffffc8b744a73700
[  112.237589] RBP: ffff9fac40dd7810 R08: dead0000000000ff R09: 0000000000000003
[  112.239009] R10: 0000000000000000 R11: 00003748bb58c8ff R12: 0000000000000000
[  112.240337] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000200
[  112.241646] FS:  00007f4344d58700(0000) GS:ffff9051bba80000(0000) 
knlGS:0000000000000000
[  112.243230] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  112.244303] CR2: 0000000000000008 CR3: 0000000129e2a000 CR4: 00000000000006e0

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1867916

Title:
  Regression in kernel 4.15.0-91 causes kernel panic with Bcache

Status in Linux:
  Confirmed
Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Xenial:
  In Progress
Status in linux source package in Bionic:
  In Progress
Status in linux source package in Eoan:
  In Progress
Status in linux source package in Focal:
  In Progress
Status in linux source package in Groovy:
  Won't Fix

Bug description:
  [Impact]

   * Users of bcache who manually specified a block size
     greater than the page size when creating the device
     with 'make-bcache' started to hit a kernel BUG/oops
     after kernel upgrades.  (This is not widely used.)

   * The issue was exposed by commit ad6bf88a6c19
     ("block: fix an integer overflow in logical block size")
     because it increased the range of values accepted as
     logical block size; larger values used to overflow to
     zero and thus receive a default of 512 via the block layer.

   * The issue existed previously, but with fewer values
     exposed (e.g. 8k, 16k, 32k); the regression reports
     happened with larger values (512k) for RAID stripes.
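
The overflow described above can be illustrated with plain shell
arithmetic. This is only a sketch: it assumes the logical block size
was held in a 16-bit unsigned field before commit ad6bf88a6c19 (an
assumption, not stated in this report), and `truncate_u16` is a
hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: emulate storing a block size in a 16-bit unsigned
# field (assumed width of the old logical block size; not from this report).
truncate_u16() {
    echo $(( $1 % 65536 ))
}

# Compare a few block sizes before and after truncation.
for size in 8192 16384 32768 524288; do
    printf '%s -> %s\n' "$size" "$(truncate_u16 "$size")"
done
```

Under that assumption 8k, 16k and 32k survive truncation unchanged,
while 512k wraps to zero, which is consistent with the values listed
above.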

  [Fix]

   * The upstream commit dcacbc1242c7 ("bcache: check and
     adjust logical block size for backing devices") checks
     the block size and adjusts it if needed, to the value
     of the underlying device's logical block size.

   * It is merged as of v5.8-rcN, and sent to v5.7 stable.
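
As a rough illustration of the check-and-adjust behaviour, here is a
shell sketch; it is not the kernel code. It follows the fallback
message printed by the patched kernel in the test log (a block size
greater than the page size falls back to the device's logical block
size). The function name and argument order are hypothetical.

```shell
#!/bin/sh
# Hypothetical model of the adjustment; not the actual kernel implementation.
# $1 = block size from the bcache superblock (bytes)
# $2 = page size (bytes)
# $3 = logical block size of the underlying device (bytes)
adjust_block_size() {
    if [ "$1" -gt "$2" ]; then
        # Larger than the page size: fall back to the device's value.
        echo "$3"
    else
        # Within the page size: keep the requested value.
        echo "$1"
    fi
}

adjust_block_size 8192 4096 512   # prints 512
adjust_block_size 4096 4096 512   # prints 4096
```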

  [Test Case]

   * Run make-bcache with a block size greater than the page size.
     $ sudo make-bcache --bdev $DEV --block 8k

   * Expected results: bcache device registered; no BUG/oops.
   * Detailed steps in comment #43.
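
The expected result can be checked mechanically against captured
kernel messages. The following is a minimal sketch that reads a log
on standard input and passes only if a backing device was registered
and no BUG/Oops line appears. `check_bcache_log` is a hypothetical
helper; the matched strings are taken from the logs in this report.

```shell
#!/bin/sh
# Hypothetical helper: reads kernel log text on stdin.
# Returns 0 if a backing device was registered and no BUG/Oops was logged.
check_bcache_log() {
    log=$(cat)
    # Any BUG or Oops line means the test failed.
    if printf '%s\n' "$log" | grep -Eq 'BUG:|Oops:'; then
        return 1
    fi
    # Otherwise require the successful registration message.
    printf '%s\n' "$log" | grep -q 'bcache: register_bdev() registered backing device'
}

# Example with a line from the patched kernel's output:
echo '[  109.822055] bcache: register_bdev() registered backing device loop0' \
    | check_bcache_log && echo PASS
```

On a live system the input could come from the kernel ring buffer,
for example `sudo dmesg | check_bcache_log`.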

  [Regression Potential]

   * Restricted to users who specify a bcache block size
     greater than page size.

   * Regressions could theoretically manifest on bcache
     device probe/register, if the underlying device's
     logical block size, for whatever reason, triggers issues
     not seen previously with the overflow/default 512 bytes.

  [Other Info]

   * Unstable has the patch on both master/master-5.7.
   * Groovy should get it on rebase.

  [Original Bug Description]
  After upgrading from kernel 4.15.0-88 to 4.15.0-91 one of our systems does 
not boot any longer. It always crashes during boot with a kernel panic.

  I suspect that this crash might be related to Bcache because this is
  the only one of our systems where we use Bcache and the kernel panic
  appears right after Bcache initialization.

  I already checked that this bug still exists in the 4.15.0-92.93
  kernel from proposed.

  Unfortunately, I cannot do a bisect because this is a critical
  production system and we do not have any other system with a similar
  configuration.

  I attached a screenshot with the trace of the kernel panic.

  The last message that appears before the kernel panic (or rather the
  last one that I can see - there is a rather long pause between that
  message and the panic and I cannot scroll up far enough to ensure that
  there are no other messages in between) is:

  bcache: register_bcache() error /dev/dm-0: device already registered

  When booting with kernel 4.15.0-88 that does not have this problem,
  the next message is

  bcache: register_bcache() error /dev/dm-12: device already registered
  (emitting change event)

  After that the next message is:

  Begin: Loading essential drivers ... done

  This message also appears after the kernel panic, but the boot process
  stalls and the system can only be recovered by doing a hardware reset.

  ProblemType: Bug
  DistroRelease: Ubuntu 18.04
  Package: linux-image-4.15.0-88-generic 4.15.0-88.88
  ProcVersionSignature: Ubuntu 4.15.0-88.88-generic 4.15.18
  Uname: Linux 4.15.0-88-generic x86_64
  AlsaDevices:
   total 0
   crw-rw---- 1 root audio 116,  1 Mar 17 21:08 seq
   crw-rw---- 1 root audio 116, 33 Mar 17 21:08 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
  ApportVersion: 2.20.9-0ubuntu7.11
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 
'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  Date: Wed Mar 18 12:55:18 2020
  HibernationDevice: RESUME=UUID=40512ea2-9fce-40f5-8362-5daf955cc26a
  InstallationDate: Installed on 2013-07-02 (2450 days ago)
  InstallationMedia: Ubuntu-Server 12.04.2 LTS "Precise Pangolin" - Release 
amd64 (20130214)
  MachineType: HP ProLiant DL160 G6
  PciMultimedia:

  ProcFB: 0 mgadrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-4.15.0-88-generic 
root=/dev/mapper/vg0-root ro nosmt nomdmonddf nomdmonisw nomdmonddf nomdmonisw 
nomdmonddf nomdmonisw nomdmonddf nomdmonisw nomdmonddf nomdmonisw
  RelatedPackageVersions:
   linux-restricted-modules-4.15.0-88-generic N/A
   linux-backports-modules-4.15.0-88-generic  N/A
   linux-firmware                             1.173.16
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
  SourcePackage: linux
  UpgradeStatus: Upgraded to bionic on 2018-09-23 (541 days ago)
  dmi.bios.date: 11/06/2009
  dmi.bios.vendor: HP
  dmi.bios.version: O33
  dmi.chassis.asset.tag: 0191525
  dmi.chassis.type: 23
  dmi.chassis.vendor: HP
  dmi.modalias: 
dmi:bvnHP:bvrO33:bd11/06/2009:svnHP:pnProLiantDL160G6:pvr:cvnHP:ct23:cvr:
  dmi.product.name: ProLiant DL160 G6
  dmi.sys.vendor: HP

To manage notifications about this bug go to:
https://bugs.launchpad.net/linux/+bug/1867916/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
