** Changed in: ubuntu-realtime
       Status: In Progress => Fix Released

** No longer affects: kdump-tools (Ubuntu)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to kdump-tools in Ubuntu.
Matching subscriptions: Maintainer
https://bugs.launchpad.net/bugs/1995270

Title:
  Kernel crash dump not getting generated

Status in ubuntu-realtime:
  Fix Released

Bug description:
  I have to enable the kernel crash dump feature to investigate certain
  system hang issues. My intention is to enable the kernel crash dump
  feature (https://ubuntu.com/server/docs/kernel-crash-dump)  along with
  kernel.hung_task_panic and kernel.hung_task_timeout_secs parameters
  (in /etc/sysctl.conf), so that a kernel vmcore is generated when
  kernel detects a hung task for more than 5 minutes. I will be then
  able to send the vmcore files to Kernel engineers in Canonical for
  investigation.

  However, While trying to enable kernel crash dump, I'm unable to get a
  vmcore generate using the test procedure described in "Testing the
  Crash Dump Mechanism" section in:
  https://ubuntu.com/server/docs/kernel-crash-dump

  When I run the command: "echo c > /proc/sysrq-trigger", all I see is a
  backtrace (image attached). I do not see the print .. "Begin: Saving
  vmcore from kernel crash ..."

  Upon system reboot there is no vmcore file under the /var/crash
  directory. could you please help pointing out what steps i'm missing?

  
  Here are some system information:
  =======================================

  Last login: Mon Oct 31 11:35:21 2022
  root@vran-server-1:~# cat /proc/cmdline
  BOOT_IMAGE=/vmlinuz-5.15.0-1025-realtime 
root=/dev/mapper/ubuntu--vg-ubuntu--lv ro rhgb quiet skew_tick=1 nohz=on 
nohz_full=2-23,26-47,50-71,74-95 rcu_nocbs=2-23,26-47,50-71,74-95 
intel_pstate=disable nosoftlockup intel_iommu=on iommu=pt 
usbcore.autosuspend=-1 selinux=0 enforcing=0 nmi_watchdog=0 softlockup_panic=0 
audit=0 cgroup_memory=1 cgroup_enable=memory mce=off idle=poll 
default_hugepagesz=1G skew_tick=1 idle=poll processor.max_cstate=1 
intel_idle.max_cstate=0 rcu_nocb_poll kthread_cpus=0,1,72,73,48,49,24,25 
irqaffinity=0,1,72,73,48,49,24,25 nosoftlockup tsc=nowatchdog 
isolcpus=managed_irq,domain,2-23,26-47,50-71,74-95 
systemd.cpu_affinity=0,1,72,73,48,49,24,25 cgroup.memory=nokmem 
crashkernel=512M-:256M

  root@vran-server-1:~# uname -a
  Linux vran-server-1 5.15.0-1025-realtime #26 SMP PREEMPT_RT Thu Oct 20 
18:14:07 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux

  root@vran-server-1:~# kdump-config show
  DUMP_MODE:              kdump
  USE_KDUMP:              1
  KDUMP_COREDIR:          /var/crash
  crashkernel addr: 0x49000000
     /var/lib/kdump/vmlinuz: symbolic link to /boot/vmlinuz-5.15.0-1025-realtime
  kdump initrd: 
     /var/lib/kdump/initrd.img: symbolic link to 
/var/lib/kdump/initrd.img-5.15.0-1025-realtime
  current state:    ready to kdump

  kexec command:
    /sbin/kexec -p --command-line="BOOT_IMAGE=/vmlinuz-5.15.0-1025-realtime 
root=/dev/mapper/ubuntu--vg-ubuntu--lv ro rhgb quiet skew_tick=1 nohz=on 
nohz_full=2-23,26-47,50-71,74-95 rcu_nocbs=2-23,26-47,50-71,74-95 
intel_pstate=disable nosoftlockup intel_iommu=on iommu=pt 
usbcore.autosuspend=-1 selinux=0 enforcing=0 nmi_watchdog=0 softlockup_panic=0 
audit=0 cgroup_memory=1 cgroup_enable=memory mce=off idle=poll 
default_hugepagesz=1G skew_tick=1 idle=poll processor.max_cstate=1 
intel_idle.max_cstate=0 rcu_nocb_poll kthread_cpus=0,1,72,73,48,49,24,25 
irqaffinity=0,1,72,73,48,49,24,25 nosoftlockup tsc=nowatchdog 
isolcpus=managed_irq,domain,2-23,26-47,50-71,74-95 
systemd.cpu_affinity=0,1,72,73,48,49,24,25 cgroup.memory=nokmem reset_devices 
systemd.unit=kdump-tools-dump.service nr_cpus=1 irqpoll nousb" 
--initrd=/var/lib/kdump/initrd.img /var/lib/kdump/vmlinuz


  
  root@vran-server-1:~# cat /etc/default/kexec
  # Defaults for kexec initscript
  # sourced by /etc/init.d/kexec and /etc/init.d/kexec-load

  # Load a kexec kernel (true/false)
  LOAD_KEXEC=true

  # Kernel and initrd image
  KERNEL_IMAGE="/vmlinuz"
  INITRD="/initrd.img"

  # If empty, use current /proc/cmdline
  APPEND=""

  # Load the default kernel from grub config (true/false)
  USE_GRUB_CONFIG=false


  root@vran-server-1:~# cat /etc/default/kdump-tools
  # kdump-tools configuration
  # ---------------------------------------------------------------------------
  # USE_KDUMP - controls kdump will be configured
  #     0 - kdump kernel will not be loaded
  #     1 - kdump kernel will be loaded and kdump is configured
  #
  USE_KDUMP=1

  
  # ---------------------------------------------------------------------------
  # Kdump Kernel:
  # KDUMP_KERNEL - A full pathname to a kdump kernel.
  # KDUMP_INITRD - A full pathname to the kdump initrd (if used).
  #     If these are not set, kdump-config will try to use the current kernel
  #     and initrd if it is relocatable.  Otherwise, you will need to specify
  #     these manually.
  KDUMP_KERNEL=/var/lib/kdump/vmlinuz
  KDUMP_INITRD=/var/lib/kdump/initrd.img

  
  # ---------------------------------------------------------------------------
  # vmcore Handling:
  # KDUMP_COREDIR - local path to save the vmcore to.
  # KDUMP_FAIL_CMD - This variable can be used to cause a reboot or
  #     start a shell if saving the vmcore fails.  If not set, "reboot -f"
  #     is the default.
  #     Example - start a shell if the vmcore copy fails:
  #         KDUMP_FAIL_CMD="echo 'makedumpfile FAILED.'; /bin/bash; reboot -f"
  # KDUMP_DUMP_DMESG - This variable controls if the dmesg buffer is dumped.
  #     If unset or set to 1, the dmesg buffer is dumped. If set to 0, the dmesg
  #     buffer is not dumped.
  # KDUMP_NUM_DUMPS - This variable controls how many dump files are kept on
  #     the machine to prevent running out of disk space. If set to 0 or unset,
  #     the variable is ignored and no dump files are automatically purged.
  # KDUMP_COMPRESSION - Compress the dumpfile. No compression is used by 
default.
  #     Supported compressions: bzip2, gzip, lz4, xz
  KDUMP_COREDIR="/var/crash"
  #KDUMP_FAIL_CMD="reboot -f"
  #KDUMP_DUMP_DMESG=
  #KDUMP_NUM_DUMPS=
  #KDUMP_COMPRESSION=

  
  # ---------------------------------------------------------------------------
  # Makedumpfile options:
  # MAKEDUMP_ARGS - extra arguments passed to makedumpfile (8).  The default,
  #     if unset, is to pass '-c -d 31' telling makedumpfile to use compression
  #     and reduce the corefile to in-use kernel pages only.
  #MAKEDUMP_ARGS="-c -d 31"

  
  # ---------------------------------------------------------------------------
  # Kexec/Kdump args
  # KDUMP_KEXEC_ARGS - Additional arguments to the kexec command used to load
  #     the kdump kernel
  #     Example - Use this option on x86 systems with PAE and more than
  #     4 gig of memory:
  #         KDUMP_KEXEC_ARGS="--elf64-core-headers"
  # KDUMP_CMDLINE - The default is to use the contents of /proc/cmdline.
  #     Set this variable to override /proc/cmdline.
  # KDUMP_CMDLINE_APPEND - Additional arguments to append to the command line
  #     for the kdump kernel.  If unset, it defaults to
  #     "reset_devices systemd.unit=kdump-tools-dump.service nr_cpus=1 irqpoll 
nousb"
  #KDUMP_KEXEC_ARGS=""
  #KDUMP_CMDLINE=""
  #KDUMP_CMDLINE_APPEND="reset_devices systemd.unit=kdump-tools-dump.service 
nr_cpus=1 irqpoll nousb"

  # ---------------------------------------------------------------------------
  # Architecture specific Overrides:

  # ---------------------------------------------------------------------------
  # Remote dump facilities:
  # HOSTTAG - Select if hostname of IP address will be used as a prefix to the
  #           timestamped directory when sending files to the remote server.
  #           'ip' is the default.
  #HOSTTAG="hostname|[ip]"

  # NFS -     Hostname and mount point of the NFS server configured to receive
  #           the crash dump. The syntax must be {HOSTNAME}:{MOUNTPOINT}
  #           (e.g. remote:/var/crash)
  # NFS_TIMEO - Timeout before NFS retries a request. See man nfs(5) for 
details.
  # NFS_RETRANS - Number of times NFS client retries a request. See man nfs(5) 
for details.
  #NFS="<nfs mount>"
  #NFS_TIMEO="600"
  #NFS_RETRANS="3"

  # FTP - Hostname and path of the FTP server configured to receive the crash 
dump.
  #       The syntax is {HOSTNAME}[:{PATH}] with PATH defaulting to /.
  # FTP_USER - FTP username. A anonomous upload will be used if not set.
  # FTP_PASSWORD - password for the FTP user
  # FTP_PORT=21 - FTP port. Port 21 will be used by default.
  #FTP="<server>:<path>"
  #FTP_USER=""
  #FTP_PASSWORD=""
  #FTP_PORT=21

  # SSH - username and hostname of the remote server that will receive the dump
  #       and dmesg files.
  # SSH_KEY - Full path of the ssh private key to be used to login to the remote
  #           server. use kdump-config propagate to send the public key to the
  #           remote server
  #SSH="<user at server>"
  #SSH_KEY="<path>"

  
  root@vran-server-1:~# 
  root@vran-server-1:~# cat /proc/sys/kernel/sysrq
  176

  
  root@vran-server-1:~# dmesg | grep -i crash
  [    0.000000] Command line: BOOT_IMAGE=/vmlinuz-5.15.0-1025-realtime 
root=/dev/mapper/ubuntu--vg-ubuntu--lv ro rhgb quiet skew_tick=1 nohz=on 
nohz_full=2-23,26-47,50-71,74-95 rcu_nocbs=2-23,26-47,50-71,74-95 
intel_pstate=disable nosoftlockup intel_iommu=on iommu=pt 
usbcore.autosuspend=-1 selinux=0 enforcing=0 nmi_watchdog=0 softlockup_panic=0 
audit=0 cgroup_memory=1 cgroup_enable=memory mce=off idle=poll 
default_hugepagesz=1G skew_tick=1 idle=poll processor.max_cstate=1 
intel_idle.max_cstate=0 rcu_nocb_poll kthread_cpus=0,1,72,73,48,49,24,25 
irqaffinity=0,1,72,73,48,49,24,25 nosoftlockup tsc=nowatchdog 
isolcpus=managed_irq,domain,2-23,26-47,50-71,74-95 
systemd.cpu_affinity=0,1,72,73,48,49,24,25 cgroup.memory=nokmem 
crashkernel=512M-:256M
  [    0.012353] Reserving 256MB of memory at 1168MB for crashkernel (System 
RAM: 195278MB)
  [    0.551413] Kernel command line: BOOT_IMAGE=/vmlinuz-5.15.0-1025-realtime 
root=/dev/mapper/ubuntu--vg-ubuntu--lv ro rhgb quiet skew_tick=1 nohz=on 
nohz_full=2-23,26-47,50-71,74-95 rcu_nocbs=2-23,26-47,50-71,74-95 
intel_pstate=disable nosoftlockup intel_iommu=on iommu=pt 
usbcore.autosuspend=-1 selinux=0 enforcing=0 nmi_watchdog=0 softlockup_panic=0 
audit=0 cgroup_memory=1 cgroup_enable=memory mce=off idle=poll 
default_hugepagesz=1G skew_tick=1 idle=poll processor.max_cstate=1 
intel_idle.max_cstate=0 rcu_nocb_poll kthread_cpus=0,1,72,73,48,49,24,25 
irqaffinity=0,1,72,73,48,49,24,25 nosoftlockup tsc=nowatchdog 
isolcpus=managed_irq,domain,2-23,26-47,50-71,74-95 
systemd.cpu_affinity=0,1,72,73,48,49,24,25 cgroup.memory=nokmem 
crashkernel=512M-:256M
  [    3.536582] pstore: Using crash dump compression: deflate
  [    3.992799] megaraid_sas 0000:5e:00.0: firmware crash dump   : yes

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-realtime/+bug/1995270/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to