------- Comment From dieg...@br.ibm.com 2018-01-24 07:29 EDT-------
Hi,

The first problem (the segmentation fault) no longer seems to occur.

We are still hitting the second problem (see the trace below), but we
have created a separate bug report for it.

root@tuletapio2-lp3:~/linux/tools/testing/selftests/memory-hotplug# uname -a
Linux tuletapio2-lp3 4.13.0-16-generic #19~lp1706247 SMP Wed Nov 1 23:21:53 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux

[  101.664761] kernel BUG at /home/jsalisbury/bugs/lp1706247/artful/ubuntu-artful/mm/slub.c:4034!
[  101.664781] Oops: Exception in kernel mode, sig: 5 [#1]
[  101.664785] SMP NR_CPUS=2048
[  101.664785] NUMA
[  101.664788] pSeries
[  101.664792] Modules linked in: btrfs xor raid6_pq vmx_crypto crct10dif_vpmsum ip_tables x_tables autofs4 ibmveth ibmvscsi crc32c_vpmsum
[  101.664807] CPU: 60 PID: 2151 Comm: sh Not tainted 4.13.0-16-generic #19~lp1706247
[  101.664812] task: c000000fd2755900 task.stack: c000000fd27b8000
[  101.664816] NIP: c0000000003630a4 LR: c0000000003630b8 CTR: c000000000150280
[  101.664820] REGS: c000000fd27bb760 TRAP: 0700   Not tainted  (4.13.0-16-generic)
[  101.664824] MSR: 800000000282b033 <SF,VEC,VSX,EE,FP,ME,IR,DR,RI,LE>
[  101.664829]   CR: 24002828  XER: 00000000
[  101.664835] CFAR: c0000000003630c4 SOFTE: 1
[  101.664835] GPR00: c0000000003630b8 c000000fd27bb9e0 c000000001603300 0000000000000001
[  101.664835] GPR04: c0000007fc010000 c0000007ffff3d80 f000000001ff03a0 0000000000000100
[  101.664835] GPR08: c0000007ffff3c80 c0000007fc03fd90 0000000000000001 00000000ffffffff
[  101.664835] GPR12: 0000000024002822 c00000000fae7600 f000000000000000 0000000000000000
[  101.664835] GPR16: 00000000003fffff 0000000000140000 c0000000018e9400 f000000004ffffc0
[  101.664835] GPR20: 0000000000000000 c000000001633b00 0000000000000002 00000000ffffb370
[  101.664835] GPR24: 0000000000000001 0000000000001000 c000000fd27bbb70 0000000000000000
[  101.664835] GPR28: c0000000014cadd8 0000000000000010 c0000000014cae38 c0000007fc03fd80
[  101.664883] NIP [c0000000003630a4] slab_memory_callback+0x1d4/0x320
[  101.664887] LR [c0000000003630b8] slab_memory_callback+0x1e8/0x320
[  101.664890] Call Trace:
[  101.664893] [c000000fd27bb9e0] [c0000000003630b8] slab_memory_callback+0x1e8/0x320 (unreliable)
[  101.664901] [c000000fd27bba40] [c00000000012bacc] notifier_call_chain+0x9c/0x110
[  101.664906] [c000000fd27bba90] [c00000000012c704] blocking_notifier_call_chain+0x64/0xa0
[  101.664912] [c000000fd27bbad0] [c000000000824340] memory_notify+0x30/0x50
[  101.664917] [c000000fd27bbaf0] [c000000000367678] __offline_pages.constprop.6+0xaa8/0xb10
[  101.664923] [c000000fd27bbc30] [c0000000008241d8] memory_subsys_offline+0x68/0xd0
[  101.664928] [c000000fd27bbc60] [c0000000007f7b68] device_offline+0xc8/0x140
[  101.664933] [c000000fd27bbca0] [c000000000824040] store_mem_state+0x190/0x1b0
[  101.664938] [c000000fd27bbce0] [c0000000007f27dc] dev_attr_store+0x3c/0x60
[  101.664943] [c000000fd27bbd00] [c000000000469bd4] sysfs_kf_write+0x64/0x90
[  101.664948] [c000000fd27bbd20] [c00000000046883c] kernfs_fop_write+0x1ac/0x270
[  101.664954] [c000000fd27bbd70] [c00000000039d5cc] __vfs_write+0x3c/0x70
[  101.664959] [c000000fd27bbd90] [c00000000039f208] vfs_write+0xd8/0x220
[  101.664964] [c000000fd27bbde0] [c0000000003a1088] SyS_write+0x68/0x110
[  101.664969] [c000000fd27bbe30] [c00000000000b184] system_call+0x58/0x6c
[  101.664973] Instruction dump:
[  101.664976] ebfe0000 7fbff000 3bffff98 419e0160 fb610038 7bbd1f24 3b600000 7d3fea14
[  101.664983] e8890150 2fa40000 419e001c e9440020 <0b0a0000> fb690150 3c62002f e863a198
[  101.664990] ---[ end trace b02bc4a6b8dd8da5 ]---
[  101.667737]

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1724120

Title:
  Ubuntu 16.04.3 - call traces occurs when memory-hotplug test is run
  with 16Gb hugepages configured

Status in The Ubuntu-power-systems project:
  In Progress
Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Artful:
  In Progress

Bug description:
  Issue:

  Call traces occur when the memory-hotplug script is run with 16GB
  hugepages configured.

  Environment:
  ppc64le PowerVM Lpar

  root@ltctuleta-lp1:~# uname -r
  4.4.0-34-generic

  root@ltctuleta-lp1:~# cat /proc/meminfo | grep -i huge
  AnonHugePages:         0 kB
  HugePages_Total:       2
  HugePages_Free:        2
  HugePages_Rsvd:        0
  HugePages_Surp:        0
  Hugepagesize:   16777216 kB
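
For reference, the meminfo output above implies a hugepage pool of 32 GiB
(2 pages of 16777216 kB, i.e. 16 GiB, each). The report does not state
this, but it would be consistent with the 32G "used" figure in the
`free -h` output. A minimal Python sketch of the arithmetic follows; the
names `hugepage_pool_kib` and `SAMPLE` are illustrative and not part of
the original report.

```python
def hugepage_pool_kib(meminfo_text):
    """Return (page_count, page_size_kib, pool_kib) from /proc/meminfo-style text."""
    fields = {}
    for line in meminfo_text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Key: value" pairs
        parts = rest.split()
        if parts:
            fields[key.strip()] = int(parts[0])  # drop the optional "kB" unit
    total = fields["HugePages_Total"]
    size_kib = fields["Hugepagesize"]
    return total, size_kib, total * size_kib


# Values copied from the meminfo output in this report.
SAMPLE = """\
AnonHugePages:         0 kB
HugePages_Total:       2
HugePages_Free:        2
HugePages_Rsvd:        0
HugePages_Surp:        0
Hugepagesize:   16777216 kB
"""

if __name__ == "__main__":
    total, size_kib, pool_kib = hugepage_pool_kib(SAMPLE)
    kib_per_gib = 1024 * 1024
    print(f"{total} pages x {size_kib // kib_per_gib} GiB = {pool_kib // kib_per_gib} GiB")
```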

  root@ltctuleta-lp1:~# free -h
                total        used        free      shared  buff/cache   available
  Mem:            85G         32G         52G         16M        193M         52G
  Swap:           43G          0B         43G

  Steps to reproduce:
  1 - Download the kernel source and change into the directory
      tools/testing/selftests/memory-hotplug/
  2 - Run the mem-on-off-test.sh script there.
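
The traces in this report pass through store_mem_state and
memory_subsys_offline, i.e. the sysfs path taken when a memory block's
state file is written. A minimal Python sketch of that single operation
follows, assuming the usual sysfs layout
/sys/devices/system/memory/memoryN/state. The function names, the
`sysfs_root` parameter, and block number 42 are hypothetical and only
there so the example can be dry-run against a temporary directory; this
is not the selftest's own code.

```python
import os
import tempfile


def memory_block_state_path(block, sysfs_root="/sys"):
    """Build the path of the state file for memory block number `block`."""
    return os.path.join(
        sysfs_root, "devices", "system", "memory", f"memory{block}", "state"
    )


def set_memory_block_state(block, state, sysfs_root="/sys"):
    """Write `state` to the block's state file and return the path written."""
    # Simplification: only the two states relevant to this report are allowed.
    if state not in ("online", "offline"):
        raise ValueError(f"unsupported state: {state!r}")
    path = memory_block_state_path(block, sysfs_root)
    with open(path, "w") as state_file:
        state_file.write(state)
    return path


if __name__ == "__main__":
    # Dry run in a temporary directory so that no real memory block is touched.
    with tempfile.TemporaryDirectory() as fake_root:
        os.makedirs(os.path.dirname(memory_block_state_path(42, fake_root)))
        written = set_memory_block_state(42, "offline", fake_root)
        with open(written) as state_file:
            print(state_file.read())
```

On a real system the write requires root, and on an affected kernel with
16GB hugepages configured it may trigger the oops shown in this report,
so it should only be tried on a test machine.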

  The system produces call traces such as:

  offline_memory_expect_success 639: unexpected fail
  online-offline 668
  [   57.552964] Unable to handle kernel paging request for data at address 0x00000028
  [   57.552977] Faulting instruction address: 0xc00000000029bc04
  [   57.552987] Oops: Kernel access of bad area, sig: 11 [#1]
  [   57.552992] SMP NR_CPUS=2048 NUMA pSeries
  [   57.553002] Modules linked in: btrfs xor raid6_pq pseries_rng sunrpc autofs4 ses enclosure nouveau bnx2x i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm vxlan ip6_udp_tunnel ipr udp_tunnel rtc_generic mdio libcrc32c
  [   57.553050] CPU: 44 PID: 6518 Comm: mem-on-off-test Not tainted 4.4.0-34-generic #53-Ubuntu
  [   57.553059] task: c00000072773c8e0 ti: c000000727780000 task.ti: c000000727780000
  [   57.553067] NIP: c00000000029bc04 LR: c00000000029bbdc CTR: c0000000001107f0
  [   57.553076] REGS: c000000727783770 TRAP: 0300   Not tainted  (4.4.0-34-generic)
  [   57.553083] MSR: 8000000100009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 24242882  XER: 00000002
  [   57.553104] CFAR: c000000000008468 DAR: 0000000000000028 DSISR: 40000000 SOFTE: 1
  GPR00: c00000000029bbdc c0000007277839f0 c0000000015b5d00 0000000000000000 
  GPR04: 000000000029d000 0000000000000800 0000000000000000 f00000000a000001 
  GPR08: f00000000a700020 0000000000000008 c00000000185e270 c000000e7e000050 
  GPR12: 0000000000002200 c00000000e6ea200 000000000029d000 0000000022000000 
  GPR16: 1000000000000000 c0000000015e2200 000000000a700000 0000000000000000 
  GPR20: 0000000000010000 0000000000000100 0000000000000200 c0000000015f16d0 
  GPR24: c000000001876510 0000000000000000 0000000000000001 c000000001872a00 
  GPR28: 000000000029d000 f000000000000000 f00000000a700000 000000000029c000 
  [   57.553211] NIP [c00000000029bc04] dissolve_free_huge_pages+0x154/0x220
  [   57.553219] LR [c00000000029bbdc] dissolve_free_huge_pages+0x12c/0x220
  [   57.553226] Call Trace:
  [   57.553231] [c0000007277839f0] [c00000000029bbdc] dissolve_free_huge_pages+0x12c/0x220 (unreliable)
  [   57.553244] [c000000727783a80] [c0000000002dcbc8] __offline_pages.constprop.6+0x3f8/0x900
  [   57.553254] [c000000727783bd0] [c0000000006fbb38] memory_subsys_offline+0xa8/0x110
  [   57.553265] [c000000727783c00] [c0000000006d6424] device_offline+0x104/0x140
  [   57.553274] [c000000727783c40] [c0000000006fba80] store_mem_state+0x180/0x190
  [   57.553283] [c000000727783c80] [c0000000006d1e58] dev_attr_store+0x68/0xa0
  [   57.553293] [c000000727783cc0] [c000000000398110] sysfs_kf_write+0x80/0xb0
  [   57.553302] [c000000727783d00] [c000000000397028] kernfs_fop_write+0x188/0x200
  [   57.553312] [c000000727783d50] [c0000000002e190c] __vfs_write+0x6c/0xe0
  [   57.553321] [c000000727783d90] [c0000000002e2640] vfs_write+0xc0/0x230
  [   57.553329] [c000000727783de0] [c0000000002e367c] SyS_write+0x6c/0x110
  [   57.553339] [c000000727783e30] [c000000000009204] system_call+0x38/0xb4
  [   57.553346] Instruction dump:
  [   57.553351] 7e831836 4bfff991 e91e0028 e8fe0020 7d32e82a f9070008 f8e80000 fabe0020
  [   57.553366] fade0028 79294620 79291764 7d234a14 <e9030028> 3908ffff f9030028 81091458
  [   57.553383] ---[ end trace 617f7bdd75bcfc10 ]---
  [   57.557133] 
  Segmentation fault

  The following commit IDs were built into a 4.10.0-37-generic #41 test
  kernel and verified to fix the problem:

  a525108cf1cc14651602d678da38fa627a76a724
  e1073d1e7920946ac4776a619cc40668b9e1401b
  40692eb5eea209c2dd55857f44b4e1d7206e91d6
  e24a1307ba1f99fc62a0bd61d5e87fcfb6d5503d
  79cc38ded1e1ac86e69c90f604efadd50b0b3762
  4ae279c2c96ab38a78b954d218790a8f6db714e5

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1724120/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
