Re: rcu_preempt caused oom

2018-12-17 Thread Paul E. McKenney
On Tue, Dec 18, 2018 at 02:46:43AM +, Zhang, Jun wrote: > Hello, paul > > In softirq context, and current is rcu_preempt-10, rcu_gp_kthread_wake don't > wakeup rcu_preempt. > Maybe next patch could fix it. Please help review. > > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c > index 0b

RE: rcu_preempt caused oom

2018-12-17 Thread He, Bo
i J ; Chang, Junxiao ; Mei, Paul Subject: RE: rcu_preempt caused oom Hello, paul In softirq context, and current is rcu_preempt-10, rcu_gp_kthread_wake don't wakeup rcu_preempt. Maybe next patch could fix it. Please help review. diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c index 0b76

RE: rcu_preempt caused oom

2018-12-17 Thread Zhang, Jun
, December 18, 2018 07:16 To: paul...@linux.ibm.com Cc: Zhang, Jun ; Steven Rostedt ; linux-kernel@vger.kernel.org; j...@joshtriplett.org; mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Xiao, Jin ; Zhang, Yanmin ; Bai, Jie A ; Sun, Yi J ; Chang, Junxiao ; Mei, Paul Subject: RE: rcu_p

Re: rcu_preempt caused oom

2018-12-16 Thread Paul E. McKenney
On Mon, Dec 17, 2018 at 03:15:42AM +, He, Bo wrote: > for double confirm the issue is not reproduce after 90 hours, we tried only > add the enclosed patch on the easy reproduced build, the issue is not > reproduced after 63 hours in the whole weekend on 16 boards. > so current conclusion is t

RE: rcu_preempt caused oom

2018-12-16 Thread He, Bo
Zhang, Yanmin ; Bai, Jie A ; Sun, Yi J Subject: Re: rcu_preempt caused oom On Thu, Dec 13, 2018 at 09:10:12PM -0800, Paul E. McKenney wrote: > On Fri, Dec 14, 2018 at 02:40:50AM +, He, Bo wrote: > > another experiment we have done with the enclosed debug patch, and also > > h

Re: rcu_preempt caused oom

2018-12-13 Thread Paul E. McKenney
+ panic("hung_task: blocked in > > rcu_gp_kthread init"); > > + } > > + } > > ------------------ > > -Origina

Re: rcu_preempt caused oom

2018-12-13 Thread Paul E. McKenney
> -Original Message- > From: Paul E. McKenney > Sent: Friday, December 14, 2018 10:15 AM > To: He, Bo > Cc: Zhang, Jun ; Steven Rostedt ; > linux-kernel@vger.kernel.org; j...@joshtriplett.org; > math

RE: rcu_preempt caused oom

2018-12-13 Thread He, Bo
ffda RBX: 0000000e RCX: > 40045612 > [2.896396] RDX: f768172c RSI: f7d42d9c RDI: > f768172c > [2.896400] RBP: f7681708 R08: R09: > > [2.896404] R10: R11: 00

Re: rcu_preempt caused oom

2018-12-13 Thread Paul E. McKenney
gt; CONFIG_RCU_EQS_DEBUG=y > > -Original Message- > From: Paul E. McKenney > Sent: Friday, December 14, 2018 2:12 AM > To: He, Bo > Cc: Zhang, Jun ; Steven Rostedt ; > linux-kernel@vger.kernel.org; j...@joshtriplett.org; > mathieu.desnoy...@efficios.com; jiangsh

RE: rcu_preempt caused oom

2018-12-13 Thread He, Bo
Zhang, Jun ; Steven Rostedt ; linux-kernel@vger.kernel.org; j...@joshtriplett.org; mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Xiao, Jin ; Zhang, Yanmin ; Bai, Jie A ; Sun, Yi J Subject: Re: rcu_preempt caused oom On Thu, Dec 13, 2018 at 03:26:08PM +, He, Bo wrote: > one of the bo

Re: rcu_preempt caused oom

2018-12-13 Thread Paul E. McKenney
t state: RCU_GP_WAIT_GPS(1) ->state: 0x402 > delta ->gp_activity 4312486108 ->gp_req_activity 4312486108 ->gp_wake_time > 4312486108 ->gp_wake_seq 0 ->gp_seq -1200 ->gp_seq_needed -1200 > ->gp_flags 0x0 > > -Original Message- > From: Pa

Re: rcu_preempt caused oom

2018-12-12 Thread Paul E. McKenney
eq_needed) || > > + rdp->gpwrap) > > rdp->gp_seq_needed = rnp->gp_seq_needed; > > WRITE_ONCE(rdp->gpwrap, false); > > rcu_gpnum_ovf(rnp, rdp); > > > > > > -Original Message- > > From: Paul E. McKe

Re: rcu_preempt caused oom

2018-12-12 Thread Paul E. McKenney
c: Steven Rostedt ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin ; Bai, Jie A > ; Sun, Yi J > Subject: Re: rcu_preempt caused oom > > On Wed, Dec 12, 2018 at 11:13:22P

RE: rcu_preempt caused oom

2018-12-12 Thread Zhang, Jun
hang, Yanmin ; Bai, Jie A ; Sun, Yi J Subject: Re: rcu_preempt caused oom On Wed, Dec 12, 2018 at 11:13:22PM +, He, Bo wrote: > I don't see the rcutree.sysrq_rcu parameter in v4.19 kernel, I also checked > the latest kernel and the latest tag v4.20-rc6, not see the sysrq_rcu. >

Re: rcu_preempt caused oom

2018-12-12 Thread Paul E. McKenney
t; Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin ; Bai, Jie A > > Subject: Re: rcu_preempt caused oom > > On Wed, Dec 12, 2018 at 07:42:24AM -0800

RE: rcu_preempt caused oom

2018-12-12 Thread He, Bo
e, Bo Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; j...@joshtriplett.org; mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin ; Zhang, Yanmin ; Bai, Jie A Subject: Re: rcu_preempt caused oom On Wed, Dec 12, 2018 at 07:42:24AM -0800, Paul E. McKenney wrote: >

Re: rcu_preempt caused oom

2018-12-12 Thread Paul E. McKenney
On Wed, Dec 12, 2018 at 07:42:24AM -0800, Paul E. McKenney wrote: > On Wed, Dec 12, 2018 at 01:21:33PM +, He, Bo wrote: > > we reproduce on two boards, but I still not see the show_rcu_gp_kthreads() > > dump logs, it seems the patch can't catch the scenario. > > I double confirmed the CONFIG_P

Re: rcu_preempt caused oom

2018-12-12 Thread Paul E. McKenney
m: Paul E. McKenney > Sent: Wednesday, December 12, 2018 10:25 AM > To: He, Bo > Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin ; Bai, Jie A >

Re: rcu_preempt caused oom

2018-12-11 Thread Paul E. McKenney
Sent: Tuesday, December 11, 2018 12:47 PM > To: He, Bo > Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin ; Bai, Jie A > > Subject: Re: rcu_preem

RE: rcu_preempt caused oom

2018-12-10 Thread He, Bo
, Jun ; Xiao, Jin ; Zhang, Yanmin ; Bai, Jie A Subject: Re: rcu_preempt caused oom On Mon, Dec 10, 2018 at 04:38:38PM -0800, Paul E. McKenney wrote: > On Mon, Dec 10, 2018 at 06:56:18AM +, He, Bo wrote: > > Hi, > >We have start the test with the CONFIG_PROVE_RCU=y, and

Re: rcu_preempt caused oom

2018-12-10 Thread Paul E. McKenney
On Mon, Dec 10, 2018 at 04:38:38PM -0800, Paul E. McKenney wrote: > On Mon, Dec 10, 2018 at 06:56:18AM +, He, Bo wrote: > > Hi, > >We have start the test with the CONFIG_PROVE_RCU=y, and also add one 2s > > to detect the preempt rcu hang, hope we can get more useful logs tomorrow. > >

Re: rcu_preempt caused oom

2018-12-10 Thread Paul E. McKenney
On Mon, Dec 10, 2018 at 06:56:18AM +, He, Bo wrote: > Hi, >We have start the test with the CONFIG_PROVE_RCU=y, and also add one 2s to > detect the preempt rcu hang, hope we can get more useful logs tomorrow. >I also enclosed the config and the debug patches for you review. I instead

RE: rcu_preempt caused oom

2018-12-09 Thread He, Bo
, December 10, 2018 3:56 AM To: He, Bo Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; j...@joshtriplett.org; mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin ; Zhang, Yanmin ; Bai, Jie A Subject: Re: rcu_preempt caused oom On Fri, Dec 07, 2018 at 06:11:31AM -0800, Paul

Re: rcu_preempt caused oom

2018-12-09 Thread Paul E. McKenney
that in that case, boosting the priority wouldn't help. Besides, > > the scheduling-clock interrupt should also check for this, and should push > > things forward if need be. > > > > If RCU managed to put all of its callbacks into the RCU_NEXT_READY_TAIL &

Re: rcu_preempt caused oom

2018-12-07 Thread Paul E. McKenney
od checks > (RCU is supposed to have started the relevant grace period before putting > callbacks into that bucket). But that cannot be the case here, because new > callbacks are being enqueued throughout, and these would then trigger RCU's > start-a-new-grace-period checks

Re: rcu_preempt caused oom

2018-12-06 Thread Paul E. McKenney
> Thanx, Paul > > > -Original Message- > > From: Paul E. McKenney > > Sent: Wednesday, December 5, 2018 3:50 AM > > To: He, Bo > > Cc: Steven Rostedt ; > > linux-kernel@vger.kernel.org; j...@joshtri

Re: rcu_preempt caused oom

2018-12-05 Thread Paul E. McKenney
McKenney > > Sent: Monday, December 3, 2018 9:57 PM > > To: He, Bo > > Cc: Steven Rostedt ; > > linux-kernel@vger.kernel.org; j...@joshtriplett.org; > > mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun > > ; Xiao, Jin ; Zhang, Yanmin > > >

RE: rcu_preempt caused oom

2018-12-05 Thread He, Bo
l.org; j...@joshtriplett.org; mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin ; Zhang, Yanmin ; Bai, Jie A Subject: Re: rcu_preempt caused oom On Tue, Dec 04, 2018 at 07:50:04AM +, He, Bo wrote: > Hi, Paul: > the enclosed is the log trigger the 120s hung_task_pan

Re: rcu_preempt caused oom

2018-12-04 Thread Paul E. McKenney
. McKenney > Sent: Monday, December 3, 2018 9:57 PM > To: He, Bo > Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin > Subject: Re: rcu_preempt

RE: rcu_preempt caused oom

2018-12-03 Thread He, Bo
-- > From: Paul E. McKenney > Sent: Saturday, December 1, 2018 12:49 AM > To: He, Bo > Cc: Steven Rostedt ; > linux-kernel@vger.kernel.org; j...@joshtriplett.org; > mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun > ; Xiao, Jin ; Zhang, Yanmin > > Subje

Re: rcu_preempt caused oom

2018-12-03 Thread Paul E. McKenney
gt; Sent: Saturday, December 1, 2018 12:49 AM > To: He, Bo > Cc: Steven Rostedt ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin > Subject: Re: rcu_preempt caused oom >

RE: rcu_preempt caused oom

2018-12-02 Thread He, Bo
ang, Jun ; Xiao, Jin ; Zhang, Yanmin Subject: Re: rcu_preempt caused oom On Fri, Nov 30, 2018 at 03:18:58PM +, He, Bo wrote: > Here is the kernel cmdline: Thank you! > Kernel command line: androidboot.acpio_idx=0 > androidboot.bootloader=efiwrapper-02_03-userdebug_kernelf

Re: rcu_preempt caused oom

2018-11-30 Thread Paul E. McKenney
, 2018 11:17 PM > To: Paul E. McKenney > Cc: He, Bo ; linux-kernel@vger.kernel.org; > j...@joshtriplett.org; mathieu.desnoy...@efficios.com; > jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin > ; Zhang, Yanmin > Subject: Re: rcu_preempt caused oom > > On Fri, 30 Nov 2018 0

RE: rcu_preempt caused oom

2018-11-30 Thread He, Bo
...@joshtriplett.org; mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun ; Xiao, Jin ; Zhang, Yanmin Subject: Re: rcu_preempt caused oom On Fri, 30 Nov 2018 06:43:17 -0800 "Paul E. McKenney" wrote: > Could you please send me your list of kernel boot parameters? Th

Re: rcu_preempt caused oom

2018-11-30 Thread Steven Rostedt
On Fri, 30 Nov 2018 06:43:17 -0800 "Paul E. McKenney" wrote: > Could you please send me your list of kernel boot parameters? They > usually appear near the start of your console output. Or just: cat /proc/cmdline -- Steve

Re: rcu_preempt caused oom

2018-11-30 Thread Paul E. McKenney
t; -Original Message- > From: Paul E. McKenney > Sent: Thursday, November 29, 2018 10:27 PM > To: He, Bo > Cc: linux-kernel@vger.kernel.org; j...@joshtriplett.org; rost...@goodmis.org; > mathieu.desnoy...@efficios.com; jiangshan...@gmail.com; Zhang, Jun > ; Xiao, Jin ;

Re: rcu_preempt caused oom

2018-11-29 Thread Paul E. McKenney
On Thu, Nov 29, 2018 at 05:06:47AM -0800, Paul E. McKenney wrote: > On Thu, Nov 29, 2018 at 08:49:35AM +, He, Bo wrote: > > Hi, > > we test on kernel 4.19.0 on android, after run more than 24 Hours > > monkey stress test, we see OOM on 1/10 2G memory board, the issue is not > > seen on

Re: rcu_preempt caused oom

2018-11-29 Thread Paul E. McKenney
On Thu, Nov 29, 2018 at 08:49:35AM +, He, Bo wrote: > Hi, > we test on kernel 4.19.0 on android, after run more than 24 Hours > monkey stress test, we see OOM on 1/10 2G memory board, the issue is not seen > on the 4.14 kernel. > we have done some debugs: > 1. OOM is due to the filp co