On Mon, Mar 31, 2014 at 02:02:23PM +0000, Mathieu Desnoyers wrote:
> ----- Original Message -----
> > From: "Paul E. McKenney" <paul...@linux.vnet.ibm.com>
> > To: "Mathieu Desnoyers" <mathieu.desnoy...@efficios.com>
> > Cc: fweis...@gmail.com, pet...@infradead.org, linux-kernel@vger.kernel.org
> > Sent: Sunday, March 30, 2014 11:54:58 PM
> > Subject: Re: Promela/spin model for NO_HZ_FULL_SYSIDLE code
> > 
> > On Mon, Mar 31, 2014 at 03:29:25AM +0000, Mathieu Desnoyers wrote:
> > > ----- Original Message -----
> > > > From: "Paul E. McKenney" <paul...@linux.vnet.ibm.com>
> > > > To: fweis...@gmail.com, "mathieu desnoyers"
> > > > <mathieu.desnoy...@efficios.com>, pet...@infradead.org
> > > > Cc: linux-kernel@vger.kernel.org
> > > > Sent: Sunday, March 30, 2014 7:08:56 PM
> > > > Subject: Promela/spin model for NO_HZ_FULL_SYSIDLE code
> > > > 
> > > > For whatever it is worth, the following model claims safety and progress
> > > > for the sysidle state machine.
> > > > 
> > > > Thoughts?
> > > 
> > > Hi Paul,
> > > 
> > > Please keep in mind that it's been a while since I've looked at Promela
> > > proofs, but a couple of questions come to mind. First, how is the 
> > > execution
> > > script below checking for progress in any way ? I remember that we used
> > > the negation of a "_np" LTL claim to check for progress in the past.
> > > Moreover, I'd be much more comfortable seeing ifdefs in the code that
> > > inject
> > > known failures, and let them be triggered by error-triggering runs in the
> > > scripts (with e.g. -DINJECT_ERROR_XYZ), to confirm that this model 
> > > actually
> > > catches known issues.
> > 
> > Well, if I comment out the four "progress_" labels, it complains about
> > a non-progress cycle.  So at least spin does pay attention to these.  ;-)
> > 
> > If I put the "progress_" labels back in, but change the check so as to
> > prevent the state machine from ever entering RCU_SYSIDLE_FULL_NOTED,
> > it cranks through 14M states before complaining about a non-progress cycle,
> > as expected.
> > 
> > If I revert back to pristine state, and then comment out the statements
> > that reverts state back to RCU_SYSIDLE_NOT when exiting idle, an assert()
> > triggers, as expected.
> > 
> > > If we can show that the model can detect a few failure modes, both for
> > > safety
> > > and progress, then my confidence level in the model will start to improve.
> > > ;-)
> > 
> > Well, there probably is a bug in there somewhere, Murphy being who he is.
> > ;-)
> 
> One failure mode your model seems to miss is when I comment the wakeup:
> 
>                 /* If needed, wake up the timekeeper. */
>                 if
> #ifndef INJECT_NO_WAKEUP
>                 :: oldstate == RCU_SYSIDLE_FULL_NOTED ->
>                         wakeup_timekeeper = 1;
> #endif /* #ifndef INJECT_NO_WAKEUP */
>                 :: else -> skip;
>                 fi;
> 
> Somehow, I feel I am doing something very nasty to the algorithm,
> but the model checker fails to find any kind of progress or safety
> issue. Any idea why ?

Hmmm...  One reason is because I am not modeling the timekeeper thread
as waiting for the wakeup.  Let me see what I can do about that...

                                                        Thanx, Paul

> Thanks,
> 
> Mathieu
> 
> 
> > 
> >                                                     Thanx, Paul
> > 
> > > Thanks,
> > > 
> > > Mathieu
> > > 
> > > > 
> > > >                                                         Thanx, Paul
> > > > 
> > > > ------------------------------------------------------------------------
> > > > sysidle.sh
> > > > ------------------------------------------------------------------------
> > > > spin -a sysidle.spin
> > > > cc -DNP -o pan pan.c
> > > > # Fair scheduling to focus progress checks in timekeeper.
> > > > ./pan -f -l -m1280000 -w22
> > > > 
> > > > ------------------------------------------------------------------------
> > > > sysidle.spin
> > > > ------------------------------------------------------------------------
> > > > /*
> > > >  * Promela model for CONFIG_NO_HZ_FULL_SYSIDLE=y in the Linux kernel.
> > > >  * This model assumes that the dyntick-idle bit manipulation works based
> > > >  * on long usage, and substitutes a per-thread boolean "am_busy[]" array
> > > >  * for the Linux kernel's dyntick-idle masks.  The focus of this model
> > > >  * is therefore on the state machine itself.  Checks for both safety and
> > > >  * forward progress.
> > > >  *
> > > >  * This program is free software; you can redistribute it and/or modify
> > > >  * it under the terms of the GNU General Public License as published by
> > > >  * the Free Software Foundation; either version 2 of the License, or
> > > >  * (at your option) any later version.
> > > >  *
> > > >  * This program is distributed in the hope that it will be useful,
> > > >  * but WITHOUT ANY WARRANTY; without even the implied warranty of
> > > >  * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
> > > >  * GNU General Public License for more details.
> > > >  *
> > > >  * You should have received a copy of the GNU General Public License
> > > >  * along with this program; if not, you can access it online at
> > > >  * http://www.gnu.org/licenses/gpl-2.0.html.
> > > >  *
> > > >  * Copyright IBM Corporation, 2014
> > > >  *
> > > >  * Author: Paul E. McKenney <paul...@linux.vnet.ibm.com>
> > > >  */
> > > > 
> > > > #define NUM_WORKERS 3
> > > > 
> > > > byte wakeup_timekeeper = 0; /* Models rcu_kick_nohz_cpu(). */
> > > > 
> > > > #define RCU_SYSIDLE_NOT         0       /* Some CPU is not idle. */
> > > > #define RCU_SYSIDLE_SHORT       1       /* All CPUs idle for brief 
> > > > period. */
> > > > #define RCU_SYSIDLE_LONG        2       /* All CPUs idle for long 
> > > > enough. */
> > > > #define RCU_SYSIDLE_FULL        3       /* All CPUs idle, ready for 
> > > > sysidle. */
> > > > #define RCU_SYSIDLE_FULL_NOTED  4       /* Actually entered sysidle 
> > > > state. */
> > > > 
> > > > byte full_sysidle_state = RCU_SYSIDLE_NOT;
> > > > 
> > > > byte am_busy[NUM_WORKERS];  /* Busy is similar to "not dyntick-idle". */
> > > > byte am_setup[NUM_WORKERS]; /* Setup means timekeeper knows I am not
> > > > idle. */
> > > > 
> > > > /*
> > > >  * Non-timekeeping CPU going into and out of dyntick-idle state.
> > > >  */
> > > > proctype worker(byte me)
> > > > {
> > > >         byte oldstate;
> > > > 
> > > >         do
> > > >         :: 1 ->
> > > >                 /* Go idle. */
> > > >                 am_setup[me] = 0;
> > > >                 am_busy[me] = 0;
> > > > 
> > > >                 /* Dyntick-idle in the following loop. */
> > > >                 do
> > > >                 :: 1 -> skip;
> > > >                 :: 1 -> break;
> > > >                 od;
> > > > 
> > > >                 /* Exit idle loop, model getting out of dyntick idle 
> > > > state. */
> > > >                 am_busy[me] = 1;
> > > > 
> > > >                 /* Get state out of full-system idle states. */
> > > >                 atomic {
> > > >                         oldstate = full_sysidle_state;
> > > >                         if
> > > >                         :: oldstate > RCU_SYSIDLE_SHORT ->
> > > >                                 full_sysidle_state = RCU_SYSIDLE_NOT;
> > > >                         :: else -> skip;
> > > >                         fi;
> > > >                 }
> > > > 
> > > >                 /* If needed, wake up the timekeeper. */
> > > >                 if
> > > >                 :: oldstate == RCU_SYSIDLE_FULL_NOTED ->
> > > >                         wakeup_timekeeper = 1;
> > > >                 :: else -> skip;
> > > >                 fi;
> > > > 
> > > >                 /* Mark ourselves fully awake and operational. */
> > > >                 am_setup[me] = 1;
> > > > 
> > > >                 /* We are fully awake, so timekeeper must not be 
> > > > asleep. */
> > > >                 assert(full_sysidle_state < RCU_SYSIDLE_FULL);
> > > > 
> > > >                 /* Running in kernel in the following loop. */
> > > >                 do
> > > >                 :: 1 -> skip;
> > > >                 :: 1 -> break;
> > > >                 od;
> > > >         od
> > > > }
> > > > 
> > > > /*
> > > >  * Are all the workers in dyntick-idle state?
> > > >  */
> > > > #define check_idle() \
> > > >         i = 0; \
> > > >         idle = 1; \
> > > >         do \
> > > >         :: i < NUM_WORKERS -> \
> > > >                 if \
> > > >                 :: am_busy[i] == 1 -> idle = 0; \
> > > >                 :: else -> skip; \
> > > >                 fi; \
> > > >                 i++; \
> > > >         :: i >= NUM_WORKERS -> break; \
> > > >         od
> > > > 
> > > > /*
> > > >  * Timekeeping CPU.
> > > >  */
> > > > proctype timekeeper()
> > > > {
> > > >         byte i;
> > > >         byte idle;
> > > >         byte curstate;
> > > >         byte newstate;
> > > > 
> > > >         do
> > > >         :: 1 ->
> > > >                 /* Capture current state. */
> > > >                 check_idle();
> > > >                 curstate = full_sysidle_state;
> > > >                 newstate = curstate;
> > > > 
> > > >                 /* Check for acceptance state. */
> > > >                 if
> > > >                 :: idle == 0 ->
> > > > progress_idle:
> > > >                         skip;
> > > >                 :: curstate == RCU_SYSIDLE_NOT ->
> > > > progress_idle_reset:
> > > >                         skip;
> > > >                 :: else -> skip;
> > > >                 fi;
> > > > 
> > > >                 /* Manage state... */
> > > >                 if
> > > >                 :: idle == 1 && curstate < RCU_SYSIDLE_FULL_NOTED ->
> > > >                         /* Idle, advance to next state. */
> > > >                         atomic {
> > > >                                 if
> > > >                                 :: full_sysidle_state == curstate ->
> > > >                                         newstate = curstate + 1;
> > > >                                         full_sysidle_state = newstate;
> > > >                                 :: else -> skip;
> > > >                                 fi;
> > > >                         }
> > > >                 :: idle == 0 && full_sysidle_state >= RCU_SYSIDLE_LONG 
> > > > ->
> > > >                         /* Non-idle and state advanced, revert to base 
> > > > state. */
> > > >                         full_sysidle_state = RCU_SYSIDLE_NOT;
> > > >                 :: else -> skip;
> > > >                 fi;
> > > > 
> > > >                 /* If in RCU_SYSIDLE_FULL_NOTED, wait to be awakened. */
> > > >                 do
> > > >                 :: newstate != RCU_SYSIDLE_FULL_NOTED &&
> > > >                    wakeup_timekeeper == 1 ->
> > > >                         assert(0); /* Should never get here. */
> > > >                 :: newstate != RCU_SYSIDLE_FULL_NOTED &&
> > > >                    wakeup_timekeeper == 0 ->
> > > >                         break;
> > > >                 :: newstate == RCU_SYSIDLE_FULL_NOTED &&
> > > >                    wakeup_timekeeper == 1 ->
> > > > progress_full_system_idle_1:
> > > >                         assert(full_sysidle_state == RCU_SYSIDLE_NOT);
> > > >                         wakeup_timekeeper = 0;
> > > >                         break;
> > > >                 :: newstate == RCU_SYSIDLE_FULL_NOTED &&
> > > >                    wakeup_timekeeper == 0 ->
> > > > progress_full_system_idle_2:
> > > > 
> > > >                         /* We are asleep, so all workers better be 
> > > > idle. */
> > > >                         atomic {
> > > >                                 i = 0;
> > > >                                 idle = 1;
> > > >                                 do
> > > >                                 :: i < NUM_WORKERS ->
> > > >                                         if
> > > >                                         :: am_setup[i] -> idle = 0;
> > > >                                         :: else -> skip;
> > > >                                         fi;
> > > >                                         i++;
> > > >                                 :: i >= NUM_WORKERS -> break;
> > > >                                 od;
> > > >                                 assert(idle == 1 ||
> > > >                                        full_sysidle_state < 
> > > > RCU_SYSIDLE_FULL);
> > > >                         }
> > > >                 od;
> > > >                 assert(full_sysidle_state <= RCU_SYSIDLE_FULL_NOTED);
> > > >         od;
> > > > }
> > > > 
> > > > init {
> > > >         byte i = 0;
> > > > 
> > > >         do
> > > >         :: i < NUM_WORKERS ->
> > > >                 am_busy[i] = 1;
> > > >                 am_setup[i] = 1;
> > > >                 run worker(i);
> > > >                 i++;
> > > >         :: i >= NUM_WORKERS -> break;
> > > >         od;
> > > >         run timekeeper();
> > > > }
> > > > 
> > > > 
> > > 
> > > --
> > > Mathieu Desnoyers
> > > EfficiOS Inc.
> > > http://www.efficios.com
> > > 
> > 
> > 
> 
> -- 
> Mathieu Desnoyers
> EfficiOS Inc.
> http://www.efficios.com
> 

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to