On Mon, Jul 01, 2013 at 11:16:01AM -0700, Josh Triplett wrote:
> On Mon, Jul 01, 2013 at 08:52:20AM -0700, Paul E. McKenney wrote:
> > On Mon, Jul 01, 2013 at 08:31:50AM -0700, Josh Triplett wrote:
> > > On Fri, Jun 28, 2013 at 01:10:17PM -0700, Paul E. McKenney wrote:
> > > > From: "Paul E. McKenney" <paul...@linux.vnet.ibm.com>
> > > > 
> > > > This commit adds fields to the rcu_dyntick structure that are used to
> > > > detect idle CPUs.  These new fields differ from the existing ones in
> > > > that the existing ones consider a CPU executing in user mode to be idle,
> > > > where the new ones consider CPUs executing in user mode to be busy.
> > > 
> > > Can you explain, both in the commit messages and in the comments added
> > > by the next commit, *why* this code doesn't consider userspace a
> > > quiescent state?
> > 
> > Good point!  Does the following explain it?
> > 
> >     Although one of RCU's quiescent states is usermode execution,
> >     it is not a full-system idle state.  This is because the purpose
> >     of the full-system idle state is not RCU, but rather determining
> >     when accurate timekeeping can safely be disabled.  Whenever
> >     accurate timekeeping is required in a CONFIG_NO_HZ_FULL kernel,
> >     at least one CPU must keep the scheduling-clock tick going.
> >     If even one CPU is executing in user mode, accurate timekeeping
> >     is requires, particularly for architectures where gettimeofday()
> >     and friends do not enter the kernel.  Only when all CPUs are
> >     really and truly idle can accurate timekeeping be disabled,
> >     allowing all CPUs to turn off the scheduling clock interrupt,
> >     thus greatly improving energy efficiency.
> > 
> >     This naturally raises the question "Why is this code in RCU rather
> >     than in timekeeping?", and the answer is that RCU has the data
> >     and infrastructure to efficiently make this determination.
> 
> Good explanation, thanks.
> 
> This also naturally raises the question "How can we let userspace get
> accurate time without forcing a timer tick?".

We don't.  ;-)

Without CONFIG_NO_HZ_FULL, if a CPU is running in user mode, that CPU
takes scheduling-clock interrupts.  User-mode code will therefore always
see accurate time.  For some definition of "accurate", anyway.

With CONFIG_NO_HZ_FULL and without CONFIG_NO_HZ_FULL_SYSIDLE, a single
designated CPU will always be taking scheduling-clock interrupts, which
again ensures that user-mode code will always see accurate time.

With both CONFIG_NO_HZ_FULL and CONFIG_NO_HZ_FULL_SYSIDLE, if
any CPU other than the timekeeping CPU is nonidle (where "nonidle"
includes usermode execution), then the timekeeping CPU will be taking
scheduling-clock interrupts, yet again ensuring that user-mode code will
always see accurate time.  If all CPUs are idle (in other words, we are
in RCU_SYSIDLE_FULL_NOTED state and the timekeeping CPU is also idle),
scheduling-clock interrupts will be globally disabled.  Or will be,
once I fix the bug noted by Frederic.

I am guessing that you would like this added to the explanation?  ;-)

                                                        Thanx, Paul

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to