On Thu, Sep 10, 2009 at 2:46 PM, Julian Elischer <jul...@elischer.org> wrote:
> I've noticed that schedgraph tends to show the idle threads slightly
> skewed one way or the other. I think there is a cumulative rounding
> error in the way they are drawn due to the fact that they are run so
> often. Check the raw data and I think you will find that you just
> need to imagine the idle threads slightly to the left or right a bit.
No, there's no period anywhere in the trace where either idle thread didn't run for an entire second. I'm pretty sure schedgraph is throwing in some nonsense results. I did capture a second, larger dataset after a 2.1s stall, and schedgraph includes an httpd process that supposedly spent 58 seconds on the run queue. I don't know whether that's a dropped record, a parsing error, or something else.

I do think that on this second graph I can kind of see the *end* of the stall: all of a sudden a ton of processes... everything from sshd to httpd to gmond to sh to vnlru to bufdaemon to fdc0... comes off of whatever it's waiting on and hits the run queue. The combined run queues for both processors spike up to 32 tasks at one point and then rapidly tail off as things return to normal.

That pretty much matches the behavior shown by ktrace in my initial post, where everything goes to sleep on something-or-other in the kernel and then, at the end of the stall, everything wakes up at the same time. I think this means the problem is somehow related to locking rather than scheduling.
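In case anyone wants to repeat the idle-thread check against their own raw dump, something along these lines is enough to report the longest gap between idle-thread events. This is only a rough sketch: the timestamp and thread-name parsing below are assumptions, not the actual ktrdump field layout, so the regexes would need adjusting to whatever the dump really looks like (and a stricter check would track each CPU's idle thread separately).

#!/usr/bin/env python
# Rough sketch: scan a text dump of a KTR trace and report the longest
# gap between consecutive events attributed to an idle thread.
# ASSUMPTION: each line carries a numeric timestamp and the thread name
# somewhere on it; the regexes below are placeholders, not the real
# ktrdump layout, and need adjusting to the actual output.
import re
import sys

TS_RE = re.compile(r'\b(\d+)\b')      # first integer on the line -> timestamp (assumed)
IDLE_RE = re.compile(r'idle', re.I)   # lines mentioning an idle thread

def max_idle_gap(path):
    last = None
    worst = (0, None, None)           # (gap, from_ts, to_ts)
    with open(path) as f:
        for line in f:
            if not IDLE_RE.search(line):
                continue
            m = TS_RE.search(line)
            if not m:
                continue
            ts = int(m.group(1))
            if last is not None:
                gap = abs(ts - last)  # abs() in case the dump is newest-first
                if gap > worst[0]:
                    worst = (gap, last, ts)
            last = ts
    return worst

if __name__ == '__main__':
    gap, a, b = max_idle_gap(sys.argv[1])
    print('longest idle-thread gap: %d ticks (between %s and %s)' % (gap, a, b))

Run against the dump I have here, nothing comes close to a full second between idle-thread events, which is why I don't buy the "rounding skew" explanation for the graph.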