Re: [HACKERS] Generic Monitoring Framework Proposal

Theo Schlossnagle Mon, 19 Jun 2006 16:37:46 -0700


On Jun 19, 2006, at 6:41 PM, Robert Lor wrote:

Theo Schlossnagle wrote:
Heh. Syscall probes and FBT probes in Dtrace have zero overhead. User-space probes do have overhead, but it is only a few instructions (two I think). Basically, the probe points are replaced by illegal instructions and the kernel infrastructure for Dtrace will fasttrap the ops and then act. So, it is tiny tiny overhead. Little enough that it isn't unreasonable to instrument things like s_lock which are tiny.
Theo, you're a genius. FBT (function boundary tracing) probes have zero overhead (section 4.1) and user-space probes have two instructions of overhead (section 4.2). I was incorrect about making a general zero overhead statement. But it's so close to zero :-)
http://www.sun.com/bigadmin/content/dtrace/dtrace_usenix.pdf
The reason that Robert proposes user-space probes (I assume) is that tracing C functions can be too granular and not conveniently expose the "right" information to make tracing useful.
Yes, I'm proposing user-space probes (aka User Statically-Defined Tracing - USDT). USDT provides a high-level abstraction so the application can expose well defined probes without the user having to know the detailed implementation. For example, instead of having to know the function LWLockAcquire(), a well documented probe called lwlock_acquire with the appropriate args is much more usable.
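
To make the lwlock_acquire idea concrete, here is a minimal sketch in C of what a statically defined probe point could look like. It is not the PostgreSQL code: the ENABLE_DTRACE flag, the "postgresql" provider name, the TRACE_LWLOCK_ACQUIRE macro, and demo_lwlock_acquire() are all hypothetical names chosen for illustration. DTRACE_PROBE2 is the two-argument probe macro from <sys/sdt.h>; without the flag, the macro falls back to a plain counter so the sketch compiles anywhere.

```c
#include <assert.h>

/* ENABLE_DTRACE is a hypothetical build flag for this sketch. */
#ifdef ENABLE_DTRACE
#include <sys/sdt.h>
/* "postgresql" is an assumed provider name, not a confirmed one. */
#define TRACE_LWLOCK_ACQUIRE(lockid, mode) \
    DTRACE_PROBE2(postgresql, lwlock_acquire, (lockid), (mode))
#else
/* Fallback: count probe firings per lock mode so the sketch runs anywhere. */
static unsigned long probe_hits[2];
#define TRACE_LWLOCK_ACQUIRE(lockid, mode) \
    do { (void)(lockid); probe_hits[(mode)]++; } while (0)
#endif

typedef enum { DEMO_EXCLUSIVE = 0, DEMO_SHARED = 1 } DemoLockMode;

/* Stand-in for LWLockAcquire(); only the probe placement matters here. */
static void demo_lwlock_acquire(int lockid, DemoLockMode mode)
{
    assert(mode == DEMO_EXCLUSIVE || mode == DEMO_SHARED);

    /* The probe names the event and its args; callers never see internals. */
    TRACE_LWLOCK_ACQUIRE(lockid, mode);

    /* ... the actual lock acquisition would happen here ... */
}
```

In a real build, the provider and probe would presumably also need to be declared in a provider definition file and processed with dtrace -G; that step is left out of this sketch.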

I am giving a talk at OSCON this year about PostgreSQL on "big systems". Big is all relative, but I will be talking about dtrace a bit and the advantages of running PostgreSQL on Solaris, which is what we ended up doing after some extremely disturbing experiences on Linux. I was able to track a very acute memory "leak" in pl/perl (which Neil so kindly fixed) within a few moments -- and this is without explicit user-space trace points. If there were good user-space points, I likely wouldn't have had to dig in the source as a precursor to my dtrace efforts.


The things you might be able to do with user-specific trace points:

  o better understand the block scatter (distance of block-level reads) for a specific query.
  o understand lock contention in vastly multiprocessor systems using plockstat (my hunch is that heavy-weight locks might be better).
      o our current box is 4 way opteron, but we have a 16-way T2000 as well.
  o report on queries including turn-around time, block-accesses, lock acquisitions grouped by query for specific time windows.
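
As a rough illustration of the last item, the sketch below groups probe events by query within a time window. Everything in it is assumed for the example: the ProbeEvent record, the integer query_id, the event kinds, and the fixed-size table are hypothetical, and turn-around time is left out to keep it short. With dtrace itself this grouping would normally be expressed as a D aggregation rather than in C.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_QUERIES 16

typedef enum { EV_BLOCK_READ, EV_LOCK_ACQUIRE } EventKind;

/* Hypothetical record of one probe firing. */
typedef struct {
    int       query_id;     /* assumed per-query identifier */
    EventKind kind;
    double    timestamp_s;  /* seconds since tracing started */
} ProbeEvent;

typedef struct {
    int           query_id;
    unsigned long block_reads;
    unsigned long lock_acquires;
} QueryStats;

/*
 * Tally events with start_s <= timestamp < end_s, grouped by query_id.
 * Returns the number of distinct queries written to out[].
 */
static size_t aggregate_window(const ProbeEvent *ev, size_t n,
                               double start_s, double end_s,
                               QueryStats out[MAX_QUERIES])
{
    size_t used = 0;

    assert(end_s >= start_s);
    for (size_t i = 0; i < n; i++) {
        if (ev[i].timestamp_s < start_s || ev[i].timestamp_s >= end_s)
            continue;               /* outside the requested window */

        size_t j = 0;
        while (j < used && out[j].query_id != ev[i].query_id)
            j++;
        if (j == used) {            /* first event for this query */
            if (used == MAX_QUERIES)
                continue;           /* table full: drop the event */
            out[used].query_id = ev[i].query_id;
            out[used].block_reads = 0;
            out[used].lock_acquires = 0;
            used++;
        }

        if (ev[i].kind == EV_BLOCK_READ)
            out[j].block_reads++;
        else
            out[j].lock_acquires++;
    }
    return used;
}
```

The linear search per event is fine for a handful of queries; a hash table keyed on the query would be the obvious change for anything larger.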

The nice thing about dtrace is that it requires no "prep" to look at a problem. When something is acting odd in production, you don't want to attempt to repeat it in a test environment first. You want to observe it. Dtrace allows you to dig in "really deep" in production with an acceptable performance penalty and ask questions that couldn't be asked before. It is exceptionally clever stuff. Of all the new "neat stuff" in Solaris 10, it has my vote for coolest and most useful. I've nailed several production problems (outside of Postgres) using dtrace with accuracy and efficiency. When Solaris 10u2 is released, we'll be trying Postgres on ZFS, so my rankings may change :-)

Having intelligently placed dtrace probes in Postgres would allow us to deal with postgres as a "first class" app on Solaris 10 with respect to troubleshooting obtuse production problems. That, to me, is exciting stuff.


Best regards,

Theo

// Theo Schlossnagle
// CTO -- http://www.omniti.com/~jesus/
// OmniTI Computer Consulting, Inc. -- http://www.omniti.com/
// Ecelerity: Run with it.


