Re: [dtrace-discuss] ustack() misses caller of leaf under profile probe

Ryan Johnson Thu, 22 Jul 2010 09:32:32 -0700

On 7/22/2010 3:15 AM, Adam Leventhal wrote:

What we could do is have the ustack() action record %o7 as well and then figure 
out in user-land whether or not it's relevant.

Actually, I tried %o7 first, and it's pretty dodgy -- once a stack frame has 
been created, it can hold anything. CC seems to use it a lot for bit-shifting 
and as the target of JMPL. In contrast, %i7 is (nearly) always valid.

In leaf-context, %o7 is all you have -- the %is and %ls belong to the previous 
frame.

That's exactly why %i7 works so well compared to %o7 -- the leaf doesn'tmess with it. At all times %i7 points to a CALL (or JMPL***) somewherehigher in the call chain, but not higher than the ustack() reports asthe current function's caller. This target might be the caller of aleaf, or the first in a sequence of tail calls (which may or may notcall a leaf), but either way it's accurate to add that target to thestack trace if it's not the current function:


  1. foo -> bar --- bar owns %i7, mem[%i7] is inside foo,
     target_of(mem[%i7]) == bar (ustack is accurate)
  2. foo -> tail1 -> ... -> tailN -> bar --- same as #1, but
     target_of(mem[%i7]) == tail1 != bar (insert tail1 just above leaf)
  3. foo -> bar -> leaf --- same as #1, but target_of(mem[%i7]) == bar
     != leaf (insert bar just above leaf)
  4. foo -> bar -> tail1 -> ... tailN -> leaf --- bar owns %i7 (insert
     bar just above leaf... no way to recover tail calls)
  5. foo -> tail1 -> ... -> tailN -> bar -> leaf --- bar owns %i7,
     mem[%i7] is inside foo, target_of(mem[%i7]) == tail1 != leaf
     (insert tail1 just above leaf... no way to recover other tail
     calls or bar)
  6. foo -> tail1 -> ... -> tailN -> bar -> tailA -> ... -> tailZ ->
     foo --- combination of previous two (insert tail1 just above leaf,
     other functions lost)

In every case, if mem[%i7] is a CALL and target_of(mem[%i7]) isn't thecurrent function, then it is correct to insert that target function assecond-from-top -- it is either a lost tail call or a missing caller wecan't see because of the leaf context. The one caveat is the insertedcaller will not have an offset, but this doesn't matter if we're onlyreconstructing a control flow graph.

*** If %i7 points to a CALL instruction, we can decode it and computethe target address as %i7 + (int) (4*mem[%i7]). However, we're out ofluck if it was a JMPL instruction, because we can't assume anythingabout the current content of registers used in the past to compute thetarget address.

http://wikis.sun.com/display/DTrace/Actions+and+Subroutines#ActionsandSubroutines-%7B%7Bustack%7D%7D,
 needs something resembling:

Limitations: Because ustack() must traverse stack frames to build its stack 
trace, functions which do not establish a stack frame can lead to unpredictable 
results. In particular
        • Functions making tail calls will not appear because they tear down 
their own stack frame before making the call.
        • Except inside function entry probes, leaf functions which have not (yet) established 
a stack frame sometimes prevent their caller from appearing in the stack trace (e.g. foo -> 
 bar ->  baz will appear as foo ->  baz). See<link-to-note-at-profile-provider>.

http://wikis.sun.com/display/DTrace/profile+Provider needs a new section at the 
end (before 'Stability'):

Limitations:

ustack() only reports the caller of a leaf function if the latter has 
established a stack frame.

A leaf function is typically used to refer to one that doesn't establish a 
stack frame.

OK. How about this for the ustack entry:

Limitations: ustack() only reports callers identified by a returnaddress in some stack frame:


   * Functions making tail calls do not appear because control never
     returns to them
   * Unless ustack() is called from a pid provider :::entry or
     :::return probe, a leaf function's caller will not appear because
     the return address does not reside in a stack frame. Entry/return
     probes are a special case because the return address is known even
     without a stack frame, but this not true in general

For example, suppose a program makes the following sequence of functioncalls::


foo -> tail_caller1 -> bar -> tail_caller2 -> baz

Calling ustack() from pid$target::baz:entry would report foo -> bar ->baz, because foo is mentioned in a stack frame and the pid provider canidentify bar. However, calling ustack() from an unanchored(asynchronous) context, such as a profile or tick probe, can report foo-> baz. This always occurs if baz is a leaf function (e.g. neverestablishes a stack frame), or occasionally if baz does not have a stackframe yet (or any more) because the probe fired inside prologue (orepilogue) code. Note that the two functions making tail calls do notappear in the stack trace under any circumstances.

==== and this for the profile provider entry (added at the end of thefirst paragraph)

Caveat: ustack() has some limitations when called from unanchoredcontext (See <link-to-ustack-limitations>).


=====

Incidentally, I think the ustack entry would be much easier to follow ifit presented the general case first -- ustack(void) and ustack(nframes)-- and explained what it does (similar to kstack, it can also be used asa key for aggregation). Then it could explain (in one sentence) that thestrsize arg exists to support java applications and forward-referencethe jstack entry. All the examples about java stacks could then movethere, where they belong.


Thoughts?
Ryan

_______________________________________________
dtrace-discuss mailing list
dtrace-discuss@opensolaris.org

Re: [dtrace-discuss] ustack() misses caller of leaf under profile probe

Reply via email to