On Thu, Oct 01, 2015 at 08:10:41PM +0100, Russell King - ARM Linux wrote:
> On Thu, Oct 01, 2015 at 10:26:47AM -0700, Drew Richardson wrote:
> > The layout of stack frames has changed over time. Testing using a
> > arm-linux-gnueabi gcc-4.2 from 2007 the original code didn't work but
> > this new code does. It also works with clang as well as newer versions
> > of gcc.
>
> Can you point to a modern ARM distribution where perf actually works with
> calltraces into userspace?

I am not aware of an ARM distribution where it works; that's the
problem. I optimistically said 'The layout of stack frames has changed
over time,' but I couldn't find any case where it worked (including
digging up an ARM compiler from 2007).

This is from 4.3-rc3 on Gentoo using 'perf record -ga ./dhrystone' then
'perf report -g':

     1.36%  dhrystone  dhrystone  [.] Func_3
            |
            --- Func_3
               |
               |--85.61%-- 0x59
               |
                --14.39%-- 0x7ec5d5ac

And this is after the proposed changes:

     1.99%  dhrystone  dhrystone  [.] Func_3
            |
            --- Func_3
               |
               |--87.45%-- cmd_report
               |          Proc_1
               |          main
               |          0x0
               |
                --12.55%-- Proc_1
                          main
                          0x0

The call stack unwinding isn't perfect; for example, leaf functions may
not write a stack frame at all. But it's hopefully better than it was.

Drew Richardson