On Tue, Jan 26, 2016 at 03:41:35PM +0100, Jiri Olsa wrote: > On Tue, Jan 26, 2016 at 11:10:25PM +0900, Namhyung Kim wrote: > > On Tue, Jan 26, 2016 at 02:27:26PM +0100, Jiri Olsa wrote: > > > On Tue, Jan 26, 2016 at 09:51:59PM +0900, Namhyung Kim wrote: > > > > On Tue, Jan 26, 2016 at 01:14:47PM +0100, Jiri Olsa wrote: > > > > > On Sun, Jan 24, 2016 at 10:53:23PM +0900, Namhyung Kim wrote: > > > > > > Hello, > > > > > > > > > > > > This patchset tries to implement percent limit to callchains which > > > > > > was > > > > > > requested by Andi Kleen. For some reason, limiting callchains by > > > > > > (overhead) percentage didn't work well. This patch fixes it and > > > > > > make > > > > > > --percent-limit also works for callchains as well as hist entries. > > > > > > > > > > > > This is available on 'perf/callchain-limit-v1' branch in my tree: > > > > > > > > > > > > > > > > > > git://git.kernel.org/pub/scm/linux/kernel/git/namhyung/linux-perf.git > > > > > > > > > > > > Any comments are welcome, > > > > > > > > > > > > Thanks, > > > > > > Namhyung > > > > > > > > > > > > > > > > > > Namhyung Kim (12): > > > > > > perf report: Apply --percent-limit to callchains also > > > > > > perf report: Apply callchain percent limit on --stdio > > > > > > perf report: Get rid of hist_entry__callchain_fprintf() > > > > > > perf report: Fix percent calculation on --stdio > > > > > > perf report: Hide output pipe for percent-limited callchains on > > > > > > stdio > > > > > > perf hists browser: Fix dump to show correct callchain style > > > > > > perf hists browser: Fix callchain_node__count_rows() > > > > > > perf hists browser: Apply callchain percent limit > > > > > > perf hists browser: Fix callchain counting when press ENTER key > > > > > > perf hists browser: Fix counting callchains when expand/collapse > > > > > > all > > > > > > perf hists browser: Update percent base for fractal callchain mode > > > > > > perf report: Fix callchain percent limit on --gtk > > > > > > > > > > is 0.5 the default or one has to use the --percent-limit 0.5 > > > > > for the limit to be effective? > > > > > > > > Yes, it's effective now. I also think we need to change the default > > > > limit of 0.5. It was set for 'fractal' mode initially AFAIK so its > > > > percentage is relative to each node. In this case 0.5% of limit makes > > > > sense because it'll be a very small (absolute) value. > > > > > > > > But With 'graph' mode (now default), there're many entries under 0.5 > > > > overhead and they silently won't show callchains anymore. Actually I > > > > was confused by it when working with this patchset. > > > > > > > > What about 0.005% for the new default? > > > > > > > > > > > > > > > > > > without the option I'm getting empty callchains that are below 0.5 > > > > > but only in TUI mode (attached).. --stdio shows them all unfolded > > > > > > > > It should not show them all. But I found that I missed a check for > > > > a stdio case. Could you please test below? > > > > > > did not help, it's still there.. same output as before > > > > Hmm.. strange, could you show me the (part of) stdio output? > > > > yea, that one changed as well.. no callchains now, attached > > > jirka > > > --- > [jolsa@krava perf]$ ./perf report --stdio > > ... > > 46.69% 46.69% ls [kernel.vmlinux] [k] intel_bts_enable_local > > | > ---0x1000 > __statfs > entry_SYSCALL_64_fastpath > sys_statfs > SYSC_statfs > user_statfs > user_path_at_empty > filename_lookup > path_lookupat > link_path_walk > inode_permission > __inode_permission > kernfs_iop_permission > kernfs_refresh_inode > security_inode_notifysecctx > selinux_inode_notifysecctx > selinux_inode_setsecurity > security_context_to_sid > security_context_to_sid_core > string_to_context_struct > hashtab_search > apic_timer_interrupt > smp_apic_timer_interrupt > local_apic_timer_interrupt > hrtimer_interrupt > __hrtimer_run_queues > tick_sched_timer > tick_sched_handle.isra.17 > update_process_times > scheduler_tick > perf_event_task_tick > perf_pmu_enable.part.87 > x86_pmu_enable > intel_pmu_enable_all > intel_bts_enable_local > > 0.08% 0.00% perf [kernel.vmlinux] [k] perf_pmu_enable.part.87 > > 0.08% 0.00% perf [kernel.vmlinux] [k] > perf_event_context_sched_in > 0.08% 0.00% perf [kernel.vmlinux] [k] perf_event_exec > > 0.08% 0.00% perf [kernel.vmlinux] [k] setup_new_exec > > 0.08% 0.00% perf [kernel.vmlinux] [k] load_elf_binary > > 0.08% 0.00% perf [kernel.vmlinux] [k] search_binary_handler > > 0.08% 0.00% perf [kernel.vmlinux] [k] > do_execveat_common.isra.33 > 0.08% 0.00% perf [kernel.vmlinux] [k] sys_execve > > 0.08% 0.00% perf [kernel.vmlinux] [k] return_from_execve > > 0.08% 0.00% perf [unknown] [k] 0x00007f2175b35e07 > > 0.04% 0.00% perf [kernel.vmlinux] [k] perf_event_nmi_handler > > 0.04% 0.00% perf [kernel.vmlinux] [k] nmi_handle > > 0.04% 0.00% perf [kernel.vmlinux] [k] default_do_nmi > > 0.04% 0.00% perf [kernel.vmlinux] [k] do_nmi > > 0.04% 0.00% perf [kernel.vmlinux] [k] end_repeat_nmi > > 0.04% 0.04% perf [kernel.vmlinux] [k] x86_pmu_enable > > 0.04% 0.04% perf [kernel.vmlinux] [k] native_apic_mem_write > >
What's the problem? Now by default callchains under 0.5% (absolute) will not be shown. I think this is intended output, and we need to consider changing the default percent limit. Thanks, Namhyung