Power8 Perforence Monitoring Unit (PMU) supports different sampling modes (SM) Random Instruction Sampling (RIS), Random Load/Store Facility Sampling (RLS) and Random Branch Sampling (RBS). This patchset enabled RLS mode to mark Load/Store instructions to save the memory hierarchy level (eg: L2, L3) for cache reload. Event used here to sample is "marked instruction complete".
This patchset exports the hierarchy information to the user via the perf_mem_data_src object. Patchset is based on and derived from Sukadev Bhattiprolu work[1]. It exports the memory hierarchy information only to Power8 processor based system, since similiar event or modes are not supported in Power7 [1]:https://lkml.org/lkml/2013/10/15/858 perf interface sample. Workload used here is "ebizzy" # perf report -n --mem-mode --sort=mem,sym,dso,symbol_daddr --stdio # To display the perf.data header info, please use --header/--header-only options. # # Samples: 33 of event 'cpu/mem_access/' # Total weight : 33 # Sort order : mem,sym,dso,symbol_daddr # # Overhead Samples Memory access Symbol Shared Object Data Symbol # ........ ............ ........................ .................................. ................. ......................................... # 12.12% 4 L2 hit [k] __do_softirq [kernel.kallsyms] [k] softirq_vec+0x8 6.06% 2 L2 hit [k] __schedule [kernel.kallsyms] [k] 0xc0000003bb975df0 6.06% 2 Remote Cache (1 hop) hit [k] scheduler_tick [kernel.kallsyms] [k] 0xc0000003feb1e4b0 3.03% 1 L2 hit [k] __acct_update_integrals [kernel.kallsyms] [k] 0xc0000003b79d8a30 3.03% 1 L2 hit [.] __memcpy_power7 libc-2.17.so [.] 0x0000010021349b00 3.03% 1 L2 hit [.] __memcpy_power7 libc-2.17.so [.] 0x000001002134a030 3.03% 1 L2 hit [k] __mmdrop [kernel.kallsyms] [k] pgtable_cache+0x58 3.03% 1 L2 hit [k] __update_cpu_load [kernel.kallsyms] [k] 0xc0000003feb1dc60 3.03% 1 L2 hit [.] _int_malloc libc-2.17.so [.] 0x00003fff90000090 3.03% 1 L2 hit [k] account_system_time [kernel.kallsyms] [k] 0xc0000003feb08088 ..... Madhavan Srinivasan (8): powerpc/perf: Remove PME_ prefix for power7 events powerpc/perf: Export Power8 generic events in sysfs powerpc/perf: EVENT macro for exporting generic events powerpc/perf: Add Power8 mem_access event to sysfs powerpc/perf: Define big-endian version of perf_mem_data_src powerpc/perf: Export Power8 memory hierarchy info to user space powerpc/perf: Set data source value powerpc/perf: cleanup in perf_event_print_debug() arch/powerpc/include/asm/perf_event_server.h | 4 +- arch/powerpc/perf/core-book3s.c | 17 ++++- arch/powerpc/perf/power7-pmu.c | 18 ++--- arch/powerpc/perf/power8-events-list.h | 21 ++++++ arch/powerpc/perf/power8-pmu.c | 98 ++++++++++++++++++++++++++-- include/uapi/linux/perf_event.h | 16 +++++ 6 files changed, 155 insertions(+), 19 deletions(-) create mode 100644 arch/powerpc/perf/power8-events-list.h -- 1.9.1 _______________________________________________ Linuxppc-dev mailing list Linuxppc-dev@lists.ozlabs.org https://lists.ozlabs.org/listinfo/linuxppc-dev