https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112697

--- Comment #6 from Martin Jambor <jamborm at gcc dot gnu.org> ---
Created attachment 56719
  --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56719&action=edit
Perf annotate of milc built with r14-4971-g0beb1611754742

commit r14-4971-g0beb1611754742:

$ perf stat taskset -c 0 specinvoke

 Performance counter stats for 'taskset -c 0 specinvoke':

         216908.59 msec task-clock:u                     #    1.000 CPUs utilized
                 0      context-switches:u               #    0.000 /sec
                 0      cpu-migrations:u                 #    0.000 /sec
            889694      page-faults:u                    #    4.102 K/sec
      697007650237      cycles:u                         #    3.213 GHz                         (83.33%)
       31999772966      stalled-cycles-frontend:u        #    4.59% frontend cycles idle        (83.33%)
      540485725923      stalled-cycles-backend:u         #   77.54% backend cycles idle         (83.33%)
     1061256162815      instructions:u                   #    1.52  insn per cycle
                                                  #    0.51  stalled cycles per insn     (83.33%)
       58760648879      branches:u                       #  270.901 M/sec                       (83.34%)
          11890202      branch-misses:u                  #    0.02% of all branches             (83.33%)

     216.935387643 seconds time elapsed

     211.436079000 seconds user
       5.472459000 seconds sys
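
For anyone comparing these numbers against another revision, the ratios that perf prints after the `#` can be recomputed from the raw counts. The following is only a sketch: the dictionary keys and the helper name `derived_metrics` are made up for illustration, and the "stalled cycles per insn" formula (the larger of the two stall counters divided by instructions) is my understanding of what perf does, not something stated in this report.

```python
# Raw counts copied from the perf stat output above.
# The key names are hypothetical, chosen only for this sketch.
COUNTERS = {
    "task_clock_msec": 216908.59,
    "cycles": 697007650237,
    "stalled_frontend": 31999772966,
    "stalled_backend": 540485725923,
    "instructions": 1061256162815,
    "branches": 58760648879,
    "branch_misses": 11890202,
}


def derived_metrics(c):
    """Recompute the ratios perf stat shows after the '#' column."""
    # Assumption: perf uses the larger stall counter for the per-insn ratio.
    stalled = max(c["stalled_frontend"], c["stalled_backend"])
    return {
        # cycles per nanosecond is the same number as GHz
        "ghz": c["cycles"] / (c["task_clock_msec"] * 1e6),
        "ipc": c["instructions"] / c["cycles"],
        "frontend_idle_pct": 100.0 * c["stalled_frontend"] / c["cycles"],
        "backend_idle_pct": 100.0 * c["stalled_backend"] / c["cycles"],
        "stalled_per_insn": stalled / c["instructions"],
        "branch_miss_pct": 100.0 * c["branch_misses"] / c["branches"],
    }


if __name__ == "__main__":
    for name, value in derived_metrics(COUNTERS).items():
        print(f"{name:>18}: {value:.3f}")
```

The counts are multiplexed (the 83.33% column), so perf has already scaled them and the recomputed ratios are approximate in the same way the printed ones are.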

$ perf record taskset -c 0 specinvoke
[ perf record: Woken up 132 times to write data ]
[ perf record: Captured and wrote 32.901 MB perf.data (862286 samples) ]


$ perf report -n --percent-limit=1 --stdio
# To display the perf.data header info, please use --header/--header-only options.
#
#
# Total Lost Samples: 0
#
# Samples: 862K of event 'cycles:Pu'
# Event count (approx.): 695776598661
#
# Overhead       Samples  Command          Shared Object           Symbol
# ........  ............  ...............  ......................  ......................................
#
    22.68%        197003  milc_base.mine-  milc_base.mine-lto-gen  [.] mult_su3_na
    20.99%        177912  milc_base.mine-  milc_base.mine-lto-gen  [.] u_shift_fermion
    19.04%        163787  milc_base.mine-  milc_base.mine-lto-gen  [.] mult_su3_nn
     6.85%         58509  milc_base.mine-  milc_base.mine-lto-gen  [.] scalar_mult_add_su3_matrix
     5.51%         50953  milc_base.mine-  milc_base.mine-lto-gen  [.] path_product
     5.40%         46083  milc_base.mine-  milc_base.mine-lto-gen  [.] mult_su3_an
     4.22%         35853  milc_base.mine-  milc_base.mine-lto-gen  [.] add_force_to_mom
     3.77%         32446  milc_base.mine-  milc_base.mine-lto-gen  [.] imp_gauge_force.constprop.0
     1.98%         16848  milc_base.mine-  milc_base.mine-lto-gen  [.] compute_gen_staple
     1.94%         16462  milc_base.mine-  milc_base.mine-lto-gen  [.] make_anti_hermitian
     1.73%         14655  milc_base.mine-  milc_base.mine-lto-gen  [.] mult_su3_mat_vec_sum_4dir
     1.35%         11472  milc_base.mine-  milc_base.mine-lto-gen  [.] mult_adj_su3_mat_4vec
     1.27%         10801  milc_base.mine-  libc.so.6               [.] __memset_avx2_unaligned_erms


$ perf annotate -n --percent-limit=1 > ~/tmp/milc-perf-annotate-0beb1611754 
(gzipped and attached)
