https://bugs.llvm.org/show_bug.cgi?id=41686

            Bug ID: 41686
           Summary: gather cycle counts inaccurate on Intel Hardware
           Product: tools
           Version: trunk
          Hardware: PC
                OS: Windows NT
            Status: NEW
          Severity: enhancement
          Priority: P
         Component: llvm-mca
          Assignee: unassignedb...@nondot.org
          Reporter: jon.kenn...@intel.com
                CC: andrea.dibia...@gmail.com, llvm-bugs@lists.llvm.org,
                    matthew.da...@sony.com

https://godbolt.org/z/yA_GLT

The above code snippet approximates the sequence of instructions (from the
inner loop) to take approx 3303 cycles for 100 iterations. This includes a
gather. 

When run on a real skylake based CPU (i7-8700K), it takes approximately ~1230
cycles.

Please update the latency/throughput numbers for the gather instruction within
llvm-mca to better reflect the real performance of the intel architectures.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
llvm-bugs mailing list
llvm-bugs@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-bugs

Reply via email to