On 08/16/2016 05:02 AM, [email protected] wrote:
+static inline void prefetch_vector_loop(const VECTYPE *p, int index)
+{
+#if defined(__aarch64__)
+ if (is_thunderx_pass2_cpu()) {
+ /* Prefetch 4 cache lines ahead from index */
+ VEC_PREFETCH(p, index + (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR *
4));
+ }
+#endif
+}
Oh come now. This is even worse than before. A function call protecting a mere prefetch within the main body of an inner loop?
Did you not understand what I was asking for? r~
