Hahnfeld added a comment.

Just tested this and got very weird results for register usage:

  void func(double *a) {
    #pragma omp target teams distribute parallel for map(a[0:100]) // 
dist_schedule(static)
    for (int i = 0; i < 100; i++) {
      a[i]++;
    }
  }

Compiling with current trunk for `sm_60` (Pascal): 29 registers
Adding `dist_schedule(static)` (the previous default): 19 registers
For reference: `dist_schedule(static, 128)` also uses 29 registers

Any ideas? This significantly slows down STREAM...


Repository:
  rC Clang

https://reviews.llvm.org/D52434



_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to