The libgomp.oacc-fortran/privatized-ref-1.f95 also has the same issue, and has been fixed in the same way.

Kwok

On 16/09/2020 11:33 am, Kwok Cheung Yeung wrote:
Hello

The libgomp.oacc-c++/privatized-ref-[23].C testcases request 64 workers in a parallel section, but Nvidia only supports a maximum of 32 workers, and GCN a maximum of 16. The worker numbers are overridden by the compiler with a warning message printed, which causes test failures on Nvidia. On GCN, the warning message is not printed by default (enabled by -foffload=-Wopenacc-dims).

This patch fixes this by requesting 16 workers, which is acceptable on both Nvidia and AMD GCN. Committed on the OG10 branch as obvious.

Kwok
commit abe747ac4e1e7c6a14b0877b88a7555b11e6bcb1
Author: Kwok Cheung Yeung <k...@codesourcery.com>
Date:   Thu Sep 17 03:16:04 2020 -0700

    Fix warning messages in libgomp.oacc-fortran/privatized-ref-1.f95 testcase
    
    2020-09-17  Kwok Cheung Yeung  <k...@codesourcery.com>
    
        libgomp/
        * testsuite/libgomp.oacc-fortran/privatized-ref-1.f95 (workers, 
vectors):
        Reduce number of workers to 16.

diff --git a/libgomp/ChangeLog.omp b/libgomp/ChangeLog.omp
index 890a4e2..fbe8bb4 100644
--- a/libgomp/ChangeLog.omp
+++ b/libgomp/ChangeLog.omp
@@ -1,3 +1,8 @@
+2020-09-17  Kwok Cheung Yeung  <k...@codesourcery.com>
+
+       * testsuite/libgomp.oacc-fortran/privatized-ref-1.f95 (workers, 
vectors):
+       Reduce number of workers to 16.
+
 2020-09-16  Kwok Cheung Yeung  <k...@codesourcery.com>
 
        * testsuite/libgomp.oacc-fortran/privatized-ref-2.f90: XFAIL on nvptx.
diff --git a/libgomp/testsuite/libgomp.oacc-fortran/privatized-ref-1.f95 
b/libgomp/testsuite/libgomp.oacc-fortran/privatized-ref-1.f95
index f16f69c..e4b8520 100644
--- a/libgomp/testsuite/libgomp.oacc-fortran/privatized-ref-1.f95
+++ b/libgomp/testsuite/libgomp.oacc-fortran/privatized-ref-1.f95
@@ -34,7 +34,7 @@ contains
     integer :: i, j
     real, intent(out) :: res(:)
 
-    !$acc parallel copyout(res) num_gangs(64) num_workers(64)
+    !$acc parallel copyout(res) num_gangs(64) num_workers(16)
 
     !$acc loop gang
     do i=0,255
@@ -54,7 +54,7 @@ contains
     integer :: i, j
     real, intent(out) :: res(:)
 
-    !$acc parallel copyout(res) num_gangs(64) num_workers(64)
+    !$acc parallel copyout(res) num_gangs(64) num_workers(16)
 
     !$acc loop gang worker
     do i=0,255

Reply via email to