I will have a look at those tests. The recent fixes were performance fixes, 
not correctness fixes.
Nevertheless, we used to pass the mpich tests, though I admit that it is not a 
test suite that we run regularly. The atomicity tests are expected to fail, 
since this is the one chapter of MPI I/O that is not implemented in ompio.

Thanks
Edgar

-----Original Message-----
From: users <users-boun...@lists.open-mpi.org> On Behalf Of Dave Love via users
Sent: Thursday, January 14, 2021 5:46 AM
To: users@lists.open-mpi.org
Cc: Dave Love <dave.l...@manchester.ac.uk>
Subject: [OMPI users] 4.1 mpi-io test failures on lustre

I tried mpi-io tests from mpich 4.3 with openmpi 4.1 on the ac922 system that I 
understand was used to fix ompio problems on lustre.  I'm puzzled that I still 
see failures.

I don't know why there are disjoint sets in mpich's test/mpi/io and 
src/mpi/romio/test, but I ran all the non-Fortran ones with MCA io defaults 
across two nodes.  In src/mpi/romio/test, atomicity failed (ignoring error and 
syshints); in test/mpi/io, the failures were setviewcur, tst_fileview, 
external32_derived_dtype, i_bigtype, and i_setviewcur.  tst_fileview was 
probably killed by the 100s timeout.

It may be that some are only appropriate for romio, but no-one said so before 
and they presumably shouldn't segv or report libc errors.

I built against ucx 1.9 with cuda support.  I realize that has problems on 
ppc64le, with no action on the issue, but there's a limit to what I can do.  
cuda looks relevant since one test crashes while apparently trying to register 
cuda memory; that's presumably not ompio's fault, but we need cuda.
