Hi Gilles, Thanks, and yes, you are right: ompi_info --all | grep "MCA io" | grep priority MCA io: parameter "io_romio_priority" (current value: "20", data source: default, level: 9 dev/all, type: int) MCA io: parameter "io_romio_delete_priority" (current value: "20", data source: default, level: 9 dev/all, type: int) MCA io: parameter "io_ompio_priority" (current value: "10", data source: default, level: 9 dev/all, type: int) MCA io: parameter "io_ompio_delete_priority" (current value: "10", data source: default, level: 9 dev/all, type: int)
So it seems we are indeed using ROMIO. Any suggestions what that means with respect to our file coherence issue? Regards, Michael On 23 Jul 2015, at 14:07 , Gilles Gouaillardet <gilles.gouaillar...@gmail.com<mailto:gilles.gouaillar...@gmail.com>> wrote: Michael, ROMIO is the default in the 1.8 series you can run ompi_info --all | grep io | grep priority ROMIO priority should be 20 and ompio priority should be 10. Cheers, Gilles On Thursday, July 23, 2015, Schlottke-Lakemper, Michael <m.schlottke-lakem...@aia.rwth-aachen.de<mailto:m.schlottke-lakem...@aia.rwth-aachen.de>> wrote: Hi Gilles, > are you running 1.8.7 or master ? 1.8.7. We recently upgraded our cluster installation from OpenSUSE 11.3/OpenMPI 1.6.5 to OpenSUSE 12.3/OpenMPI 1.8.7. Before the upgrade, we did not encounter this problem. > if not default, which io module are you running ? > (default is ROMIO with 1.8 but ompio with master) We did not specify anything at configure time, so I guess we are using the default. But if you tell me how, I can check. > by any chance, could you post a simple program that evidences this issue ? As of this time, unfortunately no. We only experience this issue intermittently, and only when running our suite of regression tests. It *seems* to occur only with a handful of the ~40 tests, but if we run only a subset of the tests (instead of all of them), it may not occur at all, depending on the subset. I tried using a MWE program but could not reproduce the issue with it. Sorry for not being more helpful, but we are also scratching our heads trying to understand what is going on and I just thought that maybe someone here has had a similar experience in the past (or might give us some pointers at what to look at). Regards, Michael