Re: [OMPI users] MPI-IO Inconsistency over Lustre using OMPI 1.3

2009-03-03 Thread Nathan Baca
Thanks for the quick reply and suggestions. I have tried both isolating the output to a single OST and striping across multiple OSTs. Both produce the same result. I have tried compiling with multiple versions of both the PathScale and Intel compilers, all with the same result. The odd thing is t

[OMPI users] libnuma under ompi 1.3

2009-03-03 Thread Terry Frankcombe
Having just downloaded and installed Open MPI 1.3 with ifort and gcc, I merrily went off to compile my application. In my final link with mpif90 I get the error: /usr/bin/ld: cannot find -lnuma Adding --showme reveals that -I/home/terry/bin/Local/include -pthread -I/home/terry/bin/Local/lib is

Re: [OMPI users] MPI-IO Inconsistency over Lustre using OMPI 1.3

2009-03-03 Thread Brian Dobbins
Hi Nathan, I just ran your code here and it worked fine - CentOS 5 on dual Xeons w/ IB network, and the kernel is 2.6.18-53.1.14.el5_lustre.1.6.5smp. I used an OpenMPI 1.3.0 install compiled with Intel 11.0.081 and, independently, one with GCC 4.1.2. I tried a few different times with varying

[OMPI users] MPI-IO Inconsistency over Lustre using OMPI 1.3

2009-03-03 Thread Nathan Baca
Hello, I am seeing inconsistent MPI-IO behavior when writing to a Lustre file system using Open MPI 1.3 with ROMIO. What follows is a simple reproducer and output. Essentially one or more of the running processes does not read or write the correct amount of data to its part of a file residing on a

[OMPI users] Calculation stuck in MPI

2009-03-03 Thread Ondrej Marsalek
Dear everyone, I have a calculation (the CP2K program) using MPI over Infiniband and it is stuck. All processes (16 on 4 nodes) are running, taking 100% CPU. Attaching a debugger reveals this (only the end of the stack shown here): (gdb) backtrace #0 0x2b3460916dbf in btl_openib_component_pr