I'm receiving the error posted at the bottom of this message with a code compiled with Intel Fortran/C Version 11.1 against OpenMPI version 1.3.2.
The same code works correctly when compiled against MPICH2. (We have re-compiled with OpenMPI to take advantage of newly-installed Infiniband hardware. The "ring" test problem appears to work correctly over Infiniband.) There are no "fork()" calls in our code, so I can only guess that something weird is going on with MPI_COMM_WORLD. The code in question is a Fortran 90 code. Right now, it is being compiled with "include 'mpif.h'" statements at the beginning of each MPI subroutine, instead of making use of the "mpi" modules. Could this be causing the problem? How else should I go about diagnosing the problem? Thanks, Greg -------------------------------------------------------------------------- An MPI process has executed an operation involving a call to the "fork()" system call to create a child process. Open MPI is currently operating in a condition that could result in memory corruption or other system errors; your MPI job may hang, crash, or produce silent data corruption. The use of fork() (or system() or other calls that create child processes) is strongly discouraged. The process that invoked fork was: Local host: bl316 (PID 26806) MPI_COMM_WORLD rank: 0 If you are *absolutely sure* that your application will successfully and correctly survive a call to fork(), you may disable this warning by setting the mpi_warn_on_fork MCA parameter to 0. -------------------------------------------------------------------------- [bl205:5014] *** An error occurred in MPI_Cart_create [bl205:5014] *** on communicator MPI_COMM_WORLD [bl205:5014] *** MPI_ERR_ARG: invalid argument of some other kind [bl205:5014] *** MPI_ERRORS_ARE_FATAL (your MPI job will now abort) -------------------------------------------------------------------------- mpirun has exited due to process rank 4 with PID 5010 on node bl205 exiting without calling "finalize". This may have caused other processes in the application to be terminated by signals sent by mpirun (as reported here). -------------------------------------------------------------------------- [bl205:05008] 7 more processes have sent help message help-mpi-errors.txt / mpi_errors_are_fatal [bl205:05008] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages