vasilis gkanis wrote:
I had a similar problem with the Portland Fortran compiler. I knew that this
was not caused by a network problem (I ran the code on a single node with 4
CPUs). After testing pretty much everything, I decided to change the compiler.
I used the Intel Fortran compiler and everything is running fine.
It could be PGI compiler voodoo :)
There were some thoughts on this e-mail thread that the problem could be
related to trac ticket 2043. Note that there has been progress on this
ticket; see <https://svn.open-mpi.org/trac/ompi/ticket/2043#comment:18>.
The shared-memory (on-node) communications were subject to race
conditions that could be exposed by optimizing compilers. Some signals
could have gotten lost in inter-process communications, quite possibly
leading to hangs.
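
To make the failure mode concrete, here is a minimal sketch of the general pattern involved. It is not the Open MPI code, and the names (struct mailbox, run_handoff) are made up for illustration. One process writes a payload into shared memory and then raises a flag; the other spins on the flag and then reads the payload. With a plain int flag, an optimizing compiler is free to hoist the load out of the spin loop or reorder the two stores, so the "signal" can be missed. The sketch assumes C11 atomics and a POSIX system with fork() and anonymous shared mappings, and uses release/acquire ordering to rule that out.

```c
#define _DEFAULT_SOURCE   /* exposes MAP_ANONYMOUS on glibc under strict -std=c11 */
#include <assert.h>
#include <stdatomic.h>
#include <sys/mman.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical mailbox placed in memory shared by two processes. */
struct mailbox {
    int payload;        /* plain data written by the sender */
    atomic_int ready;   /* flag: 0 = empty, 1 = payload is valid */
};

/* Sends `value` from a child process to the parent through shared memory.
 * Returns what the parent received, or -1 if setup failed. */
int run_handoff(int value)
{
    struct mailbox *box = mmap(NULL, sizeof *box, PROT_READ | PROT_WRITE,
                               MAP_SHARED | MAP_ANONYMOUS, -1, 0);
    if (box == MAP_FAILED)
        return -1;
    box->payload = 0;
    atomic_init(&box->ready, 0);

    pid_t pid = fork();
    if (pid < 0) {
        munmap(box, sizeof *box);
        return -1;
    }
    if (pid == 0) {
        /* Sender: write the data first, then publish the flag. The release
         * store keeps the payload write from being moved after it. */
        box->payload = value;
        atomic_store_explicit(&box->ready, 1, memory_order_release);
        _exit(0);
    }

    /* Receiver: the atomic acquire load is re-read on every iteration, so
     * the compiler cannot turn this into an infinite loop on a stale value. */
    while (atomic_load_explicit(&box->ready, memory_order_acquire) == 0)
        ;  /* spin */
    int received = box->payload;

    int status = 0;
    waitpid(pid, &status, 0);   /* reap the child */
    assert(WIFEXITED(status));
    munmap(box, sizeof *box);
    return received;
}
```

Calling run_handoff(42) from a small main() should hand back 42. Replacing atomic_int with a plain int and the atomic calls with ordinary assignments gives code that may still appear to work at -O0 but is allowed to hang once optimization is turned on, which matches the kind of symptom discussed in this thread.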
If you think you got bitten by this bug, please try the revisions
mentioned in the trac ticket and report your success (or, alas, failure)
via the trac ticket or another appropriate channel.