On 4/24/2012 6:19 AM, Syed Ahsan Ali wrote:
I am not familiar with attaching debugger to the processes. Other
things you asked are as follows:
The easiest is to get Totalview or Allinea (both are parallel debuggers)
and attach them to the job. However they cost. Another is to try padb,
look
I am not familiar with attaching debugger to the processes. Other things
you asked are as follows:
Is this the first time you've ran it (with Open MPI? with any MPI?) *No
We have been running this and other models but this problem has arised now
* How many processes is the job using? Are you o
To determine if an MPI process is waiting for a message do what Rayson
suggested and attach a debugger to the processes and see if any of them
are stuck in MPI. Either internally in a MPI_Recv or MPI_Wait call or
looping on a MPI_Test call.
Other things to consider.
Is this the first time y
Dear Rayson,
That is a Nuemrical model that is written by National weather service of a
country. The logs of the model show every detail about the simulation
progress. I have checked on the remote nodes as well the application binary
is running but the logs show no progress, it is just waiting at
Seems like there's a bug in the application. Did you or someone else
write it, or did you get it from an ISV??
You can log onto one of the nodes, attach a debugger, and see if the
MPI task is waiting for a message (looping in one of the MPI receive
functions)...
Rayson
==
Dear All,
I am having problem with running an application on Dell cluster . The model
starts well but no further progress is shown. It just stuck. I have checked
the systems, no apparent hardware error is there. Other open mpi
applications are running well on the same cluster. I have tried running