Re: [OMPI users] About debugging and asynchronous communication

2014-09-19 Thread Nick Papior Andersen
Well, if it is the next message then I guess you have a bug, your counter is not consistent. I am pretty sure the error is on your side, I do something similar but have never experienced anything like that. :) 2014-09-19 3:21 GMT+02:00 XingFENG : > Thanks for your advice. I added tags for message

Re: [OMPI users] File locking in ADIO, OpenMPI 1.6.4

2014-09-19 Thread Edgar Gabriel
On 9/18/2014 9:12 PM, Rob Latham wrote: On 09/18/2014 04:56 PM, Beichuan Yan wrote: Rob, Thank you very much for the suggestion. There are two independent scenarios using parallel IO in my code: 1. MPI processes conditionally print, i.e., some processes print in current loop (but may not p

[OMPI users] Can "timeout" be implemented in OpenMPI?

2014-09-19 Thread Gan, Qi PW
Hi all, I have a question about set timeout limit for MPI data transmissions. Our users run their parallel jobs (with openmpi) on our HPC cluster. Sometimes the job may hang due to unknown reason. In such case a job is still in "RUN" status, all processes of this job are running. But not outpu

Re: [OMPI users] Can "timeout" be implemented in OpenMPI?

2014-09-19 Thread George Bosilca
1. It is extremely unlike to have a broken MPI communication pipe. Use a parallel debugger to validate your communication pattern is correct. I would suspect a deadlock due to an incomplete communication pattern more than a broken communication pipe. 2. Nope, you cant set timeouts on MPI calls. Th