Re: [OMPI users] Question on run-time error "ORTE was unable to reliably start"

2016-07-28 Thread Ralph Castain
What kind of system was this on? ssh, slurm, ...?
> On Jul 28, 2016, at 1:55 PM, Blosch, Edwin L wrote:
>
> I am running cases that are starting just fine and running for a few hours,
> then they die with a message that seems like a startup type of failure.
> Message shown below. The messag

[OMPI users] Question on run-time error "ORTE was unable to reliably start"

2016-07-28 Thread Blosch, Edwin L
I am running cases that are starting just fine and running for a few hours, then they die with a message that seems like a startup type of failure. Message shown below. The message appears in standard output from the rank 0 process. I'm assuming there is a failing card or port or something. What