Re: [OMPI users] Failure detection

2015-11-07 Thread Cristian Camilo Ruiz Sanabria
installed through Debian packages. - Mail original - > De: "Ralph Castain" > À: "Open MPI Users" > Envoyé: Samedi 7 Novembre 2015 17:22:28 > Objet: Re: [OMPI users] Failure detection > > No, that certainly isn’t the normal behavior. I suspect it has

Re: [OMPI users] Failure detection

2015-11-07 Thread Ralph Castain
No, that certainly isn’t the normal behavior. I suspect it has to do with the nature of the VM TCP connection, though there is something very strange about your output. The BTL message indicates that an MPI job is already running. Yet your subsequent ORTE error message indicates we are still try

[OMPI users] Failure detection

2015-11-07 Thread Cristian RUIZ
Hello, I was studying how OpenMPI reacts to failures. I have a virtual infrastructure where failures can be emulated by turning off a given VM. Depending on the way the VM is turned off the 'mpirun' will be notified, either because it receives a signal or because some timeout is reached. In bo