Re: [OMPI users] openmpi 1.6.3 fails to identify local host if its IP is 127.0.1.1

2013-06-20 Thread Ralph Castain
Been trying to decipher this problem, and think maybe I'm beginning to understand it. Just to clarify: * when you execute "hostname", you get the .local response? * you somewhere have it setup so that 10.x.x.x resolves to , with no ".local" extension? Correct? On Wed, Jun 19, 2013 at 1:17 PM,

Re: [OMPI users] openmpi 1.6.3 fails to identify local host if its IP is 127.0.1.1

2013-06-20 Thread Riccardo Murri
On 20 June 2013 06:33, Ralph Castain wrote: > Been trying to decipher this problem, and think maybe I'm beginning to > understand it. Just to clarify: > > * when you execute "hostname", you get the .local response? Yes: [rmurri@nh64-2-11 ~]$ hostname nh64-2-11.local [rmurri@nh64-2-1

Re: [OMPI users] openmpi 1.6.3 fails to identify local host if its IP is 127.0.1.1

2013-06-20 Thread Riccardo Murri
On 19 June 2013 23:52, Reuti wrote: > Am 19.06.2013 um 22:14 schrieb Riccardo Murri: > >> On 19 June 2013 20:42, Reuti wrote: >>> Am 19.06.2013 um 19:43 schrieb Riccardo Murri : >>> On 19 June 2013 16:01, Ralph Castain wrote: > How is OMPI picking up this hostfile? It isn't being specif

Re: [OMPI users] openmpi 1.6.3 fails to identify local host if its IP is 127.0.1.1

2013-06-20 Thread Lorenzo Donà
Dear all that help me thanks to everyone. I compiled open MPI with all yours advices posted but the error is always the same I'm also able to run the examples found with the package. but really I don't know what can I do to solve the problem. I trust in you to help me. Dearly Lorenzo. Il giorno

Re: [OMPI users] openmpi 1.6.3 fails to identify local host if its IP is 127.0.1.1

2013-06-20 Thread Jeff Squyres (jsquyres)
Er... are you having problems with host IP addresses 127.0.1.1, or did you reply to the wrong thread? I thought you were asking about problems with multiple mpf90's in your PATH, etc. -- not 127.0.1.1 IP address issues. IIRC, there were a bunch of suggestions over on that thread about how to

[OMPI users] error running with mpirun

2013-06-20 Thread Lorenzo Donà
Dear all that help me: THANKS for your patience with me. I was able to compile with open MPI: but now I found this error message running programs copiled with open MPI: A requested component was not found, or was unable to be opened. This means that this component is either not installed or is un

[OMPI users] Detecting Node Failure

2013-06-20 Thread Claire Williams
Hi all, I was wondering if Open-MPI had any way to detect that a node has crashed, rebooted, etc. I am currently trying to integrate my MPI application with Amazon EC2 spot instances, and since spot instances can be terminated at any time, I would like to try to make it so that my application c

Re: [OMPI users] Detecting Node Failure

2013-06-20 Thread Ralph Castain
It should detect and abort - what version are you using? Sent from my iPhone On Jun 20, 2013, at 2:02 PM, Claire Williams wrote: > Hi all, > > I was wondering if Open-MPI had any way to detect that a node has crashed, > rebooted, etc. I am currently trying to integrate my MPI application wit

Re: [OMPI users] Detecting Node Failure

2013-06-20 Thread Claire Williams
Hi Ralph, I'm using 1.4.3. Thanks - Claire From: Ralph Castain To: Claire Williams ; Open MPI Users Sent: Thursday, June 20, 2013 1:59 PM Subject: Re: [OMPI users] Detecting Node Failure It should detect and abort - what version are you using? Sent fro

Re: [OMPI users] Detecting Node Failure

2013-06-20 Thread Ralph Castain
Wow that's ancient - can u up to 1.6 series? Sent from my iPhone On Jun 20, 2013, at 3:05 PM, Claire Williams wrote: > Hi Ralph, > > I'm using 1.4.3. Thanks > > - Claire > > From: Ralph Castain > To: Claire Williams ; Open MPI Users > > Sent: Thursday, June 20, 2013 1:59 PM > Subject: R

Re: [OMPI users] Detecting Node Failure

2013-06-20 Thread Andreas Schäfer
On 14:59 Thu 20 Jun , Ralph Castain wrote: > It should detect and abort - what version are you using? Would it be possible to call MPI_Comm_disconnect() in the case the communicator in question is an intercom -- without having OMPI abort? I'm asking because if we had a possibility to dynamica

Re: [OMPI users] Detecting Node Failure

2013-06-20 Thread Jeff Squyres (jsquyres)
Not at present, no. But you might want to look at a fork of the OMPI code base that was exploring fault resilience issues: http://fault-tolerance.org/ On Jun 20, 2013, at 5:57 PM, Andreas Schäfer wrote: > On 14:59 Thu 20 Jun , Ralph Castain wrote: >> It should detect and abort - wha

Re: [OMPI users] Detecting Node Failure

2013-06-20 Thread Ralph Castain
We will also be supporting that in the developer's trunk fairly soon, and that will appear later on in the 1.9 series. On Thu, Jun 20, 2013 at 4:18 PM, Jeff Squyres (jsquyres) wrote: > Not at present, no. > > But you might want to look at a fork of the OMPI code base that was > exploring fault