Re: [OMPI users] Network connection check

2009-07-27 Thread Jeff Squyres
On Jul 23, 2009, at 10:41 AM, Durga Choudhury wrote:

> The 'system' command will fork a separate process to run. If I remember correctly, forking within MPI can lead to undefined behavior. Can someone in OpenMPI development team clarify?

It depends on your network, but in general, we don't reco

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
> You don't specify and based on your description I infer that you are not
> using a batch/queueing system, but just a rsh/ssh based start-up mechanism.

You are absolutely correct. I am using rsh/ssh based start-up mechanism.

A batch/queueing system might be able to tell you whether a remote co

Re: [OMPI users] Network connection check

2009-07-23 Thread Durga Choudhury
The 'system' command will fork a separate process to run. If I remember correctly, forking within MPI can lead to undefined behavior. Can someone in OpenMPI development team clarify? What I don't understand is: why is your TCP network so unstable that you are worried about reachability? For MPI to
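Given the concern about forking from inside an MPI process, one alternative is to probe the remote host in-process with a plain TCP connect that times out. The following is only a sketch under stated assumptions, not something proposed in the thread: the function name `tcp_reachable` is hypothetical, and it assumes the slave machine has some TCP service listening on a known port (for example sshd on port 22, since the job is started over rsh/ssh). It checks TCP reachability of that port, which is not the same as an ICMP ping, and it resolves the host name on every call, so a changed IP address is picked up.

```c
#define _POSIX_C_SOURCE 200112L
#include <errno.h>
#include <fcntl.h>
#include <netdb.h>
#include <poll.h>
#include <string.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper (not part of Open MPI).
 * Returns  1 if a TCP connection to host:port succeeds within timeout_ms,
 *          0 if the name resolves but no address accepts the connection,
 *         -1 if the host name or port cannot be resolved. */
int tcp_reachable(const char *host, const char *port, int timeout_ms)
{
    struct addrinfo hints, *res, *ai;
    int reachable = 0;

    memset(&hints, 0, sizeof hints);
    hints.ai_family = AF_UNSPEC;      /* IPv4 or IPv6 */
    hints.ai_socktype = SOCK_STREAM;  /* TCP */
    if (getaddrinfo(host, port, &hints, &res) != 0)
        return -1;

    /* Try each resolved address until one connects. */
    for (ai = res; ai != NULL && !reachable; ai = ai->ai_next) {
        int fd = socket(ai->ai_family, ai->ai_socktype, ai->ai_protocol);
        if (fd < 0)
            continue;

        /* Non-blocking mode so that connect() cannot hang indefinitely. */
        int flags = fcntl(fd, F_GETFL, 0);
        if (flags < 0 || fcntl(fd, F_SETFL, flags | O_NONBLOCK) < 0) {
            close(fd);
            continue;
        }

        if (connect(fd, ai->ai_addr, ai->ai_addrlen) == 0) {
            reachable = 1;            /* connected immediately */
        } else if (errno == EINPROGRESS) {
            /* Wait for the handshake to finish, then read its result. */
            struct pollfd pfd = { .fd = fd, .events = POLLOUT };
            int err = 0;
            socklen_t len = sizeof err;
            if (poll(&pfd, 1, timeout_ms) == 1 &&
                getsockopt(fd, SOL_SOCKET, SO_ERROR, &err, &len) == 0 &&
                err == 0)
                reachable = 1;
        }
        close(fd);
    }

    freeaddrinfo(res);
    return reachable;
}
```

A call such as `tcp_reachable(sHostName, "22", 2000)` would then report whether the slave's ssh port answered within two seconds. A refused connection returns 0 even though the machine itself is up, so the port has to be one that is known to be open.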

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
Thank you all, Jeff, Jody, Prentice and Bogdan, for your invaluable clarifications, solutions and suggestions.

> Open MPI should return a failure if TCP connectivity is lost, even with a
> non-blocking point-to-point operation. The failure should be returned in
> the call to MPI_TEST (and friends).

ev

Re: [OMPI users] Network connection check

2009-07-23 Thread Bogdan Costescu
On Thu, 23 Jul 2009, vipin kumar wrote:

> 1: Whether the slave machine is reachable or not (how will I do that? Given: I have the IP address and host name of the slave machine.)
> 2: If reachable, check whether the programs (orted and "slaveprocess") are alive or not.

You don't specify and based on your description

Re: [OMPI users] Network connection check

2009-07-23 Thread Prentice Bisbal
Jeff Squyres wrote:
> On Jul 22, 2009, at 10:05 AM, vipin kumar wrote:
>
>> Actually requirement is how a C/C++ program running in "master" node
>> should find out whether "slave" node is reachable (as we check this
>> using "ping" command) or not ? Because IP address may change at any
>> time, t

Re: [OMPI users] Network connection check

2009-07-23 Thread jody
Maybe you could make a system call to ping the other machine.

    char sCommand[512];
    // build the command string
    sprintf(sCommand, "ping -c %d -q %s > /dev/null", numPings, sHostName);
    // execute the command
    int iResult = system(sCommand);

If the ping was successful, iResult will h
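A slightly more defensive variant of the ping approach might look like the sketch below. It makes several assumptions that are not in the original message: the function names are hypothetical, the host name is checked against a small whitelist of characters before it is pasted into a shell command, `snprintf` replaces `sprintf` so that a long host name cannot overflow the buffer, and the value returned by `system()` is decoded with `WIFEXITED`/`WEXITSTATUS` instead of being compared directly. The `-c` and `-q` flags are the ones used above and are assumed to be supported by the local `ping`. As noted elsewhere in the thread, `system()` forks, which may be a problem inside an MPI process.

```c
#include <ctype.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/wait.h>

/* Accept only characters that can appear in a host name or IP address,
 * so the string is safe to paste into a shell command line. */
static int hostname_is_safe(const char *host)
{
    if (host == NULL || *host == '\0' || *host == '-')
        return 0;
    for (const char *p = host; *p != '\0'; ++p) {
        if (!isalnum((unsigned char)*p) && strchr(".-:", *p) == NULL)
            return 0;
    }
    return 1;
}

/* Build the ping command in buf. Returns 0 on success, -1 if the host
 * name is rejected, num_pings < 1, or the command does not fit. */
int build_ping_command(char *buf, size_t size, const char *host, int num_pings)
{
    if (!hostname_is_safe(host) || num_pings < 1)
        return -1;
    int n = snprintf(buf, size, "ping -c %d -q %s > /dev/null 2>&1",
                     num_pings, host);
    return (n < 0 || (size_t)n >= size) ? -1 : 0;
}

/* Returns 1 if ping exited with status 0, 0 if it exited with any other
 * status, -1 if the command could not be built or run. */
int host_answers_ping(const char *host, int num_pings)
{
    char cmd[512];
    if (build_ping_command(cmd, sizeof cmd, host, num_pings) != 0)
        return -1;

    int status = system(cmd);         /* forks a shell; see caveat above */
    if (status == -1 || !WIFEXITED(status))
        return -1;
    return WEXITSTATUS(status) == 0 ? 1 : 0;
}
```

With this in place, `host_answers_ping(sHostName, numPings)` returns 1 only when `ping` itself reported success, and a host name such as `"node01; rm -rf /"` is rejected before any shell is started.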

Re: [OMPI users] Network connection check

2009-07-23 Thread Jeff Squyres
On Jul 23, 2009, at 7:36 AM, vipin kumar wrote:

> I can't use blocking communication routines in my main program ("masterprocess") because any type of network failure (whether due to physical connectivity, TCP connectivity, or the MPI connection, as you said) may occur. So I am using non blocking

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
On Thu, Jul 23, 2009 at 3:03 PM, Ralph Castain wrote:

> It depends on which network fails. If you lose all TCP connectivity, Open
> MPI should abort the job as the out-of-band system will detect the loss of
> connection. If you only lose the MPI connection (whether TCP or some other
> interconnec

Re: [OMPI users] Network connection check

2009-07-23 Thread Ralph Castain
It depends on which network fails. If you lose all TCP connectivity, Open MPI should abort the job as the out-of-band system will detect the loss of connection. If you only lose the MPI connection (whether TCP or some other interconnect), then I believe the system will eventually generate a

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
> Are you asking to find out this information before issuing "mpirun"? Open
> MPI does assume that the nodes you are trying to use are reachable.

No. The scenario is that a pair of processes is running: one on the "master" node, say "masterprocess", and one on the "slave" node, say "slaveprocess". When "maste

Re: [OMPI users] Network connection check

2009-07-22 Thread Jeff Squyres
On Jul 22, 2009, at 10:05 AM, vipin kumar wrote:

> Actually requirement is how a C/C++ program running in "master" node should find out whether "slave" node is reachable (as we check this using "ping" command) or not ? Because IP address may change at any time, that's why I am trying to achie

Re: [OMPI users] Network connection check

2009-07-22 Thread vipin kumar
Hi Jeff,

Thanks for your response. Actually, the requirement is how a C/C++ program running on the "master" node should find out whether the "slave" node is reachable (as we check this using the "ping" command) or not. Because the IP address may change at any time, I am trying to achieve this using "host

Re: [OMPI users] Network connection check

2009-07-22 Thread Jeff Squyres
I'm not sure what you mean. Open MPI uses the hostname of the machine for general identification purposes. That may or may not be the same as the resolved name that comes back for a given IP interface. What are you trying to check, exactly?

On Jul 16, 2009, at 1:56 AM, vipin kumar wrote:

[OMPI users] Network connection check

2009-07-16 Thread vipin kumar
Hi all,

Is there any way to check the network connection using a hostname in Open MPI?

Thanks and Regards,
--
Vipin K.
Research Engineer, C-DOTB, India