Re: [OMPI users] fault tolerance in open mpi

2009-12-24 Thread vipin kumar
09, Ralph Castain wrote: > That's just OMPI's default behavior - as Josh said, we are working towards > allowing other behaviors, but for now, this is what we have. > > > On Dec 23, 2009, at 5:40 AM, vipin kumar wrote: > >> Thank you Ralph, >> >> I did as you

Re: [OMPI users] fault tolerance in open mpi

2009-12-23 Thread vipin kumar
connect is a -collective- operation. Both parent and child have to call > it. Your child process is "hanging" while it waits for the parent. > > On Dec 21, 2009, at 1:37 AM, vipin kumar wrote: > > Hello folks, > > As I explained my problem earlier, I am looking for Fau

Re: [OMPI users] fault tolerance in open mpi

2009-12-21 Thread vipin kumar
ll be sure to email the list once this work becomes more stable and > available. > > -- Josh > > > On Sep 18, 2009, at 2:56 AM, vipin kumar wrote: > > Hi Josh, >> >> It is good to hear from you that work is in progress towards resiliency of >> Open-MPI. I

Re: [OMPI users] how to SPMD on openmpi

2009-10-01 Thread vipin kumar
Hi Ankur, try this command, $ mpirun -np 2 -host firstHostIp,secondHostIp a.out for details read manual page for "mpirun". $ man mpirun Regards, On Wed, Sep 30, 2009 at 3:22 PM, ankur pachauri wrote: > Dear all, > > I have been able to install open mpi on tw

Re: [OMPI users] fault tolerance in open mpi

2009-09-18 Thread vipin kumar
g the failed >> blade. >> >> Durga >> >> On Mon, Aug 3, 2009 at 9:21 AM, jody wrote: >> >>> Hi >>> >>> I guess "task-farming" could give you a certain amount of the kind of >>> fault-tolerance you want. >>> (i.e.

Re: [OMPI users] Bus Error (7) on PS3 running HPL (OpenMPI 1.2.8)

2009-08-07 Thread vipin kumar
may be because of insufficient storage space...???( I mean hard disk space.) On Thu, Aug 6, 2009 at 11:23 PM, Jeff Squyres wrote: > Any chance you could re-try the experiment with Open MPI 1.3.3? > > > On Aug 4, 2009, at 11:10 AM, Hoelzlwimmer Andreas - S0810595005 wrote: > > Hello,

Re: [OMPI users] fault tolerance in open mpi

2009-08-03 Thread vipin kumar
> other processes as well) > > Durga > > On Thu, Jul 9, 2009 at 4:57 AM, vipin kumar wrote: > > > > Hi all, > > > > I want to know whether open mpi supports Network and process fault > tolerance > > or not? If there is any example demonstrating these

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
> You don't specify and based on your description I infer that you are not > using a batch/queueing system, but just a rsh/ssh based start-up mechanism. You are absolutely correct. I am using rsh/ssh based start-up mechanism. A batch/queueing system might be able to tell you whether a remote co

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
Thank you all Jeff, Jody, Prentice and Bogdan for your invaluable clarification, solution and suggestion, Open MPI should return a failure if TCP connectivity is lost, even with a > non-blocking point-to-point operation. The failure should be returned in > the call to MPI_TEST (and friends). ev

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
On Thu, Jul 23, 2009 at 3:03 PM, Ralph Castain wrote: > It depends on which network fails. If you lose all TCP connectivity, Open > MPI should abort the job as the out-of-band system will detect the loss of > connection. If you only lose the MPI connection (whether TCP or some other > interconnec

Re: [OMPI users] Network connection check

2009-07-23 Thread vipin kumar
> > Are you asking to find out this information before issuing "mpirun"? Open > MPI does assume that the nodes you are trying to use are reachable. > > NO, Scenario is a pair of processes are running one in "master" node say "masterprocess" and one in "slave" node say "slaveprocess". When "maste

Re: [OMPI users] Network connection check

2009-07-22 Thread vipin kumar
purposes. That may be the same (or not) from the > resolved name that comes back for a given IP interface. > > What are you trying to check, exactly? > > > > On Jul 16, 2009, at 1:56 AM, vipin kumar wrote: > > Hi all, >> >> Is there any way to check network con

[OMPI users] Network connection check

2009-07-16 Thread vipin kumar
Hi all, Is there any way to check network connection using HostName in OpenMPI ? Thanks and Regards, -- Vipin K. Research Engineer, C-DOTB, India

Re: [OMPI users] Spawning processes through MPI::Intracomm::Spawn_multiple

2009-07-15 Thread vipin kumar
result in foo being > launched on mike, without mpirun having previous knowledge of "mike". > > You'll have to wait to try it, though, until another problem in comm_spawn > on the trunk gets fixed - hopefully, real soon (someone is working on it). > > HTH > Ralph >

[OMPI users] fault tolerance in open mpi

2009-07-09 Thread vipin kumar
Hi all, I want to know whether open mpi supports Network and process fault tolerance or not? If there is any example demonstrating these features that will be best. Regards, -- Vipin K. Research Engineer, C-DOTB, India

Re: [OMPI users] Spawning processes through MPI::Intracomm::Spawn_multiple

2009-07-02 Thread vipin kumar
command to use those features. Thanks, Regards, On Thu, Jul 2, 2009 at 12:46 PM, vipin kumar wrote: > Hi Ralph, > > I am thankful to your reply regarding the matter, however to carry forward > with my activities it would be of great help if I can know where > OpenMpi/mpirun holds the c

Re: [OMPI users] Spawning processes through MPI::Intracomm::Spawn_multiple

2009-07-02 Thread vipin kumar
, I can implement those features pretty easily. Could get to it in > the next few days - otherwise, it will have to wait another week as I will > be out. > > Either way, though, it would only be in the OMPI devel trunk until the next > feature series (1.5) has its first release. Haven&#x

Re: [OMPI users] Spawning processes through MPI::Intracomm::Spawn_multiple

2009-07-02 Thread vipin kumar
assuming that I have only local machine where I will invoke launcher program. When one remote node gets added launcher program will launch one "slave" in local machine and one "rslave" in remote machine. Is there any other way to do this. Thanks. Regards, On Mon, Jun 29, 2

Re: [OMPI users] Some Newbie questions

2009-06-29 Thread vipin kumar
epo) , and I didn't see the option > "--preload-binary" , is it because the lower version? > > Bets regards, > umanga. > > > > vipin kumar wrote: > > Hi, > I am not expert, I am user like you but I think I can help you, > > Q. After installing Ope

[OMPI users] Spawning processes through MPI::Intracomm::Spawn_multiple

2009-06-29 Thread vipin kumar
Hi all, I am trying to spawn dynamically processes in remote as well as local machine using MPI::Intracomm::Spawn_multiple. I am using MPI::Info object to tell where to launch processes using "host" key and "" as its value but but I am getting the error --

Re: [OMPI users] Some Newbie questions

2009-06-29 Thread vipin kumar
Hi, I am not expert, I am user like you but I think I can help you, Q. After installing OpenMPI on each machine ,do i need to run a service/daemon on each machine? Ans. No, not at all, Open MPI takes care of that for you. Q. How does peers in MPI environment communicate ? Ans. Using Communicator(