Can you try running invoking mpirun with its full path instead ? e.g. /usr/local/bin/mpirun instead of mpirun
Cheers, Gilles On Sunday, August 2, 2015, abhisek Mondal <abhisek.m...@gmail.com> wrote: > Here is the other details, > > a. The Openmpi version is 1.6.4 > > b. The error as being generated is : > *Warning: Permanently added 'cx0937,10.1.4.1' (RSA) to the list of known > hosts.* > *Warning: Permanently added 'cx0934,10.1.3.255' (RSA) to the list of known > hosts.* > *orted: Command not found.* > *orted: Command not found.* > > *--------------------------------------------------------------------------* > *A daemon (pid 53580) died unexpectedly with status 1 while attempting* > *to launch so we are aborting.* > > *There may be more information reported by the environment (see above).* > > *This may be because the daemon was unable to find all the needed shared* > *libraries on the remote node. You may set your LD_LIBRARY_PATH to have > the* > *location of the shared libraries on the remote nodes and this will* > *automatically be forwarded to the remote nodes.* > > *--------------------------------------------------------------------------* > > *--------------------------------------------------------------------------* > *mpirun noticed that the job aborted, but has no info as to the process* > *that caused that situation.* > > *--------------------------------------------------------------------------* > > > I'm not being able to understand why "command not found" error is being > raised. > Thank you. > > On Sun, Aug 2, 2015 at 1:43 AM, Ralph Castain <r...@open-mpi.org > <javascript:_e(%7B%7D,'cvml','r...@open-mpi.org');>> wrote: > >> Would you please tell us: >> >> (a) what version of OMPI you are using >> >> (b) what error message you are getting when the job terminates >> >> >> On Aug 1, 2015, at 12:22 PM, abhisek Mondal <abhisek.m...@gmail.com >> <javascript:_e(%7B%7D,'cvml','abhisek.m...@gmail.com');>> wrote: >> >> I'm working on an openmpi enabled cluster. I'm trying to run a job with 2 >> different nodes and 16 processors per nodes. >> Using this command: >> >> *mpirun -np 32 --hostfile myhostfile -loadbalance exe* >> >> The contents of myhostfile: >> >> *cx0937 slots=16 * >> *cx0934 slots=16* >> >> >> But the job is getting terminated each time before job allocation happens >> as per desired way. >> >> So, it'll very nice if I get some suggestions regarding the facts I'm >> missing. >> >> Thank you >> >> -- >> Abhisek Mondal >> >> *Research Fellow* >> >> *Structural Biology and Bioinformatics* >> *Indian Institute of Chemical Biology* >> >> *Kolkata 700032* >> >> *INDIA* >> _______________________________________________ >> users mailing list >> us...@open-mpi.org <javascript:_e(%7B%7D,'cvml','us...@open-mpi.org');> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> Searchable archives: >> http://www.open-mpi.org/community/lists/users/2015/08/27367.php >> >> >> >> _______________________________________________ >> users mailing list >> us...@open-mpi.org <javascript:_e(%7B%7D,'cvml','us...@open-mpi.org');> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> Link to this post: >> http://www.open-mpi.org/community/lists/users/2015/08/27367.php >> > > > > -- > Abhisek Mondal > > *Research Fellow* > > *Structural Biology and Bioinformatics* > *Indian Institute of Chemical Biology* > > *Kolkata 700032* > > *INDIA* >