Thanks a lot Gus for your help. Although I used stage_in/stage_out features before, I found NFS mounting much better and cleaner.
Best Regards. ~Belaid. > Date: Tue, 1 Dec 2009 14:55:53 -0500 > From: g...@ldeo.columbia.edu > To: us...@open-mpi.org > Subject: Re: [OMPI users] Elementary question on openMPI application location > when using PBS submission > > Hi Belaid Moa > > Belaid MOA wrote: > > Thank you very very much Gus. Does this mean that OpenMPI does not copy > > the executable from the master node to the worker nodes? > > Not that I know. > Making the executable available on the nodes, and any > input files the program may need, is the user's responsibility, > not of mpiexec. > > On the other hand, > Torque/PBS has a "stage_in/stage_out" feature that is supposed to > copy files over to the nodes, if you want to give it a shot. > See "man qsub" and look into the (numerous) "-W" option under > the "stage[in,out]=file_list" sub-options. > This is a relic from the old days where everything had to be on > local disks on the nodes, and NFS ran over Ethernet 10/100, > but it is still used by people that > run MPI programs with heavy I/O, to avoid pounding on NFS or > even on parallel file systems. > I tried the stage_in/out feature a loooong time ago, > (old PBS before Torque), but it had issues. > It probably works now with the newer/better > versions of Torque. > > However, the easy way to get this right is just to use an NFS mounted > directory. > > > If that's case, I will go ahead and NFS mount my working directory. > > > > This would make your life much easier. > > My $0.02. > Gus Correa > --------------------------------------------------------------------- > Gustavo Correa > Lamont-Doherty Earth Observatory - Columbia University > Palisades, NY, 10964-8000 - USA > --------------------------------------------------------------------- > > > > > > ~Belaid. > > > > > > > Date: Tue, 1 Dec 2009 13:50:57 -0500 > > > From: g...@ldeo.columbia.edu > > > To: us...@open-mpi.org > > > Subject: Re: [OMPI users] Elementary question on openMPI application > > location when using PBS submission > > > > > > Hi Belaid MOA > > > > > > See this FAQ: > > > > > http://www.open-mpi.org/faq/?category=running#do-i-need-a-common-filesystem > > > http://www.open-mpi.org/faq/?category=building#where-to-install > > > http://www.open-mpi.org/faq/?category=tm#tm-obtain-host > > > > > > Your executable needs to be on a directory that is accessible > > > by all nodes in your node pool. > > > An easy way to achieve this is to put it in a directory that > > > is NFS mounted on all nodes, and launch your pbs script from there. > > > > > > A less convenient alternative, if no NFS directory is available, > > > is to copy the executable over to the nodes. > > > > > > I also find it easier to write a PBS script instead of putting > > > all the PBS directives in the command line. > > > In this case you can put the lines below in your PBS script, > > > to ensure all nodes will be on your work directory (cd $PBS_O_WORKDIR): > > > > > > ######## > > > > > > #PBS ... (PBS directives) > > > ... > > > cd $PBS_O_WORKDIR > > > mpiexec -n .... > > > > > > ######## > > > > > > IIRR, by default Torque/PBS puts you in your home directory on > > > the nodes, which may or may not be the location of your executable. > > > > > > I hope this helps, > > > Gus Correa > > > --------------------------------------------------------------------- > > > Gustavo Correa > > > Lamont-Doherty Earth Observatory - Columbia University > > > Palisades, NY, 10964-8000 - USA > > > --------------------------------------------------------------------- > > > > > > Belaid MOA wrote: > > > > Hello everyone, > > > > I am new to this list and I have a very elementary question: > > suppose we > > > > have three machines, HN (Head Node hosting the pbs server), WN1 (A > > > > worker node) and WN (another worker node). The PBS nodefile has WN1 > > and > > > > WN2 in it (DOES NOT HAVE HN). > > > > My openMPI program (hello) and PBS script(my_script.sh) reside on the > > > > HN. When I submit my PBS script using qsub -l nodes=2 my_script.sh, I > > > > get the following error: > > > > > > > > > > -------------------------------------------------------------------------- > > > > mpirun was unable to launch the specified application as it could not > > > > find an executable: > > > > > > > > Executable: hello > > > > Node: WN2 > > > > > > > > while attempting to start process rank 0. > > > > > > -------------------------------------------------------------------------- > > > > > > > > How come my hello program is not copied automatically to the worker > > > > nodes? This leads to my elementary question: > > > > where the application should be when using a PBS submission? > > > > > > > > Note that when I run mpirun from HN with machinefile containing WN1 > > and > > > > WN2, I get the right output. > > > > > > > > Any help on this is very appreciated. > > > > > > > > ~Belaid. > > > > > > > > > > > > > > ------------------------------------------------------------------------ > > > > Windows Live: Keep your friends up to date with what you do online. > > > > <http://go.microsoft.com/?linkid=9691810> > > > > > > > > > > > > > > ------------------------------------------------------------------------ > > > > > > > > _______________________________________________ > > > > users mailing list > > > > us...@open-mpi.org > > > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > > > _______________________________________________ > > > users mailing list > > > us...@open-mpi.org > > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > ------------------------------------------------------------------------ > > Windows Live: Make it easier for your friends to see what you’re up to > > on Facebook. <http://go.microsoft.com/?linkid=9691811> > > > > > > ------------------------------------------------------------------------ > > > > _______________________________________________ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users _________________________________________________________________ Windows Live: Keep your friends up to date with what you do online. http://go.microsoft.com/?linkid=9691815