[OMPI users] Error in executing NAS Benchmarks
Hi, I am trying to execute NAS benchmark across 2 Nodes, each having 4 cores. The execution works fine on a single node, but when I try to execute the benchmark across 2 Nodes, I get an error like: mpiexec -machinefile hostfile.txt -n 8 ./ep.A.8 bash: orted: command not found -- A daemon (pid 22973) died unexpectedly with status 127 while attempting to launch so we are aborting. There may be more information reported by the environment (see above). This may be because the daemon was unable to find all the needed shared libraries on the remote node. You may set your LD_LIBRARY_PATH to have the location of the shared libraries on the remote nodes and this will automatically be forwarded to the remote nodes. -- -- mpiexec noticed that the job aborted, but has no info as to the process that caused that situation. -- mpiexec: clean termination accomplished Can anybody please suggest the reason for this. Thanks, Vaibhav
[OMPI users] Error in Binding MPI Process to a socket
Hi, I am trying to perform an experiment in which I can spawn 2 MPI processes, one on each socket in a 4 core node having 2 dual cores. I used the option "bind to socket" which mpirun for that but I am getting an error like: An attempt was made to bind a process to a specific hardware topology mapping (e.g., binding to a socket) but the operating system does not support such topology-aware actions. Talk to your local system administrator to find out if your system can support topology-aware functionality (e.g., Linux Kernels newer than v2.6.18). Systems that do not support processor topology-aware functionality cannot use "bind to socket" and other related functionality. Can anybody please tell me what is this error about. Is there any other option than "bind to socket" that I can use. Thanks.
Re: [OMPI users] Error in Binding MPI Process to a socket
Hi, Thanks for your reply. I tried to execute first a process by using mpirun -machinefile hostfile.txt --slot-list 0:1 -np 1 but it gives the same as error as mentioned previously. Then, I created a rankfile with contents" rank 0=t1.tools.xxx slot=0:0 rank 1=t1.tools.xxx slot=1:0. and the used command mpirun -machinefile hostfile.txt --rankfile my_rankfile.txt -np 2 but ended up getting same error. Is there any patch that I can install in my system to make it topology aware? Thanks On Thu, Mar 17, 2011 at 2:05 PM, Ralph Castain wrote: > The error is telling you that your OS doesn't support queries telling us > what cores are on which sockets, so we can't perform a "bind to socket" > operation. You can probably still "bind to core", so if you know what cores > are in which sockets, then you could use the rank_file mapper to assign > processes to groups of cores in a socket. > > It's just that we can't do it automatically because the OS won't give us > the required info. > > See "mpirun -h" for more info on slot lists. > > On Mar 17, 2011, at 11:26 AM, vaibhav dutt wrote: > > > Hi, > > > > I am trying to perform an experiment in which I can spawn 2 MPI > processes, one on each socket in a 4 core node > > having 2 dual cores. I used the option "bind to socket" which mpirun for > that but I am getting an error like: > > > > An attempt was made to bind a process to a specific hardware topology > > mapping (e.g., binding to a socket) but the operating system does not > > support such topology-aware actions. Talk to your local system > > administrator to find out if your system can support topology-aware > > functionality (e.g., Linux Kernels newer than v2.6.18). > > > > Systems that do not support processor topology-aware functionality cannot > > use "bind to socket" and other related functionality. > > > > > > Can anybody please tell me what is this error about. Is there any other > option than "bind to socket" > > that I can use. > > > > Thanks. > > ___ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users >
Re: [OMPI users] Error in Binding MPI Process to a socket
2.6.9-78.0.17.ELpapismp #1 SMP Tue Apr 7 13:14:04 CDT 2009 x86_64 x86_64 x86_64 GNU/Linux On Thu, Mar 17, 2011 at 2:55 PM, Ralph Castain wrote: > What OS version is it? > > uname -a > > will tell you, if you are on linux. > > On Mar 17, 2011, at 1:31 PM, vaibhav dutt wrote: > > Hi, > > Thanks for your reply. I tried to execute first a process by using > > mpirun -machinefile hostfile.txt --slot-list 0:1 -np 1 > > but it gives the same as error as mentioned previously. > > Then, I created a rankfile with contents" > > rank 0=t1.tools.xxx slot=0:0 > rank 1=t1.tools.xxx slot=1:0. > > and the used command > > mpirun -machinefile hostfile.txt --rankfile my_rankfile.txt -np 2 > > but ended up getting same error. Is there any patch that I can install in > my system to make it > topology aware? > > Thanks > > > On Thu, Mar 17, 2011 at 2:05 PM, Ralph Castain wrote: > >> The error is telling you that your OS doesn't support queries telling us >> what cores are on which sockets, so we can't perform a "bind to socket" >> operation. You can probably still "bind to core", so if you know what cores >> are in which sockets, then you could use the rank_file mapper to assign >> processes to groups of cores in a socket. >> >> It's just that we can't do it automatically because the OS won't give us >> the required info. >> >> See "mpirun -h" for more info on slot lists. >> >> On Mar 17, 2011, at 11:26 AM, vaibhav dutt wrote: >> >> > Hi, >> > >> > I am trying to perform an experiment in which I can spawn 2 MPI >> processes, one on each socket in a 4 core node >> > having 2 dual cores. I used the option "bind to socket" which mpirun >> for that but I am getting an error like: >> > >> > An attempt was made to bind a process to a specific hardware topology >> > mapping (e.g., binding to a socket) but the operating system does not >> > support such topology-aware actions. Talk to your local system >> > administrator to find out if your system can support topology-aware >> > functionality (e.g., Linux Kernels newer than v2.6.18). >> > >> > Systems that do not support processor topology-aware functionality >> cannot >> > use "bind to socket" and other related functionality. >> > >> > >> > Can anybody please tell me what is this error about. Is there any other >> option than "bind to socket" >> > that I can use. >> > >> > Thanks. >> > ___ >> > users mailing list >> > us...@open-mpi.org >> > http://www.open-mpi.org/mailman/listinfo.cgi/users >> >> >> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users >> > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users >
[OMPI users] HWLOC problem
Hi, I have installed HWLOC 1.2 on my cluster , each node has two Intel Xeon E5450 quad cores. When I try to execute the command "lstopo" to determine the hardware topology of my system, I get an error like: ./lstopo: error while loading shared libraries: libhwloc.so.3: cannot open shared object file: No such file or directory Can anyone please help me as to what is the reason for this error and where can I find this shared library. Thanks.
[OMPI users] Polling and Interrupt
Hi all, Can anybody tell me how to enable the polling and interrupt/blocking execution in OpenMPI? Thanks
[OMPI users] Problem in using openmpi
Hi, I have installed openmpi on an Kubuntu , with Dual core Linux AMD Athlon When trying to compile a simple program, I am getting an error. mpicc: error while loading shared libraries: libopen-pal.so.0: cannot open shared object file: No such file or dir I read somewhere that this error is because of some intel compiler being not installed on the proper node, which I don't understand as I am using AMD. Kindly give your suggestions Thank You
Re: [OMPI users] Problem in using openmpi
Hi, I used the export command but it does not seem to work.It's still giving the same error as the lib64 directory does not exist in the ompi folder. Any Suggestions. Thank You, Vaibhav On Fri, Mar 12, 2010 at 3:05 PM, Fernando Lemos wrote: > On Fri, Mar 12, 2010 at 6:02 PM, Samuel K. Gutierrez > wrote: > > One more thing. The line should have been: > > > > export LD_LIBRARY_PATH=/home/jess/local/ompi/lib64 > > > > The space in the previous email will make bash unhappy 8-|. > > > > -- > > Samuel K. Gutierrez > > Los Alamos National Laboratory > > > > On Mar 12, 2010, at 1:56 PM, Samuel K. Gutierrez wrote: > > > >> Hi, > >> > >> It sounds like you may need to set your LD_LIBRARY_PATH environment > >> variable correctly. There are several ways that you can tell the > dynamic > >> linker where the required libraries are located, but the following may > be > >> sufficient for your needs. > >> > >> Let's say, for example, that your Open MPI installation is rooted at > >> /home/jess/local/ompi and the libraries are located in > >> /home/jess/local/ompi/lib64, try (bash-like shell): > >> > >> export LD_LIBRARY_PATH= /home/jess/local/ompi/lib64 > >> > >> Hope this helps, > >> > >> -- > >> Samuel K. Gutierrez > >> Los Alamos National Laboratory > >> > >> On Mar 12, 2010, at 1:32 PM, vaibhav dutt wrote: > >> > >>> Hi, > >>> > >>> I have installed openmpi on an Kubuntu , with Dual core Linux AMD > Athlon > >>> When trying to compile a simple program, I am getting an error. > >>> > >>> mpicc: error while loading shared libraries: libopen-pal.so.0: cannot > >>> open shared object file: No such file or dir > >>> > >>> I read somewhere that this error is because of some intel compiler > >>> being not installed on the proper node, which I don't understand as I > >>> am using AMD. > >>> > >>> Kindly give your suggestions > >>> > >>> Thank You > > It's probably a packaging error, if he used the distribution's > packages. In that case, he should report the bug to downstream. > > If he installed from source, then it's most likely installed somewhere > outside the library path, and the LD_LIBRARY_PATH trick might work (if > it doesn't, make sure there are no leftovers, recompile, reinstall and > it should work fine). > > > Regards, > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users >
[OMPI users] MPI Literature Survey on Multicores
Hi all, I am doing a literature survey on MPI optimizations on Multi Cores and was searching for some good papers.I have got some papers on MPI Intra Node Data Transfer. Can anybody please suggest me how to go about it I mean how can I organize the Survey and also some good source of papers on the topics. Thank You,