[OMPI users] Error in executing NAS Benchmarks

2011-03-04 Thread vaibhav dutt
Hi,


I am trying to execute NAS benchmark across 2 Nodes, each having 4 cores.
The execution works fine on a single node, but when I try to execute the
benchmark across 2 Nodes, I
get an error like:

mpiexec -machinefile hostfile.txt -n 8 ./ep.A.8
bash: orted: command not found
--

A daemon (pid 22973) died unexpectedly with status 127 while attempting
to launch so we are aborting.

There may be more information reported by the environment (see above).

This may be because the daemon was unable to find all the needed shared
libraries on the remote node. You may set your LD_LIBRARY_PATH to have the
location of the shared libraries on the remote nodes and this will
automatically be forwarded to the remote nodes.
--
--
mpiexec noticed that the job aborted, but has no info as to the process
that caused that situation.
--
mpiexec: clean termination accomplished


Can anybody please suggest the reason for this.

Thanks,
Vaibhav


[OMPI users] Error in Binding MPI Process to a socket

2011-03-17 Thread vaibhav dutt
Hi,

I am trying to perform an experiment in which I can spawn 2 MPI processes,
one on each socket in a 4 core node
having 2 dual cores. I used the option  "bind to socket" which mpirun for
that but I am getting an error like:

An attempt was made to bind a process to a specific hardware topology
mapping (e.g., binding to a socket) but the operating system does not
support such topology-aware actions.  Talk to your local system
administrator to find out if your system can support topology-aware
functionality (e.g., Linux Kernels newer than v2.6.18).

Systems that do not support processor topology-aware functionality cannot
use "bind to socket" and other related functionality.


Can anybody please tell me what is this error about. Is there any other
option than "bind to socket"
that I can use.

Thanks.


Re: [OMPI users] Error in Binding MPI Process to a socket

2011-03-17 Thread vaibhav dutt
Hi,

Thanks for your reply. I tried to execute first a process by using

mpirun -machinefile hostfile.txt  --slot-list 0:1   -np 1

but it gives the same as error as mentioned previously.

Then, I created a rankfile with contents"

rank 0=t1.tools.xxx  slot=0:0
rank 1=t1.tools.xxx  slot=1:0.

and the  used command

mpirun -machinefile hostfile.txt --rankfile my_rankfile.txt   -np 2

but ended  up getting same error. Is there any patch that I can install in
my system to make it
topology aware?

Thanks


On Thu, Mar 17, 2011 at 2:05 PM, Ralph Castain  wrote:

> The error is telling you that your OS doesn't support queries telling us
> what cores are on which sockets, so we can't perform a "bind to socket"
> operation. You can probably still "bind to core", so if you know what cores
> are in which sockets, then you could use the rank_file mapper to assign
> processes to groups of cores in a socket.
>
> It's just that we can't do it automatically because the OS won't give us
> the required info.
>
> See "mpirun -h" for more info on slot lists.
>
> On Mar 17, 2011, at 11:26 AM, vaibhav dutt wrote:
>
> > Hi,
> >
> > I am trying to perform an experiment in which I can spawn 2 MPI
> processes, one on each socket in a 4 core node
> > having 2 dual cores. I used the option  "bind to socket" which mpirun for
> that but I am getting an error like:
> >
> > An attempt was made to bind a process to a specific hardware topology
> > mapping (e.g., binding to a socket) but the operating system does not
> > support such topology-aware actions.  Talk to your local system
> > administrator to find out if your system can support topology-aware
> > functionality (e.g., Linux Kernels newer than v2.6.18).
> >
> > Systems that do not support processor topology-aware functionality cannot
> > use "bind to socket" and other related functionality.
> >
> >
> > Can anybody please tell me what is this error about. Is there any other
> option than "bind to socket"
> > that I can use.
> >
> > Thanks.
> > ___
> > users mailing list
> > us...@open-mpi.org
> > http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
> ___
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>


Re: [OMPI users] Error in Binding MPI Process to a socket

2011-03-17 Thread vaibhav dutt
2.6.9-78.0.17.ELpapismp #1 SMP Tue Apr 7 13:14:04 CDT 2009 x86_64 x86_64
x86_64 GNU/Linux


On Thu, Mar 17, 2011 at 2:55 PM, Ralph Castain  wrote:

> What OS version is it?
>
> uname -a
>
> will tell you, if you are on linux.
>
> On Mar 17, 2011, at 1:31 PM, vaibhav dutt wrote:
>
> Hi,
>
> Thanks for your reply. I tried to execute first a process by using
>
> mpirun -machinefile hostfile.txt  --slot-list 0:1   -np 1
>
> but it gives the same as error as mentioned previously.
>
> Then, I created a rankfile with contents"
>
> rank 0=t1.tools.xxx  slot=0:0
> rank 1=t1.tools.xxx  slot=1:0.
>
> and the  used command
>
> mpirun -machinefile hostfile.txt --rankfile my_rankfile.txt   -np 2
>
> but ended  up getting same error. Is there any patch that I can install in
> my system to make it
> topology aware?
>
> Thanks
>
>
> On Thu, Mar 17, 2011 at 2:05 PM, Ralph Castain  wrote:
>
>> The error is telling you that your OS doesn't support queries telling us
>> what cores are on which sockets, so we can't perform a "bind to socket"
>> operation. You can probably still "bind to core", so if you know what cores
>> are in which sockets, then you could use the rank_file mapper to assign
>> processes to groups of cores in a socket.
>>
>> It's just that we can't do it automatically because the OS won't give us
>> the required info.
>>
>> See "mpirun -h" for more info on slot lists.
>>
>> On Mar 17, 2011, at 11:26 AM, vaibhav dutt wrote:
>>
>> > Hi,
>> >
>> > I am trying to perform an experiment in which I can spawn 2 MPI
>> processes, one on each socket in a 4 core node
>> > having 2 dual cores. I used the option  "bind to socket" which mpirun
>> for that but I am getting an error like:
>> >
>> > An attempt was made to bind a process to a specific hardware topology
>> > mapping (e.g., binding to a socket) but the operating system does not
>> > support such topology-aware actions.  Talk to your local system
>> > administrator to find out if your system can support topology-aware
>> > functionality (e.g., Linux Kernels newer than v2.6.18).
>> >
>> > Systems that do not support processor topology-aware functionality
>> cannot
>> > use "bind to socket" and other related functionality.
>> >
>> >
>> > Can anybody please tell me what is this error about. Is there any other
>> option than "bind to socket"
>> > that I can use.
>> >
>> > Thanks.
>> > ___
>> > users mailing list
>> > us...@open-mpi.org
>> > http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>>
>> ___
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>
> ___
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
>
> ___
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>


[OMPI users] HWLOC problem

2011-06-07 Thread vaibhav dutt
Hi,

I have installed HWLOC 1.2 on my cluster , each node has two Intel Xeon
E5450 quad cores.
When I try to execute the command "lstopo" to determine the hardware
topology of my system,
I get an error like:

./lstopo: error while loading shared libraries: libhwloc.so.3: cannot open
shared object file: No such file or directory


Can anyone please help me as to what is the reason for this error and where
can I find this shared
library.

Thanks.


[OMPI users] Polling and Interrupt

2012-05-14 Thread vaibhav dutt
Hi all,

Can anybody tell me how to enable the polling and interrupt/blocking
execution in
OpenMPI?

Thanks


[OMPI users] Problem in using openmpi

2010-03-12 Thread vaibhav dutt
Hi,

I have installed openmpi on an Kubuntu , with Dual core Linux AMD Athlon
When trying to compile a simple program, I am getting an error.

mpicc: error while loading shared libraries: libopen-pal.so.0: cannot
open shared object file: No such file or dir

I read somewhere that this error is because of some intel compiler
being not installed on the proper node, which I don't understand as I
am using AMD.

Kindly give your suggestions

Thank You


Re: [OMPI users] Problem in using openmpi

2010-03-12 Thread vaibhav dutt
Hi,

I used the export command but it does not seem to work.It's still giving the
same error as
the lib64 directory does not exist in the ompi folder.

Any Suggestions.

Thank You,
Vaibhav



On Fri, Mar 12, 2010 at 3:05 PM, Fernando Lemos wrote:

> On Fri, Mar 12, 2010 at 6:02 PM, Samuel K. Gutierrez 
> wrote:
> > One more thing.  The line should have been:
> >
> > export LD_LIBRARY_PATH=/home/jess/local/ompi/lib64
> >
> > The space in the previous email will make bash unhappy 8-|.
> >
> > --
> > Samuel K. Gutierrez
> > Los Alamos National Laboratory
> >
> > On Mar 12, 2010, at 1:56 PM, Samuel K. Gutierrez wrote:
> >
> >> Hi,
> >>
> >> It sounds like you may need to set your LD_LIBRARY_PATH environment
> >> variable correctly.  There are several ways that you can tell the
> dynamic
> >> linker where the required libraries are located, but the following may
> be
> >> sufficient for your needs.
> >>
> >> Let's say, for example, that your Open MPI installation is rooted at
> >> /home/jess/local/ompi and the libraries are located in
> >> /home/jess/local/ompi/lib64, try (bash-like shell):
> >>
> >> export LD_LIBRARY_PATH= /home/jess/local/ompi/lib64
> >>
> >> Hope this helps,
> >>
> >> --
> >> Samuel K. Gutierrez
> >> Los Alamos National Laboratory
> >>
> >> On Mar 12, 2010, at 1:32 PM, vaibhav dutt wrote:
> >>
> >>> Hi,
> >>>
> >>> I have installed openmpi on an Kubuntu , with Dual core Linux AMD
> Athlon
> >>> When trying to compile a simple program, I am getting an error.
> >>>
> >>> mpicc: error while loading shared libraries: libopen-pal.so.0: cannot
> >>> open shared object file: No such file or dir
> >>>
> >>> I read somewhere that this error is because of some intel compiler
> >>> being not installed on the proper node, which I don't understand as I
> >>> am using AMD.
> >>>
> >>> Kindly give your suggestions
> >>>
> >>> Thank You
>
> It's probably a packaging error, if he used the distribution's
> packages. In that case, he should report the bug to downstream.
>
> If he installed from source, then it's most likely installed somewhere
> outside the library path, and the LD_LIBRARY_PATH trick might work (if
> it doesn't, make sure there are no leftovers, recompile, reinstall and
> it should work fine).
>
>
> Regards,
>
> ___
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>


[OMPI users] MPI Literature Survey on Multicores

2010-04-06 Thread vaibhav dutt
Hi all,

I am doing a literature survey on MPI optimizations on Multi Cores and was
searching for
some good papers.I have got some papers on MPI Intra Node Data Transfer. Can
anybody please
suggest me how to go about it I mean how can I organize the Survey and also
some good source of
papers on the topics.

Thank You,