Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-19 Thread Debendra Das
The problem has been solved with the latest snapshot. Thanks a lot for your help. Thanking You Debendra On Wed, Aug 17, 2016 at 11:34 PM, Jeff Squyres (jsquyres) < jsquy...@cisco.com> wrote: > Debendra - > > A fix has been submitted for the v2.0.1 release. Could you give it a try > with the late

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-17 Thread Jeff Squyres (jsquyres)
Debendra - A fix has been submitted for the v2.0.1 release. Could you give it a try with the latest snapshot (anything dated on or after Aug 17): https://www.open-mpi.org/nightly/v2.x/ > On Aug 16, 2016, at 6:21 AM, Gilles Gouaillardet > wrote: > > assuming you have an infiniband networ

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-16 Thread Gilles Gouaillardet
Assuming you have an InfiniBand network, another option is to install mxm (a proprietary but free Mellanox library) and rebuild Open MPI. pml/yalla will then be used instead of ob1, and you should be just fine. Cheers, Gilles On Tuesday, August 16, 2016, Jeff Squyres (jsquyres) wrote: > On Aug 16, 201

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-16 Thread Jeff Squyres (jsquyres)
On Aug 16, 2016, at 6:09 AM, Debendra Das wrote: > > As far as I understood I have to wait for version 2.0.1 to fix the issue. So > can you please give any idea about when 2.0.1 will be released. We had hoped to release it today, actually. :-\ But there's still a few issues we're working out

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-16 Thread Debendra Das
As far as I understood I have to wait for version 2.0.1 to fix the issue. So can you please give any idea about when 2.0.1 will be released. Also I could not understand how to use the patch. Thanking You, Debendranath Das On Mon, Aug 15, 2016 at 8:27 AM, Gilles Gouaillardet wrote: > Thanks for bo

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-14 Thread Gilles Gouaillardet
Thanks for both the report and posting the logs in a plain text file. I opened https://github.com/open-mpi/ompi/issues/1966 to track this issue; it contains a patch that fixes/works around this issue. Cheers, Gilles On 8/14/2016 7:39 PM, Debendra Das wrote: I have installed OpenMPI-2.0.0

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-14 Thread Debendra Das
I have installed OpenMPI-2.0.0 in 5 systems with IP addresses 172.16.5.29, 172.16.5.30, 172.16.5.31, 172.16.5.32, 172.16.5.33. While executing the hello_oshmem_c.c program (under the examples directory), correct output is coming only when execution is done using 2 distributed machines. But error is

Re: [OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-12 Thread r...@open-mpi.org
Just as a suggestion: most of us are leery of opening Word attachments on mailing lists. I’d suggest sending this to us as plain text if you want us to read it. > On Aug 12, 2016, at 4:03 AM, Debendra Das wrote: > > I have installed OpenMPI-2.0.0 in 5 systems with IP addresses 172.16.5.29, >

[OMPI users] OPENSHMEM ERROR with 2+ Distributed Machines

2016-08-12 Thread Debendra Das
I have installed OpenMPI-2.0.0 in 5 systems with IP addresses 172.16.5.29, 172.16.5.30, 172.16.5.31, 172.16.5.32, 172.16.5.33. While executing the hello_oshmem_c.c program (under the examples directory), correct output is coming only when execution is done using 2 distributed machines. But error is

Re: [OMPI users] OPENSHMEM ERROR

2016-08-06 Thread Debendra Das
I tried with the latest v2.0.1 nightly snapshot tarball, but still the problem exists. Thanks. On Tue, Aug 2, 2016 at 9:34 PM, Debendra Das wrote: > Dear Sir, > > I am really thankful for your help and I will definitely try the latest > version to see whether the problem is resolved or not. >

Re: [OMPI users] OPENSHMEM ERROR

2016-08-02 Thread Debendra Das
Dear Sir, I am really thankful for your help and I will definitely try the latest version to see whether the problem is resolved or not. Thanks. On Tue, Aug 2, 2016 at 8:13 PM, Jeff Squyres (jsquyres) wrote: > Debendra -- > > Can you try the latest v2.0.1 nightly snapshot tarball and see if t

Re: [OMPI users] OPENSHMEM ERROR

2016-08-02 Thread Jeff Squyres (jsquyres)
Debendra -- Can you try the latest v2.0.1 nightly snapshot tarball and see if the problem is resolved for you? https://www.open-mpi.org/nightly/v2.x/ Thanks. > On Jul 29, 2016, at 12:58 PM, Jeff Squyres (jsquyres) > wrote: > >> On Jul 29, 2016, at 8:49 AM, Jeff Squyres (jsquyres) >>

Re: [OMPI users] OPENSHMEM ERROR

2016-07-29 Thread Jeff Squyres (jsquyres)
> On Jul 29, 2016, at 8:49 AM, Jeff Squyres (jsquyres) > wrote: > > What happens when you run the ring_c test program, do you get the same error > as you do with hello_oshmem_c? I'm guessing ring_c will work, but oshmem_hello will still segv; I was just able to reproduce the problem. I file

Re: [OMPI users] OPENSHMEM ERROR

2016-07-29 Thread Jeff Squyres (jsquyres)
What happens when you run the ring_c test program, do you get the same error as you do with hello_oshmem_c? Can you send all the information listed here: https://www.open-mpi.org/community/help/ > On Jul 29, 2016, at 6:15 AM, Debendra Das wrote: > > I have installed OpenMPI-2.0.0 in 2 s

[OMPI users] OPENSHMEM ERROR

2016-07-29 Thread Debendra Das
I have installed OpenMPI-2.0.0 in 2 systems with IP addresses 172.16.5.33 and 172.16.5.32. I have compiled the hello_c.c and hello_oshmem_c.c files which are in the examples directory. The respective object files are hello_c and hello_oshmem_c. When I am executing hello_c it is running fine, but wh