Re: [OMPI users] Error using hostfile

2011-07-11 Thread Mohan, Ashwin
Thank you Ralph. I was able to ssh back and forth between nodes. It also seemed that the environment variables were all set fine. Turning the firewall down seems to make this work just fine. From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: F

Re: [OMPI users] Error-Open MPI over Infiniband: polling LP CQ with status LOCAL LENGTH ERROR

2011-07-11 Thread yanyg
Hi Yevgeny, Thanks. Here is the output of /usr/bin/ibv_devinfo: hca_id: mlx4_0 transport: InfiniBand (0) fw_ver: 2.8.000 node_guid: 0002:c903:0010:a85a sys_image_gui

Re: [OMPI users] Mpirun only works when n< 3

2011-07-11 Thread Randolph Pullen
I have discovered slightly more information:When I replace node 'B' from the new cluster with node 'C' from the old clusterI get the similar behavior but with an error message:mpirun -H A,A,A,A,A,A,A  ring     (works from either node) mpirun -H C,C,C  ring     (works from either node) mpirun -H A

Re: [OMPI users] Mpirun only works when n< 3

2011-07-11 Thread Randolph Pullen
There are no firewalls by default.  I can ssh between both nodes without a password so I assumed that all is good with the comms.I can also get both nodes to participate in the ring program at the same time.Its just that I am limited to inly 2 processes if they are split between the nodes ie:mpi

Re: [OMPI users] OpenMPI with NAG compiler and gcc 4.6

2011-07-11 Thread Jeff Squyres
I'm going to move this over to the devel list... On Jul 11, 2011, at 4:17 AM, Ning Li wrote: > Hi Jeff, > > I am willing to help test OpenMPI with the NAG compiler from time to time but > not sure how. If you could give me specific instructions I am very happy to > help. > > As for this techn

Re: [OMPI users] InfiniBand, different OpenFabrics transport types

2011-07-11 Thread Bill Johnstone
Hi Yevgeny and list, - Original Message - > From: Yevgeny Kliteynik > I'll check the MCA_BTL_OPENIB_TRANSPORT_UNKNOWN thing and get back to you. Thank you. > One question though, just to make sure we're on the same page: so the jobs > do run OK on > the older HCAs, as long as they

Re: [OMPI users] Mpirun only works when n< 3

2011-07-11 Thread Jeff Squyres
Have you disabled firewalls between your compute nodes? On Jul 11, 2011, at 9:34 AM, Randolph Pullen wrote: > This appears to be similar to the problem described in: > > https://svn.open-mpi.org/trac/ompi/ticket/2043 > > However, those fixes do not work for me. > > I am running on an > > -

[OMPI users] Mpirun only works when n< 3

2011-07-11 Thread Randolph Pullen
This appears to be similar to the problem described in: https://svn.open-mpi.org/trac/ompi/ticket/2043 However, those fixes do not work for me. I am running on an - i5 sandy bridge under Ubuntu 10.10  8 G RAM - Kernel 2.6.32.14 with OpenVZ tweaks - OpenMPI V 1.4.1 I am tryin

Re: [OMPI users] OpenMPI with NAG compiler and gcc 4.6

2011-07-11 Thread Ning Li
Hi Jeff, I am willing to help test OpenMPI with the NAG compiler from time to time but not sure how. If you could give me specific instructions I am very happy to help. As for this technical issue, I did some research online. It appears that a later version of Libtool (probably 2.2.10+) added