Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-10 Thread Joseph Farran
Hi YK. An update that with your latest Mellanox drivers, I was able to compile OpenMPI 1.6.3 successfully. So yes the issue was with the mxm drivers. Thank you, Joseph On 12/06/2012 01:41 AM, Yevgeny Kliteynik wrote: Joseph, Indeed, there was a problem in the MXM rpm. The fixed MXM has bee

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-06 Thread Yevgeny Kliteynik
Joseph, Indeed, there was a problem in the MXM rpm. The fixed MXM has been published at the same location: http://mellanox.com/downloads/hpc/mxm/v1.1/mxm-latest.tar -- YK On 12/4/2012 9:20 AM, Joseph Farran wrote: > Hi Mike. > > Removed the old mxm, downloaded and installed: > > /tmp/mxm/v

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-04 Thread Joseph Farran
Hi Mike. Removed the old mxm, downloaded and installed: /tmp/mxm/v1.1/per-ofed/1.5.4.1/mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm I am suing OFED 1.5.4.1 and it still fails at the same spot: make[2]: Entering directory `/data/apps/sources/openmpi-1.6.3/ompi/mca/mtl/mxm' CC mtl_mxm.lo CC

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-03 Thread Mike Dubman
Please download http://mellanox.com/downloads/hpc/mxm/v1.1/mxm-latest.tar, it contains mxm.rpm for mofed 1.5.4.1 On Mon, Dec 3, 2012 at 8:18 AM, Mike Dubman wrote: > ohh.. you have MOFED 1.5.4.1, thought it was 1.5.3-3.1.0 > will provide you a link to mxm package compiled with this MOFED version

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-03 Thread Mike Dubman
ohh.. you have MOFED 1.5.4.1, thought it was 1.5.3-3.1.0 will provide you a link to mxm package compiled with this MOFED version (thanks to no ABI in OFED). On Sun, Dec 2, 2012 at 10:04 PM, Joseph Farran wrote: > 1.5.4.1

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Joseph Farran
No cigar with MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64 Here are my steps: - Removed OFED-1.5.4.1 & rebooted - Installed MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64 - rpm -e mxm - rpm -i mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm <--- The new mxm you built. - Rebooted Check: # rpm -qa | egrep "1.5.

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Joseph Farran
Next I will try MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64, with the mxm and try again. Joseph On 12/2/2012 12:04 PM, Joseph Farran wrote: Hi again. Had to get some sleep :-) Same thing. Let me outline the steps I took in case I missed something. I have a stock CentOS 6.3 with kernel 2.

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Joseph Farran
Hi again. Had to get some sleep :-) Same thing. Let me outline the steps I took in case I missed something. I have a stock CentOS 6.3 with kernel 2.6.32-279.14.1.el6.x86_64 Install OFED-1.5.4.1 as follows: cd OFED-1.5.4.1 ./install.pl --all --print-available grep -v debuginfo ofed-all.conf

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Mike Dubman
please redownload from http://mellanox.com/downloads/hpc/mxm/v1.1/mxm-latest.tar it contains binaries compiled with mofed 1.5.3-3.1.0 M On Sun, Dec 2, 2012 at 12:13 PM, Mike Dubman wrote: > > It seems that your active mofed is 1.5.3-3.1.0, while installed mxm was > compiled with 1.5.3-3.0.0 > MO

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Mike Dubman
It seems that your active mofed is 1.5.3-3.1.0, while installed mxm was compiled with 1.5.3-3.0.0 MOFED is not binary compatible, let me check and send you the link for mxm compiled with mofed that you have. Also, MOFED contains ompi 1.6.0 which is already compiled with mxm (/usr/mpi/...) On Sun,

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Joseph Farran
Same thing. My new config:     CFLAGS="" FCFLAGS="" ./configure    \     --with-sge  \     --with-openib=/usr  \     --enable-openib-connectx-xrc    \     --enable-mpi-thread-mul

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Mike Dubman
please change "--with-openib" to "--with-openib=/usr" and retry configure/make stage. 10x On Sun, Dec 2, 2012 at 10:36 AM, Joseph Farran wrote: > Hi Mike. > > Thanks for the help! > > I am installing OFED on an NFS share partition so that all compute nodes > will have access. > > For the "--wi

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Joseph Farran
Hi Mike. Thanks for the help! I am installing OFED on an NFS share partition so that all compute nodes will have access. For the "--with-openib" option, I don't specify one.   My config file looks like this:     CFLAGS="" FC

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-12-02 Thread Mike Dubman
Hi Joseph, I guess you install MOFED under /usr, is that right? Could you please specify "--with-openib=/usr" parameter during ompi "configure" stage? 10x M On Fri, Nov 30, 2012 at 1:11 AM, Joseph Farran wrote: > Hi YK: > > Yes, I have those installed but they are newer versions: > > # rpm -qa |

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-29 Thread Joseph Farran
Hi YK: Yes, I have those installed but they are newer versions: # rpm -qa | grep rdma librdmacm-1.0.15-1.x86_64 librdmacm-utils-1.0.15-1.x86_64 librdmacm-devel-1.0.15-1.x86_64 # locate librdmacm.la # Here are the RPMs the Mellanox build created for kernel: 2.6.32-279.14.1.el6.x86_64 # ls *rdm

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-29 Thread Yevgeny Kliteynik
Joseph, On 11/29/2012 11:50 PM, Joseph Farran wrote: > make[2]: Entering directory > `/data/apps/sources/openmpi-1.6.3/ompi/mca/mtl/mxm' > CC mtl_mxm.lo > CC mtl_mxm_cancel.lo > CC mtl_mxm_component.lo > CC mtl_mxm_endpoint.lo > CC mtl_mxm_probe.lo > CC mtl_mxm_recv.lo > CC mtl_mxm_send.lo > CCLD

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-29 Thread Joseph Farran
On 11/28/2012 10:53 AM, Mike Dubman wrote: You need mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm On Wed, Nov 28, 2012 at 7:44 PM, Joseph Farran mailto:jfar...@uci.edu>> wrote: mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm After installing MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64, removing the old mx

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-28 Thread Mike Dubman
You need mxm-1.1.3a5e745-1.x86_64-**rhel6u3.rpm On Wed, Nov 28, 2012 at 7:44 PM, Joseph Farran wrote: > mxm-1.1.3a5e745-1.x86_64-**rhel6u3.rpm >

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-28 Thread Joseph Farran
Question on the version to use. The tar files contains the following RPMS: # ls | grep -v debug mxm-1.1.3a5e745-1.x86_64-centos5u6.rpm mxm-1.1.3a5e745-1.x86_64-centos5u7.rpm mxm-1.1.3a5e745-1.x86_64-centos6u0.rpm mxm-1.1.3a5e745-1.x86_64-rhel5u5.rpm mxm-1.1.3a5e745-1.x86_64-rhel6u1.rpm mxm-1

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-28 Thread Joseph Farran
Perfect and Thanks! I had searched the Mellanox web site for the mxm package to no avail. I will try recompiling later today. Best, Joseph On 11/28/2012 06:29 AM, Yevgeny Kliteynik wrote: On 11/28/2012 10:52 AM, Pavel Mezentsev wrote: You can try downloading and installing a fresher version

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-28 Thread Yevgeny Kliteynik
On 11/28/2012 10:52 AM, Pavel Mezentsev wrote: > You can try downloading and installing a fresher version of MXM from mellanox > web site. There was a thread on the list with the same problem, you can > search for it. Indeed, that OFED version comes with older version of MXM. You can get the new

Re: [OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-28 Thread Pavel Mezentsev
You can try downloading and installing a fresher version of MXM from mellanox web site. There was a thread on the list with the same problem, you can search for it. 2012/11/28 Joseph Farran > Howdy. > > I a have a stock CentOS 6.3 OS and a Mellanox MT26428 card. > > I installed the Mellanox OFE

[OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-27 Thread Joseph Farran
Howdy. I a have a stock CentOS 6.3 OS and a Mellanox MT26428 card. I installed the Mellanox OFED MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64 which installed just fine. Rebooted the system and when I try building OpenMPI 1.6.3, it aborts with: mtl_mxm_send.c: In function 'ompi_mtl_mxm_isend':