Hi YK.
An update that with your latest Mellanox drivers, I was able to compile OpenMPI
1.6.3 successfully.
So yes the issue was with the mxm drivers.
Thank you,
Joseph
On 12/06/2012 01:41 AM, Yevgeny Kliteynik wrote:
Joseph,
Indeed, there was a problem in the MXM rpm.
The fixed MXM has bee
Joseph,
Indeed, there was a problem in the MXM rpm.
The fixed MXM has been published at the same location:
http://mellanox.com/downloads/hpc/mxm/v1.1/mxm-latest.tar
-- YK
On 12/4/2012 9:20 AM, Joseph Farran wrote:
> Hi Mike.
>
> Removed the old mxm, downloaded and installed:
>
> /tmp/mxm/v
Hi Mike.
Removed the old mxm, downloaded and installed:
/tmp/mxm/v1.1/per-ofed/1.5.4.1/mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm
I am suing OFED 1.5.4.1 and it still fails at the same spot:
make[2]: Entering directory `/data/apps/sources/openmpi-1.6.3/ompi/mca/mtl/mxm'
CC mtl_mxm.lo
CC
Please download http://mellanox.com/downloads/hpc/mxm/v1.1/mxm-latest.tar,
it contains mxm.rpm for mofed 1.5.4.1
On Mon, Dec 3, 2012 at 8:18 AM, Mike Dubman wrote:
> ohh.. you have MOFED 1.5.4.1, thought it was 1.5.3-3.1.0
> will provide you a link to mxm package compiled with this MOFED version
ohh.. you have MOFED 1.5.4.1, thought it was 1.5.3-3.1.0
will provide you a link to mxm package compiled with this MOFED version
(thanks to no ABI in OFED).
On Sun, Dec 2, 2012 at 10:04 PM, Joseph Farran wrote:
> 1.5.4.1
No cigar with MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64
Here are my steps:
- Removed OFED-1.5.4.1 & rebooted
- Installed MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64
- rpm -e mxm
- rpm -i mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm <--- The new mxm you built.
- Rebooted
Check:
# rpm -qa | egrep "1.5.
Next I will try MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64, with the mxm and
try again.
Joseph
On 12/2/2012 12:04 PM, Joseph Farran wrote:
Hi again.
Had to get some sleep :-)
Same thing. Let me outline the steps I took in case I missed something.
I have a stock CentOS 6.3 with kernel 2.
Hi again.
Had to get some sleep :-)
Same thing. Let me outline the steps I took in case I missed something.
I have a stock CentOS 6.3 with kernel 2.6.32-279.14.1.el6.x86_64
Install OFED-1.5.4.1 as follows:
cd OFED-1.5.4.1
./install.pl --all --print-available
grep -v debuginfo ofed-all.conf
please redownload from
http://mellanox.com/downloads/hpc/mxm/v1.1/mxm-latest.tar
it contains binaries compiled with mofed 1.5.3-3.1.0
M
On Sun, Dec 2, 2012 at 12:13 PM, Mike Dubman wrote:
>
> It seems that your active mofed is 1.5.3-3.1.0, while installed mxm was
> compiled with 1.5.3-3.0.0
> MO
It seems that your active mofed is 1.5.3-3.1.0, while installed mxm was
compiled with 1.5.3-3.0.0
MOFED is not binary compatible, let me check and send you the link for mxm
compiled with mofed that you have.
Also, MOFED contains ompi 1.6.0 which is already compiled with mxm
(/usr/mpi/...)
On Sun,
Same thing.
My new config:
CFLAGS="" FCFLAGS="" ./configure \
--with-sge \
--with-openib=/usr \
--enable-openib-connectx-xrc \
--enable-mpi-thread-mul
please change "--with-openib" to "--with-openib=/usr" and retry
configure/make stage.
10x
On Sun, Dec 2, 2012 at 10:36 AM, Joseph Farran wrote:
> Hi Mike.
>
> Thanks for the help!
>
> I am installing OFED on an NFS share partition so that all compute nodes
> will have access.
>
> For the "--wi
Hi Mike.
Thanks for the help!
I am installing OFED on an NFS share partition so that all compute
nodes will have access.
For the "--with-openib" option, I don't specify one. My config
file looks like this:
CFLAGS="" FC
Hi Joseph,
I guess you install MOFED under /usr, is that right?
Could you please specify "--with-openib=/usr" parameter during ompi
"configure" stage?
10x
M
On Fri, Nov 30, 2012 at 1:11 AM, Joseph Farran wrote:
> Hi YK:
>
> Yes, I have those installed but they are newer versions:
>
> # rpm -qa |
Hi YK:
Yes, I have those installed but they are newer versions:
# rpm -qa | grep rdma
librdmacm-1.0.15-1.x86_64
librdmacm-utils-1.0.15-1.x86_64
librdmacm-devel-1.0.15-1.x86_64
# locate librdmacm.la
#
Here are the RPMs the Mellanox build created for kernel:
2.6.32-279.14.1.el6.x86_64
# ls *rdm
Joseph,
On 11/29/2012 11:50 PM, Joseph Farran wrote:
> make[2]: Entering directory
> `/data/apps/sources/openmpi-1.6.3/ompi/mca/mtl/mxm'
> CC mtl_mxm.lo
> CC mtl_mxm_cancel.lo
> CC mtl_mxm_component.lo
> CC mtl_mxm_endpoint.lo
> CC mtl_mxm_probe.lo
> CC mtl_mxm_recv.lo
> CC mtl_mxm_send.lo
> CCLD
On 11/28/2012 10:53 AM, Mike Dubman wrote:
You need mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm
On Wed, Nov 28, 2012 at 7:44 PM, Joseph Farran mailto:jfar...@uci.edu>> wrote:
mxm-1.1.3a5e745-1.x86_64-rhel6u3.rpm
After installing MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64, removing the old
mx
You need mxm-1.1.3a5e745-1.x86_64-**rhel6u3.rpm
On Wed, Nov 28, 2012 at 7:44 PM, Joseph Farran wrote:
> mxm-1.1.3a5e745-1.x86_64-**rhel6u3.rpm
>
Question on the version to use. The tar files contains the following RPMS:
# ls | grep -v debug
mxm-1.1.3a5e745-1.x86_64-centos5u6.rpm
mxm-1.1.3a5e745-1.x86_64-centos5u7.rpm
mxm-1.1.3a5e745-1.x86_64-centos6u0.rpm
mxm-1.1.3a5e745-1.x86_64-rhel5u5.rpm
mxm-1.1.3a5e745-1.x86_64-rhel6u1.rpm
mxm-1
Perfect and Thanks!
I had searched the Mellanox web site for the mxm package to no avail.
I will try recompiling later today.
Best,
Joseph
On 11/28/2012 06:29 AM, Yevgeny Kliteynik wrote:
On 11/28/2012 10:52 AM, Pavel Mezentsev wrote:
You can try downloading and installing a fresher version
On 11/28/2012 10:52 AM, Pavel Mezentsev wrote:
> You can try downloading and installing a fresher version of MXM from mellanox
> web site. There was a thread on the list with the same problem, you can
> search for it.
Indeed, that OFED version comes with older version of MXM.
You can get the new
You can try downloading and installing a fresher version of MXM from
mellanox web site. There was a thread on the list with the same problem,
you can search for it.
2012/11/28 Joseph Farran
> Howdy.
>
> I a have a stock CentOS 6.3 OS and a Mellanox MT26428 card.
>
> I installed the Mellanox OFE
Howdy.
I a have a stock CentOS 6.3 OS and a Mellanox MT26428 card.
I installed the Mellanox OFED MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64 which
installed just fine. Rebooted the system and when I try building OpenMPI
1.6.3, it aborts with:
mtl_mxm_send.c: In function 'ompi_mtl_mxm_isend':
23 matches
Mail list logo