Jeff, Yevgeny,

Thanks for your responses.

We found the problem. Issue was that the librdmacm-devel rpm was not installed 
on the build system.
Installed the rpm and re-built OpenMPI. Now RoCE works fine.

You might the requirement for the librdmacm-devel rpm to the install readme.

-Jeff



> -----Original Message-----
> From: Jeff Squyres [mailto:jsquy...@cisco.com]
> Sent: Wednesday, October 05, 2011 9:15 AM
> To: klit...@dev.mellanox.co.il; Open MPI Users
> Cc: Konz, Jeffrey (SSA Solution Centers)
> Subject: Re: [OMPI users] problem running with RoCE over 10GbE
> 
> On Oct 5, 2011, at 9:04 AM, Yevgeny Kliteynik wrote:
> 
> >> Built OpenMPI with this option "--enable-openib-rdmacm".
> >> Our system has OFED 1.5.2 with librdmacm-1.0.13-1
> >>
> >> I noticed this output from configure script:
> >> checking rdma/rdma_cma.h usability... no
> >> checking rdma/rdma_cma.h presence... no
> >> checking for rdma/rdma_cma.h... no
> >> checking whether IBV_LINK_LAYER_ETHERNET is declared... yes
> >> checking if RDMAoE support is enabled... yes
> >> checking for infiniband/driver.h... yes
> >> checking if ConnectX XRC support is enabled... yes
> >> checking if dynamic SL is enabled... no
> >> checking if OpenFabrics RDMACM support is enabled... no
> >>
> >> Are we missing a build option or a piece of software?
> >> Config.log and output from "ompi_info --all" attached.
> >
> > You shouldn't use the "--enable-openib-rdmacm" option - rdmacm
> > support is enabled by default, providing librdmacm is found on
> > the machine.
> 
> Actually, this might be a configure bug.  We have lots of other
> configure options that, even if "foo" support is optional, if you
> specify "--with-foo", then OMPI treats it as mandatory.  Specifically,
> if foo can't be found, it's an error and configure should abort (i.e.,
> let a human figure it out).
> 
> Yevgeny -- can you check that out?
> 
> > So the question is, why OMPI config script didn't find it?
> > OMPI looks for "rdma/rdma_cma.h" header. Do you have it on
> > you build machine?
> > The usual location of this file is /usr/include/rdma/rdma_cma.h
> 
> Here's the culprit in config.log:
> 
> configure:118771: checking rdma/rdma_cma.h presence
> configure:118771: gcc -E    conftest.c
> conftest.c:573:27: error: rdma/rdma_cma.h: No such file or directory
> configure:118771: $? = 1
> 
> I'd double check that that file is actually present on your system.  I
> don't think <> vs. "" will make a difference, though.
> 
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to:
> http://www.cisco.com/web/about/doing_business/legal/cri/


Reply via email to