I should probably point out that libhcoll.so does not know where
libmxm.so is:
[dshrader@zo-fe1 ~]$ ldd /opt/mellanox/hcoll/lib/libhcoll.so
linux-vdso.so.1 => (0x00007fffb2f1f000)
libibnetdisc.so.5 => /usr/lib64/libibnetdisc.so.5
(0x00007fe31bd0b000)
libmxm.so.2 => not found
libz.so.1 => /lib64/libz.so.1 (0x00007fe31baf4000)
libdl.so.2 => /lib64/libdl.so.2 (0x00007fe31b8f0000)
libosmcomp.so.3 => /usr/lib64/libosmcomp.so.3 (0x00007fe31b6e2000)
libocoms.so.0 => /opt/mellanox/hcoll/lib/libocoms.so.0
(0x00007fe31b499000)
libm.so.6 => /lib64/libm.so.6 (0x00007fe31b215000)
libnuma.so.1 => /usr/lib64/libnuma.so.1 (0x00007fe31b009000)
libalog.so.0 => /opt/mellanox/hcoll/lib/libalog.so.0
(0x00007fe31adfe000)
librt.so.1 => /lib64/librt.so.1 (0x00007fe31abf6000)
libibumad.so.3 => /usr/lib64/libibumad.so.3 (0x00007fe31a9ee000)
librdmacm.so.1 => /usr/lib64/librdmacm.so.1 (0x00007fe31a7d9000)
libibverbs.so.1 => /usr/lib64/libibverbs.so.1 (0x00007fe31a5c7000)
libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fe31a3a9000)
libc.so.6 => /lib64/libc.so.6 (0x00007fe31a015000)
libglib-2.0.so.0 => /lib64/libglib-2.0.so.0 (0x00007fe319cfe000)
libibmad.so.5 => /usr/lib64/libibmad.so.5 (0x00007fe319ae3000)
/lib64/ld-linux-x86-64.so.2 (0x00007fe31c2d3000)
libwrap.so.0 => /lib64/libwrap.so.0 (0x00007fe3198d8000)
libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe3196c2000)
libnsl.so.1 => /lib64/libnsl.so.1 (0x00007fe3194a8000)
libutil.so.1 => /lib64/libutil.so.1 (0x00007fe3192a5000)
libnl.so.1 => /lib64/libnl.so.1 (0x00007fe319052000)
Both hcoll and mxm where installed using the rpms provided by Mellanox.
Thanks again,
David
On 10/21/2015 09:34 AM, David Shrader wrote:
Hello All,
I'm currently trying to install 1.10.0 with hcoll and mxm, and am
getting an error during configure:
--- MCA component coll:hcoll (m4 configuration macro)
checking for MCA component coll:hcoll compile mode... static
checking hcoll/api/hcoll_api.h usability... yes
checking hcoll/api/hcoll_api.h presence... yes
checking for hcoll/api/hcoll_api.h... yes
looking for library in lib
checking for library containing hcoll_get_version... no
looking for library in lib64
checking for library containing hcoll_get_version... no
configure: error: HCOLL support requested but not found. Aborting
The configure line I used:
./configure --with-mxm=/opt/mellanox/mxm
--with-hcoll=/opt/mellanox/hcoll
--with-platform=contrib/platform/lanl/toss/optimized-panasas
Here are the corresponding lines from config.log:
configure:217014: gcc -std=gnu99 -o conftest -O3 -DNDEBUG
-I/opt/panfs/include -finline-functions -fno-strict-aliasing -pthread
-I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/hwloc/hwloc191/hwloc/include
-I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/event/libevent2021/libevent
-I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/event/libevent2021/libevent/include
-I/opt/mellanox/hcoll/include -L/opt/mellanox/hcoll/lib conftest.c
-lhcoll -lrt -lm -lutil >&5
/usr/bin/ld: warning: libmxm.so.2, needed by
/opt/mellanox/hcoll/lib/libhcoll.so, not found (try using -rpath or
-rpath-link)
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_req_recv'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_ep_create'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_config_free_context_opts'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_ep_destroy'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_config_free_ep_opts'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_progress'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_config_read_opts'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_ep_disconnect'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_mq_destroy'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_mq_create'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_cleanup'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_req_send'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_ep_connect'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_init'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_ep_get_address'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_error_string'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
`mxm_mem_unmap'
collect2: ld returned 1 exit status
An ldd on /opt/mellanox/hcoll/lib/libhcoll.so shows a dependency on
libmxm.so, so the above error makes sense. I am using hcoll version
3.3.768 and mxm version 3.4.3065 (reported by rpm).
So, my question: is there a way to take care of this other than
putting '-L/opt/mellanox/lib -lmxm' in to LDFLAGS/LIBS? Using
LDFLAGS/LIBS will link mxm in to everything, which I would prefer not
to do.
Thanks in advance!
David
--
David Shrader
HPC-3 High Performance Computer Systems
Los Alamos National Lab
Email: dshrader <at> lanl.gov