I should probably point out that libhcoll.so does not know where libmxm.so is:

[dshrader@zo-fe1 ~]$ ldd /opt/mellanox/hcoll/lib/libhcoll.so
        linux-vdso.so.1 =>  (0x00007fffb2f1f000)
libibnetdisc.so.5 => /usr/lib64/libibnetdisc.so.5 (0x00007fe31bd0b000)
        libmxm.so.2 => not found
        libz.so.1 => /lib64/libz.so.1 (0x00007fe31baf4000)
        libdl.so.2 => /lib64/libdl.so.2 (0x00007fe31b8f0000)
        libosmcomp.so.3 => /usr/lib64/libosmcomp.so.3 (0x00007fe31b6e2000)
libocoms.so.0 => /opt/mellanox/hcoll/lib/libocoms.so.0 (0x00007fe31b499000)
        libm.so.6 => /lib64/libm.so.6 (0x00007fe31b215000)
        libnuma.so.1 => /usr/lib64/libnuma.so.1 (0x00007fe31b009000)
libalog.so.0 => /opt/mellanox/hcoll/lib/libalog.so.0 (0x00007fe31adfe000)
        librt.so.1 => /lib64/librt.so.1 (0x00007fe31abf6000)
        libibumad.so.3 => /usr/lib64/libibumad.so.3 (0x00007fe31a9ee000)
        librdmacm.so.1 => /usr/lib64/librdmacm.so.1 (0x00007fe31a7d9000)
        libibverbs.so.1 => /usr/lib64/libibverbs.so.1 (0x00007fe31a5c7000)
        libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fe31a3a9000)
        libc.so.6 => /lib64/libc.so.6 (0x00007fe31a015000)
        libglib-2.0.so.0 => /lib64/libglib-2.0.so.0 (0x00007fe319cfe000)
        libibmad.so.5 => /usr/lib64/libibmad.so.5 (0x00007fe319ae3000)
        /lib64/ld-linux-x86-64.so.2 (0x00007fe31c2d3000)
        libwrap.so.0 => /lib64/libwrap.so.0 (0x00007fe3198d8000)
        libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe3196c2000)
        libnsl.so.1 => /lib64/libnsl.so.1 (0x00007fe3194a8000)
        libutil.so.1 => /lib64/libutil.so.1 (0x00007fe3192a5000)
        libnl.so.1 => /lib64/libnl.so.1 (0x00007fe319052000)

Both hcoll and mxm where installed using the rpms provided by Mellanox.

Thanks again,
David

On 10/21/2015 09:34 AM, David Shrader wrote:
Hello All,

I'm currently trying to install 1.10.0 with hcoll and mxm, and am getting an error during configure:

--- MCA component coll:hcoll (m4 configuration macro)
checking for MCA component coll:hcoll compile mode... static
checking hcoll/api/hcoll_api.h usability... yes
checking hcoll/api/hcoll_api.h presence... yes
checking for hcoll/api/hcoll_api.h... yes
looking for library in lib
checking for library containing hcoll_get_version... no
looking for library in lib64
checking for library containing hcoll_get_version... no
configure: error: HCOLL support requested but not found.  Aborting

The configure line I used:

./configure --with-mxm=/opt/mellanox/mxm --with-hcoll=/opt/mellanox/hcoll --with-platform=contrib/platform/lanl/toss/optimized-panasas

Here are the corresponding lines from config.log:

configure:217014: gcc -std=gnu99 -o conftest -O3 -DNDEBUG -I/opt/panfs/include -finline-functions -fno-strict-aliasing -pthread -I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/hwloc/hwloc191/hwloc/include -I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/event/libevent2021/libevent -I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/event/libevent2021/libevent/include -I/opt/mellanox/hcoll/include -L/opt/mellanox/hcoll/lib conftest.c -lhcoll -lrt -lm -lutil >&5 /usr/bin/ld: warning: libmxm.so.2, needed by /opt/mellanox/hcoll/lib/libhcoll.so, not found (try using -rpath or -rpath-link) /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_req_recv' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_ep_create' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_config_free_context_opts' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_ep_destroy' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_config_free_ep_opts' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_progress' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_config_read_opts' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_ep_disconnect' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_mq_destroy' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_mq_create'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_cleanup'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_req_send' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_ep_connect'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_init'
/opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_ep_get_address' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_error_string' /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to `mxm_mem_unmap'
collect2: ld returned 1 exit status

An ldd on /opt/mellanox/hcoll/lib/libhcoll.so shows a dependency on libmxm.so, so the above error makes sense. I am using hcoll version 3.3.768 and mxm version 3.4.3065 (reported by rpm).

So, my question: is there a way to take care of this other than putting '-L/opt/mellanox/lib -lmxm' in to LDFLAGS/LIBS? Using LDFLAGS/LIBS will link mxm in to everything, which I would prefer not to do.

Thanks in advance!
David


--
David Shrader
HPC-3 High Performance Computer Systems
Los Alamos National Lab
Email: dshrader <at> lanl.gov

Reply via email to