We're using TOSS which is based on Red Hat. The current version we're running is based on Red Hat 6.6. I'm actually not sure what mofed version we're using right now based on what I can find on the system and the admins over that are out. I'll get back to you on that as soon as I know.

Using LD_LIBRARY_PATH before configure got it to work, which I didn't expect. Thanks for the tip! I didn't realize that loading in a shared library of a library that is being linked in on the active compile line fell under the runtime portion of linking, and could be affected by using LD_LIBRARY_PATH.

Thanks!
David

On 10/21/2015 09:59 AM, Mike Dubman wrote:
Hi David,
what linux distro do you use? (and mofed version)?
Do you have /etc/ld.conf.d/mxm.conf file?
Can you please try add LD_LIBRARY_PATH=/opt/mellanox/mxm/lib ./configure ....?


Thanks

On Wed, Oct 21, 2015 at 6:40 PM, David Shrader <dshra...@lanl.gov <mailto:dshra...@lanl.gov>> wrote:

    I should probably point out that libhcoll.so does not know where
    libmxm.so is:

    [dshrader@zo-fe1 ~]$ ldd /opt/mellanox/hcoll/lib/libhcoll.so
            linux-vdso.so.1 =>  (0x00007fffb2f1f000)
            libibnetdisc.so.5 => /usr/lib64/libibnetdisc.so.5
    (0x00007fe31bd0b000)
            libmxm.so.2 => not found
            libz.so.1 => /lib64/libz.so.1 (0x00007fe31baf4000)
            libdl.so.2 => /lib64/libdl.so.2 (0x00007fe31b8f0000)
            libosmcomp.so.3 => /usr/lib64/libosmcomp.so.3
    (0x00007fe31b6e2000)
            libocoms.so.0 => /opt/mellanox/hcoll/lib/libocoms.so.0
    (0x00007fe31b499000)
            libm.so.6 => /lib64/libm.so.6 (0x00007fe31b215000)
            libnuma.so.1 => /usr/lib64/libnuma.so.1 (0x00007fe31b009000)
            libalog.so.0 => /opt/mellanox/hcoll/lib/libalog.so.0
    (0x00007fe31adfe000)
            librt.so.1 => /lib64/librt.so.1 (0x00007fe31abf6000)
            libibumad.so.3 => /usr/lib64/libibumad.so.3
    (0x00007fe31a9ee000)
            librdmacm.so.1 => /usr/lib64/librdmacm.so.1
    (0x00007fe31a7d9000)
            libibverbs.so.1 => /usr/lib64/libibverbs.so.1
    (0x00007fe31a5c7000)
            libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fe31a3a9000)
            libc.so.6 => /lib64/libc.so.6 (0x00007fe31a015000)
            libglib-2.0.so.0 => /lib64/libglib-2.0.so.0
    (0x00007fe319cfe000)
            libibmad.so.5 => /usr/lib64/libibmad.so.5 (0x00007fe319ae3000)
            /lib64/ld-linux-x86-64.so.2 (0x00007fe31c2d3000)
            libwrap.so.0 => /lib64/libwrap.so.0 (0x00007fe3198d8000)
            libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe3196c2000)
            libnsl.so.1 => /lib64/libnsl.so.1 (0x00007fe3194a8000)
            libutil.so.1 => /lib64/libutil.so.1 (0x00007fe3192a5000)
            libnl.so.1 => /lib64/libnl.so.1 (0x00007fe319052000)

    Both hcoll and mxm where installed using the rpms provided by
    Mellanox.

    Thanks again,
    David


    On 10/21/2015 09:34 AM, David Shrader wrote:

        Hello All,

        I'm currently trying to install 1.10.0 with hcoll and mxm, and
        am getting an error during configure:

        --- MCA component coll:hcoll (m4 configuration macro)
        checking for MCA component coll:hcoll compile mode... static
        checking hcoll/api/hcoll_api.h usability... yes
        checking hcoll/api/hcoll_api.h presence... yes
        checking for hcoll/api/hcoll_api.h... yes
        looking for library in lib
        checking for library containing hcoll_get_version... no
        looking for library in lib64
        checking for library containing hcoll_get_version... no
        configure: error: HCOLL support requested but not found.  Aborting

        The configure line I used:

        ./configure --with-mxm=/opt/mellanox/mxm
        --with-hcoll=/opt/mellanox/hcoll
        --with-platform=contrib/platform/lanl/toss/optimized-panasas

        Here are the corresponding lines from config.log:

        configure:217014: gcc -std=gnu99 -o conftest -O3 -DNDEBUG
        -I/opt/panfs/include -finline-functions -fno-strict-aliasing
        -pthread
        
-I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/hwloc/hwloc191/hwloc/include
        
-I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/event/libevent2021/libevent
        
-I/usr/projects/hpctools/dshrader/hpcsoft/openmpi/1.10.0/openmpi-1.10.0/opal/mca/event/libevent2021/libevent/include
        -I/opt/mellanox/hcoll/include  -L/opt/mellanox/hcoll/lib
        conftest.c -lhcoll  -lrt -lm -lutil   >&5
        /usr/bin/ld: warning: libmxm.so.2, needed by
        /opt/mellanox/hcoll/lib/libhcoll.so, not found (try using
        -rpath or -rpath-link)
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_req_recv'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_ep_create'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_config_free_context_opts'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_ep_destroy'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_config_free_ep_opts'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_progress'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_config_read_opts'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_ep_disconnect'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_mq_destroy'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_mq_create'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_cleanup'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_req_send'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_ep_connect'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_init'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_ep_get_address'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_error_string'
        /opt/mellanox/hcoll/lib/libhcoll.so: undefined reference to
        `mxm_mem_unmap'
        collect2: ld returned 1 exit status

        An ldd on /opt/mellanox/hcoll/lib/libhcoll.so shows a
        dependency on libmxm.so, so the above error makes sense. I am
        using hcoll version 3.3.768 and mxm version 3.4.3065 (reported
        by rpm).

        So, my question: is there a way to take care of this other
        than putting '-L/opt/mellanox/lib -lmxm' in to LDFLAGS/LIBS?
        Using LDFLAGS/LIBS will link mxm in to everything, which I
        would prefer not to do.

        Thanks in advance!
        David


-- David Shrader
    HPC-3 High Performance Computer Systems
    Los Alamos National Lab
    Email: dshrader <at> lanl.gov <http://lanl.gov>

    _______________________________________________
    users mailing list
    us...@open-mpi.org <mailto:us...@open-mpi.org>
    Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
    Link to this post:
    http://www.open-mpi.org/community/lists/users/2015/10/27907.php




--

Kind Regards,

M.


_______________________________________________
users mailing list
us...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: 
http://www.open-mpi.org/community/lists/users/2015/10/27908.php

--
David Shrader
HPC-3 High Performance Computer Systems
Los Alamos National Lab
Email: dshrader <at> lanl.gov

Reply via email to