Hey Ben, It looks like HCOLL v2.x is too old for OMPI v1.10. Could you download HPCx from http://mellanox.com/poducts/hpcx which contains latest hcoll and use it to compile OMPI?
(also HPCx contains OMPI sources and precompiled w/ HCOLL) On Thu, Dec 24, 2015 at 2:29 AM, Ben Menadue <ben.mena...@nci.org.au> wrote: > Hi Mike, > > > > 11:18 bjm900@raijin7 ~ > rpm -qi hcoll > > Name : hcoll Relocations: > /opt/mellanox/hcoll > > Version : 2.0.472 Vendor: mellanox > > Release : 1 Build Date: Sun Jan 5 > 23:40:18 2014 > > Install Date: Fri Jun 12 15:00:23 2015 Build Host: localhost > > Group : Applications Source RPM: > hcoll-2.0.472-1.src.rpm > > Size : 14074151 License: Proprietary > > Signature : DSA/SHA1, Sun Feb 23 23:20:58 2014, Key ID c5ed83e26224c050 > > Packager : mellanox > > URL : http://www.mellanox.com > > Summary : Hierarchical collectives (HCOLL) > > Description : > > HCOLL... > > > > I also just realised that it doesn’t work for any OpenMPI 1.8.x series or > 1.10.0 either – while it is able to build (unlike for 1.10.1) it fails at > runtime with a symbol lookup error: > > > > symbol lookup error: /apps/openmpi/1.8.8/lib/openmpi/mca_coll_hcoll.so: > undefined symbol: hcoll_check_mem_release_cb_needed > > > > That symbol doesn’t appear to be in any of the .so files in > /opt/mellanox/hcoll, so I’m guessing it’s from a newer version of hcoll. > This would also explain why it warned about an implicit declaration of this > symbol during the builds: > > > > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:263: > warning: implicit declaration of function > 'hcoll_check_mem_release_cb_needed' > > > > Cheers, > > Ben > > > > *From:* users [mailto:users-boun...@open-mpi.org] *On Behalf Of *Mike > Dubman > *Sent:* Thursday, 24 December 2015 7:14 AM > *To:* Open MPI Users <us...@open-mpi.org> > *Subject:* Re: [OMPI users] hcoll API in 1.10.1 > > > > Hi, > > hcoll is part of MOFED or comes from HPCx. > > what version of hcoll do you have on your system? > > > > Thx > > > > On Wed, Dec 23, 2015 at 4:58 AM, Ben Menadue <ben.mena...@nci.org.au> > wrote: > > Hi, > > It's probably in plain sight somewhere and I missed it, but is there a > minimum version of hcoll needed to build 1.10.1? > > We have 2.0.0, which allows us to build 1.10.0, but 1.10.1 fails with > missing entries in the hcoll_collectives_t structure: > > CC coll_hcoll_module.lo > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c: In > function > 'mca_coll_hcoll_mem_release_cb': > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:60: > warning: > implicit declaration of function 'hcoll_mem_unmap' > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c: In > function > 'hcoll_comm_attr_del_fn': > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:166: > warning: implicit declaration of function 'hcoll_group_destroy_notify' > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c: In > function > 'mca_coll_hcoll_comm_query': > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:263: > warning: implicit declaration of function > 'hcoll_check_mem_release_cb_needed' > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:317: error: > 'hcoll_collectives_t' has no member named 'coll_alltoallv' > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:318: error: > 'hcoll_collectives_t' has no member named 'coll_gatherv' > ../../../../../../../../ompi/mca/coll/hcoll/coll_hcoll_module.c:324: error: > 'hcoll_collectives_t' has no member named 'coll_igatherv' > > It looks like PR567 in the 1.10 branch that added the new references. > > Cheers, > Ben > > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2015/12/28200.php > > > > > > -- > > > > Kind Regards, > > > > M. > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2015/12/28202.php > -- Kind Regards, M.