Hi Jeff and Ralph,

Thanks.  I'm really a novice user - and in cases like this one I don't really 
know what I'm doing.  In this case, I just wanted to get my application to run 
without throwing strange error messages and quitting. :)  That said, I would 
much rather learn about the components of MPI rather than taking shots in the 
dark.

On different clusters where I was getting error messages related to a 
component, the advice from this mailing list was to disable that component.  
Currently I'm building OpenMPI with all components, and my environment 
variables disable the components at runtime: I have OMPI_MCA_ras=^tm and 
OMPI_MCA_btl=self,sm,tcp.  

The latter seems to disable the advanced networking-related components that 
were throwing the errors.  I am not sure how a BTL works for MPI ranks that are 
running on the same node.  Do the different MPI ranks (processes) on a node 
still use a BTL to communicate with each other?  And which one does it prefer 
to use (sm, tcp or something else?)

Thanks,

- Lee-Ping

On Sep 30, 2014, at 7:29 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com> wrote:

> How can you run MPI jobs at all without any BTLs?  That sounds weird -- this 
> is not a case for which we designed the code base.
> 
> All that being said, you're getting compile errors in the OMPI build because 
> of two things:
> 
> - you selected to build static
> - you didn't disable enough stuff
> 
> Specifically, statically building verbs-based code is not for the meek (see 
> the FAQ).  We have verbs-based code in a few places: the BTLs, and also in 
> the "common" framework.  Hence, the linker errors you are getting are because 
> the "common" verbs component was still built (because it wasn't disabled), 
> and because building statically with verbs is... tricky (see the FAQ).
> 
> You might have better luck with:
> 
> ./configure --enable-mca-no-build=btl,common-verbs ...
> 
> Or, better yet:
> 
> ./configure --enable-mca-no-build=btl --without-verbs ...
> 
> But again, I'm not sure how well OMPI will function without any BTLs.
> 
> 
> 
> On Sep 29, 2014, at 11:47 PM, Ralph Castain <r...@open-mpi.org> wrote:
> 
>> ompi_info is just the first time when an executable is built, and so it 
>> always is the place where we find missing library issues. It looks like 
>> someone has left incorrect configure logic in the system such that we always 
>> attempt to build Infiniband-related code, but without linking against the 
>> library.
>> 
>> We'll have to try and track it down.
>> 
>> On Sep 29, 2014, at 5:08 PM, Lee-Ping Wang <leep...@stanford.edu> wrote:
>> 
>>> Hi Gus,
>>> 
>>> Thank you.  I did start from a completely clean directory tree every time 
>>> (I deleted the whole folder and re-extracted the tarball).
>>> 
>>> I noticed that disabling any of the BTL components resulted in the same 
>>> error, so my solution was to build everything and disable certain 
>>> components at runtime.
>>> 
>>> - Lee-Ping
>>> 
>>> On Sep 29, 2014, at 6:03 AM, Gustavo Correa <g...@ldeo.columbia.edu> wrote:
>>> 
>>>> Hi Lee-Ping 
>>>> 
>>>> Did you cleanup the old build, to start fresh?
>>>> 
>>>> make distclean 
>>>> configure --disable-vt ...
>>>> ...
>>>> 
>>>> I hope this helps,
>>>> Gus Correa
>>>> 
>>>> On Sep 29, 2014, at 8:47 AM, Lee-Ping Wang wrote:
>>>> 
>>>>> Hmm, the build doesn't finish - it breaks when trying to create the man 
>>>>> page.  I guess I'll disable only a few specific BTL components that have 
>>>>> given me issues in the past. 
>>>>> 
>>>>> Creating ompi_info.1 man page...
>>>>> CCLD     ompi_info
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to 
>>>>> `ibv_free_device_list'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_alloc_pd'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_close_device'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_dealloc_pd'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_destroy_qp'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_create_cq'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_get_sysfs_path'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_open_device'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_create_qp'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_query_device'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to 
>>>>> `ibv_get_device_list'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to 
>>>>> `ibv_get_device_name'
>>>>> ../../../ompi/.libs/libmpi.so: undefined reference to `ibv_destroy_cq'
>>>>> collect2: error: ld returned 1 exit status
>>>>> 
>>>>> Thanks,
>>>>> 
>>>>> - Lee-Ping
>>>>> 
>>>>> On Sep 29, 2014, at 5:27 AM, Lee-Ping Wang <leep...@stanford.edu> wrote:
>>>>> 
>>>>>> Hi there,
>>>>>> 
>>>>>> I'm building OpenMPI 1.8.3 on a system where I explicitly don't want any 
>>>>>> of the BTL components (they tend to break my single node jobs).  
>>>>>> 
>>>>>> ./configure CC=gcc CXX=g++ F77=gfortran FC=gfortran 
>>>>>> --prefix=$QC_EXT_LIBS/openmpi --enable-static --enable-mca-no-build=btl
>>>>>> 
>>>>>> Building gives me this error in the vt component - it appears to be 
>>>>>> expecting some Infiniband stuff:
>>>>>> 
>>>>>> CCLD     otfmerge-mpi
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_free_device_list'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_alloc_pd'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_close_device'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_dealloc_pd'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_destroy_qp'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_create_cq'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_get_sysfs_path'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_open_device'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_create_qp'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_query_device'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_get_device_list'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_get_device_name'
>>>>>> /u/sciteam/leeping/opt/qchem-4.2/ext-libs/openmpi-1.8.3/ompi/contrib/vt/vt/../../../.libs/libmpi.so:
>>>>>>  undefined reference to `ibv_destroy_cq'
>>>>>> collect2: error: ld returned 1 exit status
>>>>>> make[10]: *** [otfmerge-mpi] Error 1
>>>>>> 
>>>>>> I've decided to disable the vt component since I doubt I'm using it, but 
>>>>>> this could be good to know.
>>>>>> 
>>>>>> Thanks,
>>>>>> 
>>>>>> - Lee-Ping
>>>>> 
>>>>> _______________________________________________
>>>>> users mailing list
>>>>> us...@open-mpi.org
>>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>>>> Link to this post: 
>>>>> http://www.open-mpi.org/community/lists/users/2014/09/25399.php
>>>> 
>>>> _______________________________________________
>>>> users mailing list
>>>> us...@open-mpi.org
>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>>> Link to this post: 
>>>> http://www.open-mpi.org/community/lists/users/2014/09/25400.php
>>> 
>>> 
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> Link to this post: 
>>> http://www.open-mpi.org/community/lists/users/2014/09/25409.php
>> 
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/users/2014/09/25414.php
> 
> 
> -- 
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to: 
> http://www.cisco.com/web/about/doing_business/legal/cri/
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2014/09/25416.php


Reply via email to