Victor,

If the MX library supports a mapper, Open MPI takes provide selective hardware 
activation. Look at the MCA parameters supported by the MX devices to get more 
info (ompi_info —mca btl mx). The one that is of interest in this particular 
case is mx_if_include, allowing MX based jobs to only use the Myrinet card that 
has a mapper matching the provided key. You should set this MCA parameter to 
the last 6 digits of your mapper MAC (—mca btl mx_if_include abcdef).

  George.

On Jan 14, 2014, at 00:36 , Barrett, Brian W <bwba...@sandia.gov> wrote:

> Victor -
> 
> I don't think our multi-port support with MX is particularly well tested (I 
> know I don't test that path).
> 
> It looks like you might be able to work around the problem by setting -mca 
> mtl_mx_endpoint_num 1 on the mpirun command line, which will only use the 
> first port found.  But I could be wrong.
> 
> Brian
> 
> On 1/9/14 5:02 PM, "Victor Prosolin" <victor.proso...@rwdi.com> wrote:
> 
>> H,
>> Our cluster has servers with either a single port or a dual port Myrinet 
>> card. In case of a dual card, only one port is connected to the Myrinet 
>> switch. The OpenMPI library is configured with “--with-mx=…” option and it 
>> works fine when I submit jobs to single port servers only. However, when I 
>> try to include a server with a dual port card, I get a bunch of errors like 
>> the following:
>> [compute-08:17788] mx_connect fail for unknown 60dd464f9d nic_id with key 
>> aaaaffff (error Destination NIC not found in network table)
>>  
>> 60dd464f9d is the wrong MAC address corresponding to port 1 (not connected) 
>> when port 0 is connected to the switch and has MAC  60dd464f9c.
>>  
>> This is how (try to) I run the job:
>> 1.       mpiexec -np 32 -host compute-08,compute-17,compute-18,compute-16 
>> -mca mtl mx --mca pml cm ./wrf.exe
>> or
>> 2.       Using a similar command but via Sun Grid Engine.
>>  
>> The OS is Centos 6.4, 64bit. OpenMPI 1.6.5 compiled from the official src 
>> rpm with gcc 4.4.7, MX library 1.2.16 manually compiled. Again, this 
>> configuration works fine when only single port servers are used.
>>  
>> Is there a way to tell OpenMPI to stick to the one port that is connected? I 
>> haven’t found any options through ompi_info or via google… Any help will be 
>> greatly appreciated.
>>  
>> Sincerely,
>> Victor.
>>  
> 
> 
> 
> --
>   Brian W. Barrett
>   Scalable System Software Group
>   Sandia National Laboratories
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users

Reply via email to