Feel free to correct me if I'm wrong.

OMPI assumes you have a fast network and probes for one at startup. If none is found, it falls back to TCP.
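You can check which BTL components your build actually contains with ompi_info (the grep is just a convenient filter):

ompi_info | grep btl

If an mx line shows up in that output, MX support was compiled in and is a candidate at run time.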

So if you leave out the --mca option entirely, it should use MX if it's available. I'm not sure how MX responds if one of the hosts does not have a working card (not activated); the MPI job will still run, just without using MX to that host. All other hosts will use MX.
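If you want to rule out a silent fallback to TCP while testing, one option is the ^ exclusion syntax (pi3f90.x here is just the example binary from the message below):

mpirun --mca btl ^tcp -np 2 pi3f90.x

With TCP excluded, a pair of hosts that cannot talk over MX (or shared memory) should fail outright instead of quietly running over Ethernet.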

If Open MPI sees that a node has more than one CPU (SMP), it will use the sm (shared memory) BTL instead of MX for communication within that node, and if a process sends to itself, the self BTL is used. So it's like a priority order.
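That layering is exactly what the explicit command quoted below spells out, naming all three so every pair of processes takes the best path available to it:

mpirun --mca btl mx,sm,self -np 2 pi3f90.x

self covers a rank sending to itself, sm covers ranks on the same node, and mx covers everything between nodes.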

I know there is a way (it's in the list archives) to see the priorities Open MPI assigns to each method; it uses the highest-priority one that will allow the communication to complete.
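The quickest way I know to dump those settings is ompi_info (the exact parameter names, priority vs. exclusivity, vary a bit between releases):

ompi_info --param btl all

Look at the priority/exclusivity value reported for each btl component; when more than one transport can reach a peer, the highest value wins.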

I know there is also some magic being worked on/implemented that will stripe large messages over multiple networks when more bandwidth is needed. I don't know if Open MPI will have this ability or not; someone else can chime in on that.

Brock Palen
Center for Advanced Computing
bro...@umich.edu
(734)936-1985


On Nov 21, 2006, at 11:28 PM, Iannetti, Anthony C. (GRC-RTB0) wrote:

Dear OpenMPI List:



From looking at a recent thread, I see an mpirun command with shared memory and mx:



mpirun --mca btl mx,sm,self -np 2 pi3f90.x



This works. I may have forgotten to mention it, but I am using 1.1.2. I see there is an --mca mtl option in version 1.2b1; I do not think this exists in 1.1.2.
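From what I can tell (and I have not verified this on 1.2b1), the MX MTL there is driven through the cm PML rather than the BTL list, so the equivalent command would look something like:

mpirun --mca pml cm -np 2 pi3f90.x

But again, that path does not exist in 1.1.2.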

Still, I would like to know which --mca settings are used automatically.



Thanks,

Tony







Anthony C. Iannetti, P.E.

NASA Glenn Research Center

Propulsion Systems Division, Combustion Branch

21000 Brookpark Road, MS 5-10

Cleveland, OH 44135

phone: (216)433-5586

email: anthony.c.ianne...@nasa.gov



Please note: All opinions expressed in this message are my own and NOT of NASA. Only the NASA Administrator can speak on behalf of NASA.



From: Iannetti, Anthony C. (GRC-RTB0)
Sent: Tuesday, November 21, 2006 8:39 PM
To: 'us...@open-mpi.org'
Subject: MX performance problem on two processor nodes



Dear OpenMPI List:



I am running the Myrinet MX btl with OpenMPI on MacOSX 10.4. I am running into a problem. When I run on one processor per node, OpenMPI runs just fine. When I run on two processors per node (slots=2), it seems to take forever (something is hanging).



Here is the command:

mpirun --mca btl mx,self -np 2 pi3f90.x



However, if I give the command:

mpirun -np 2 pi3f90.x



The process runs normally, but I do not know if it is using the Myrinet network. Is there a way to diagnose this problem? mpirun -v and -d do not seem to indicate which MCA components are actually being used.
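I have seen the btl_base_verbose parameter mentioned on this list; perhaps something like

mpirun --mca btl mx,self --mca btl_base_verbose 30 -np 2 pi3f90.x

would show which component each connection ends up using, but I have not confirmed that myself.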



Thanks,

Tony



Anthony C. Iannetti, P.E.

NASA Glenn Research Center

Propulsion Systems Division, Combustion Branch

21000 Brookpark Road, MS 5-10

Cleveland, OH 44135

phone: (216)433-5586

email: anthony.c.ianne...@nasa.gov



Please note: All opinions expressed in this message are my own and NOT of NASA. Only the NASA Administrator can speak on behalf of NASA.



_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
