Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-10 Thread Randolph Pullen
Cc: OpenMPI Users Sent: Monday, 10 September 2012 9:11 PM Subject: Re: [OMPI users] Infiniband performance Problem and stalling Randolph, So what you saying in short, leaving all the numbers aside, is the following: In your particular application on your particular setup with this particular

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-10 Thread Yevgeny Kliteynik
Yevgeny Kliteynik > *To:* Randolph Pullen > *Cc:* OpenMPI Users > *Sent:* Sunday, 9 September 2012 6:18 PM > *Subject:* Re: [OMPI users] Infiniband performance Problem and stalling > > Randolph, > > O

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-09 Thread Randolph Pullen
See my comments in line... From: Yevgeny Kliteynik To: Randolph Pullen Cc: OpenMPI Users Sent: Sunday, 9 September 2012 6:18 PM Subject: Re: [OMPI users] Infiniband performance Problem and stalling Randolph, On 9/7/2012 7:43 AM, Randolph Pullen wrote

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-09 Thread Yevgeny Kliteynik
Randolph, On 9/7/2012 7:43 AM, Randolph Pullen wrote: > Yevgeny, > The ibstat results: > CA 'mthca0' > CA type: MT25208 (MT23108 compat mode) What you have is InfiniHost III HCA, which is 4x SDR card. This card has theoretical peak of 10 Gb/s, which is 1GB/s in IB bit coding. > And more interest

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-07 Thread Randolph Pullen
: [OMPI users] Infiniband performance Problem and stalling On 9/3/2012 4:14 AM, Randolph Pullen wrote: > No RoCE, Just native IB with TCP over the top. Sorry, I'm confused - still not clear what is "Melanox III HCA 10G card". Could you run "ibstat" and post the results?

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-07 Thread Randolph Pullen
2012 6:03 PM Subject: Re: [OMPI users] Infiniband performance Problem and stalling On 9/3/2012 4:14 AM, Randolph Pullen wrote: > No RoCE, Just native IB with TCP over the top. Sorry, I'm confused - still not clear what is "Melanox III HCA 10G card". Could you run "ibstat&qu

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-06 Thread Yevgeny Kliteynik
--- > *From:* Yevgeny Kliteynik > *To:* Randolph Pullen ; Open MPI Users > > *Sent:* Sunday, 2 September 2012 10:54 PM > *Subject:* Re: [OMPI users] Infiniband performance Problem and stalling > > Randolph, > > Some clarification on the setup: > > &q

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-02 Thread Randolph Pullen
ay, 2 September 2012 10:54 PM Subject: Re: [OMPI users] Infiniband performance Problem and stalling Randolph, Some clarification on the setup: "Melanox III HCA 10G cards" - are those ConnectX 3 cards configured to Ethernet? That is, when you're using openib BTL, you mean RoCE, right?

Re: [OMPI users] Infiniband performance Problem and stalling

2012-09-02 Thread Yevgeny Kliteynik
Randolph, Some clarification on the setup: "Melanox III HCA 10G cards" - are those ConnectX 3 cards configured to Ethernet? That is, when you're using openib BTL, you mean RoCE, right? Also, have you had a chance to try some newer OMPI release? Any 1.6.x would do. -- YK On 8/31/2012 10:53 AM,

[OMPI users] Infiniband performance Problem and stalling

2012-08-31 Thread Randolph Pullen
(reposted with consolidated information) I have a test rig comprising 2 i7 systems 8GB RAM with Melanox III HCA 10G cards running Centos 5.7 Kernel 2.6.18-274 Open MPI 1.4.3 MLNX_OFED_LINUX-1.5.3-1.0.0.2 (OFED-1.5.3-1.0.0.2): On a Cisco 24 pt switch   Normal performance is: $ mpirun --mca btl openi

Re: [OMPI users] Infiniband performance Problem and stalling

2012-08-30 Thread Randolph Pullen
ubject: Re: [OMPI users] Infiniband performance Problem and stalling Randolph, after reading this: On 08/28/12 04:26, Randolph Pullen wrote: > - On occasions it seems to stall indefinately, waiting on a single receive. ... I would make a blind guess: are you aware about IB card parameters fo

Re: [OMPI users] Infiniband performance Problem and stalling

2012-08-29 Thread Randolph Pullen
iband performance Problem and stalling Randolph, after reading this: On 08/28/12 04:26, Randolph Pullen wrote: > - On occasions it seems to stall indefinately, waiting on a single receive. ... I would make a blind guess: are you aware about IB card parameters for registered memory? http://www.open

Re: [OMPI users] Infiniband performance Problem and stalling

2012-08-28 Thread Paul Kapinos
Randolph, after reading this: On 08/28/12 04:26, Randolph Pullen wrote: - On occasions it seems to stall indefinately, waiting on a single receive. ... I would make a blind guess: are you aware about IB card parameters for registered memory? http://www.open-mpi.org/faq/?category=openfabrics#

[OMPI users] Infiniband performance Problem and stalling

2012-08-27 Thread Randolph Pullen
I have a test rig comprising 2 i7 systems with Melanox III HCA 10G cards running Centos 5.7 Kernel 2.6.18-274 Open MPI 1.4.3 MLNX_OFED_LINUX-1.5.3-1.0.0.2 (OFED-1.5.3-1.0.0.2): On a Cisco 24 pt switch Normal performance is: $ mpirun --mca btl openib,self -n 2 -hostfile mpi.hosts  PingPong results