Re: [OMPI users] Open MPI and OpenIB

2006-05-17 Thread Brian Barrett
On May 15, 2006, at 9:14 AM, Gurhan Ozen wrote: Jeff, George, Brian thanks for your input on this. I did "kind of" get openib working. Different kernel revisions were running on the two boxes; getting them running on the very same kernel revision and recompiling open-mpi with that rev. of k

Re: [OMPI users] Open MPI and OpenIB

2006-05-15 Thread Gurhan Ozen
> On May 11, 2006, at 4:11 PM, Gurhan Ozen wrote: > > At any rate though, --mca btl ib,self looks like the traffic goes over ethernet device .. I couldn't find any documentation on the "self" > argument of mca, does it mean

Re: [OMPI users] Open MPI and OpenIB

2006-05-12 Thread Jeff Squyres (jsquyres)
> On Thursday, May 11, 2006, at 4:11 PM, Gurhan Ozen wrote: > > At any rate though, --mca bt

Re: [OMPI users] Open MPI and OpenIB

2006-05-11 Thread George Bosilca
This message indicates that one of the nodes is not able to set up a route to the peer using the openib device. Did you run any openib tests on your cluster? I mean any tests which do not involve MPI? Otherwise, if you compile in debug mode there are 2 parameters you can use to get more info

Re: [OMPI users] Open MPI and OpenIB

2006-05-11 Thread Gurhan Ozen
Dagnabbit.. I was specifying ib, not openib .. When I specified openib, I got this error: " -- It looks like MPI_INIT failed for some reason; your parallel process is likely to abort. There are many reasons that a parallel p

Re: [OMPI users] Open MPI and OpenIB

2006-05-11 Thread Brian Barrett
On May 11, 2006, at 10:10 PM, Gurhan Ozen wrote: Brian, Thanks for the very clear answers. I did change my code to include fflush() calls after printf() ... And I did try with --mca btl ib,self . Interesting result: with --mca btl ib,self hello_world works fine, but broadcast hangs after I

Re: [OMPI users] Open MPI and OpenIB

2006-05-11 Thread Gurhan Ozen
Brian, Thanks for the very clear answers. I did change my code to include fflush() calls after printf() ... And I did try with --mca btl ib,self . Interesting result: with --mca btl ib,self hello_world works fine, but broadcast hangs after I enter the vector length. At any rate though, --mca

Re: [OMPI users] Open MPI and OpenIB

2006-05-11 Thread Brian Barrett
On May 10, 2006, at 10:46 PM, Gurhan Ozen wrote: My ultimate goal is to get Open MPI working with the openIB stack. First, I had installed lam-mpi; I know it doesn't have support for openIB but it's still relevant to some of the questions I will ask.. Here is the setup I have: Yes, keep

[OMPI users] Open MPI and OpenIB

2006-05-10 Thread Gurhan Ozen
Hi Everyone, This is going to be a long email, so please bear with me. The example programs were obtained from the lam-mpi.org site ... My ultimate goal is to get Open MPI working with the openIB stack. First, I had installed lam-mpi; I know it doesn't have support for openIB but it's still relevant