[OMPI users] Automatic checkpoint/restart in OpenMPI

2009-04-20 Thread ESTEBAN MENESES ROJAS
Hello. Is there any way to automatically checkpoint/restart an application in OpenMPI? That is, checkpointing the application without using the command ompi-checkpoint, perhaps via a function call in the application's code itself. The same with the restart after a failure. On a related

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Gus Correa
Hi Amjad, list HPL has some quirks to install, as I just found out. It can be done, though. I had used a precompiled version of HPL on my Rocks cluster before, but that version is no longer being distributed, unfortunately. Go to the HPL "setup" directory, and run the script "make_generic". This

Re: [OMPI users] OpenMPI 1.2 rank?

2009-04-20 Thread Ralph Castain
Yes, it should - at least, for the more common environments (e.g., ssh). On Apr 20, 2009, at 4:25 PM, Ross Boylan wrote: On Mon, 2009-04-20 at 16:22 -0600, Ralph Castain wrote: Afraid there really isn't anything in the 1.2.x series - we set several MPI-specific envars beginning with 1.3.0, but

Re: [OMPI users] OpenMPI 1.2 rank?

2009-04-20 Thread Ross Boylan
On Mon, 2009-04-20 at 16:22 -0600, Ralph Castain wrote: > Afraid there really isn't anything in the 1.2.x series - we set > several MPI-specific envars beginning with 1.3.0, but not in the older > releases. > > The problem with using something like OMPI_MCA_ns_nds_vpid is that we > are free

Re: [OMPI users] OpenMPI 1.2 rank?

2009-04-20 Thread Ralph Castain
Afraid there really isn't anything in the 1.2.x series - we set several MPI-specific envars beginning with 1.3.0, but not in the older releases. The problem with using something like OMPI_MCA_ns_nds_vpid is that we are free to change/eliminate it at any time - in fact, you won't find that
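For readers on 1.3.0 or later, a minimal sketch of reading the rank in a shell script. It assumes that OMPI_COMM_WORLD_RANK is among the MPI-specific envars mentioned above; the name is not given in this thread, and `get_rank` is a hypothetical helper. Under 1.2.x the helper simply reports failure, since no supported variable exists there.

```shell
#!/bin/sh
# Sketch only: prints the MPI rank when launched by mpirun from Open MPI >= 1.3.0.
# Assumption: OMPI_COMM_WORLD_RANK is the rank envar (not named in the thread).
get_rank() {
    if [ -n "${OMPI_COMM_WORLD_RANK:-}" ]; then
        printf '%s\n' "$OMPI_COMM_WORLD_RANK"
    else
        # No supported rank variable (e.g. 1.2.x): fail rather than guess
        return 1
    fi
}

if rank=$(get_rank); then
    echo "running as rank $rank"
else
    echo "rank not available (not under mpirun, or Open MPI older than 1.3.0)" >&2
fi
```

Failing explicitly keeps the script from depending on internal variables such as OMPI_MCA_ns_nds_vpid, which may change or disappear.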

[OMPI users] OpenMPI 1.2 rank?

2009-04-20 Thread Ross Boylan
How do I determine my rank in a shell script under OpenMPI 1.2? The only thing I've found that looks promising is the environment variable OMPI_MCA_ns_nds_vpid, and earlier discussion on this list said that was for "internal use only". I'm on Debian Lenny, which just released with openmpi 1.2.7~rc2

Re: [OMPI users] 1.3.1 -rf rankfile behaviour ??

2009-04-20 Thread Ralph Castain
I'm afraid this is a more extensive rewrite than I had hoped - the revisions are most unlikely to make it for 1.3.2. Looks like it will be 1.3.3 at the earliest. Ralph On Mon, Apr 20, 2009 at 7:50 AM, Lenny Verkhovsky < lenny.verkhov...@gmail.com> wrote: > Me too, sorry, it definitely seems like

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Prakash Velayutham
Hi Ankush, You can get some example MPI programs from http://www.pdc.kth.se/training/Tutor/MPI/Templates/index-frame.html . You can compare the performance of these in a MPI (single processor, multiple processors) setting and non-MPI (serial) setting to show how it can help their research.

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Eugene Loh
Ankush Kaul wrote: Let me describe what I want to do. I took Linux clustering as my final-year engineering project because I'm really interested in networking. To tell the truth, our college does not have any professor with knowledge of clustering. The aim of our project was just to make a

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Ankush Kaul
Let me describe what I want to do. I took Linux clustering as my final-year engineering project because I'm really interested in networking. To tell the truth, our college does not have any professor with knowledge of clustering. The aim of our project was just to make a cluster, which we did. no

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Gus Correa
Hi Ankush Ankush Kaul wrote: Thanks a lot, I'm implementing the passwordless cluster. I'm also trying different benchmarking software and got fed up with all the problems in all the software I try. I will list a few: *1) VampirTrace* I extracted the tar in /vt, then followed these steps I never use

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Ankush Kaul
Thanks a lot, I'm implementing the passwordless cluster. I'm also trying different benchmarking software and got fed up with all the problems in all the software I try. I will list a few: *1) VampirTrace* I extracted the tar in /vt, then followed these steps *$ ./configure --prefix=/vti* [...lots

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Gus Correa
Hi Ankush Please read the FAQ I sent you in the previous message. That is the answer to your repeated question. OpenMPI (and all MPIs that I know of) requires passwordless connections. Your program fails because you didn't set that up. If it worked with a single compute node, that was most likely
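One way to check the passwordless requirement before launching a job might look like the sketch below. The host names in the usage comment, the `check_passwordless` helper and the `SSH_CMD` override are all hypothetical; only the standard ssh options BatchMode and ConnectTimeout are relied upon.

```shell
#!/bin/sh
# Sketch only: report which hosts accept a login without prompting for a password.
# SSH_CMD is a hypothetical override so the check can be dry-run without a cluster.
check_passwordless() {
    failed=0
    for host in "$@"; do
        # BatchMode=yes makes ssh fail instead of asking for a password
        if ${SSH_CMD:-ssh} -o BatchMode=yes -o ConnectTimeout=5 "$host" true; then
            echo "ok: $host"
        else
            echo "needs setup: $host"
            failed=1
        fi
    done
    return "$failed"
}

# Example call (host names are placeholders):
# check_passwordless node1 node2
```

A nonzero return means at least one host would have prompted for a password, which is the situation in which a multi-node launch is expected to fail.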

Re: [OMPI users] 1.3.1 -rf rankfile behaviour ??

2009-04-20 Thread Lenny Verkhovsky
Me too, sorry, it definitely seems like a bug. Somewhere in the code there is probably an undefined variable. I just never tested this code with such a "bizarre" command line :) Lenny. On Mon, Apr 20, 2009 at 4:08 PM, Geoffroy Pignot wrote: > Thanks, > > I am not in a hurry but it would be nice if I could bene

Re: [OMPI users] Problem with running openMPI program

2009-04-20 Thread Gus Correa
Hi Ankush Ankush Kaul wrote: Also how can i find out where are my mpi libraries and include directories? If you configured OpenMPI with --prefix=/some/dir they are in /some/dir/lib and /some/dir/include, whereas the executables (mpicc, mpiexec, etc) are in /some/dir/bin. Otherwise OpenMPI defa
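The prefix layout described above (bin, lib and include under one directory) can be turned into a small lookup. This is a sketch under the assumption that mpicc on the PATH is a real file inside the install prefix rather than a symlink into another tree; `mpi_prefix_from_bin` is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch only: infer the install prefix from the location of an MPI executable.
mpi_prefix_from_bin() {
    # $1: full path to an executable, e.g. /some/dir/bin/mpicc
    bindir=$(dirname "$1")
    dirname "$bindir"
}

mpicc_path=$(command -v mpicc) || mpicc_path=
if [ -n "$mpicc_path" ]; then
    prefix=$(mpi_prefix_from_bin "$mpicc_path")
    echo "libraries: $prefix/lib"
    echo "headers:   $prefix/include"
else
    echo "mpicc not found on PATH" >&2
fi
```

If mpicc is reached through a symlink (as some distribution packages arrange), the inferred prefix may be wrong, so the printed directories should be checked by hand.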

Re: [OMPI users] 1.3.1 -rf rankfile behaviour ??

2009-04-20 Thread Geoffroy Pignot
Thanks, I am not in a hurry but it would be nice if I could benefit from this feature in the next release. Regards Geoffroy

Re: [OMPI users] 1.3.1 -rf rankfile behaviour ??

2009-04-20 Thread Ralph Castain
Honestly haven't had time to look at it yet - hopefully in the next couple of days... Sorry for delay On Apr 20, 2009, at 2:58 AM, Geoffroy Pignot wrote: Do you have any news about this bug. Thanks Geoffroy Message: 1 Date: Tue, 14 Apr 2009 07:57:44 -0600 From: Ralph Castain Subject: Re

Re: [OMPI users] 1.3.1 -rf rankfile behaviour ??

2009-04-20 Thread Geoffroy Pignot
Do you have any news about this bug. Thanks Geoffroy > > Message: 1 > Date: Tue, 14 Apr 2009 07:57:44 -0600 > From: Ralph Castain > Subject: Re: [OMPI users] 1.3.1 -rf rankfile behaviour ?? > To: Open MPI Users > Message-ID: > Content-Type: text/plain; charset="us-ascii"; Format="flowed"; >