Re: [OMPI users] Checkpointing an MPI application with OMPI

2013-01-29 Thread Ralph Castain
On Jan 28, 2013, at 10:53 AM, Maxime Boissonneault wrote: > While our filesystem and management nodes are on UPS, our compute nodes are > not. With one average generic (power/cooling mostly) failure every one or two > months, running for weeks is just asking for trouble. If you add to that >

Re: [OMPI users] problem with rankfile and openmpi-1.6.4rc3r27923

2013-01-29 Thread Ralph Castain
Aha - I'm able to replicate it, will fix. On Jan 29, 2013, at 11:57 AM, Ralph Castain wrote: > Using an svn checkout of the current 1.6 branch, if works fine for me: > > [rhc@odin ~/v1.6]$ cat rf > rank 0=odin127 slot=0:0-1,1:0-1 > rank 1=odin128 slot=1 > > [rhc@odin ~/v1.6]$ mpirun -n 2 -rf .

Re: [OMPI users] problem with rankfile and openmpi-1.6.4rc3r27923

2013-01-29 Thread Ralph Castain
Using an svn checkout of the current 1.6 branch, if works fine for me: [rhc@odin ~/v1.6]$ cat rf rank 0=odin127 slot=0:0-1,1:0-1 rank 1=odin128 slot=1 [rhc@odin ~/v1.6]$ mpirun -n 2 -rf ./rf --report-bindings hostname [odin127.cs.indiana.edu:12078] MCW rank 0 bound to socket 0[core 0-1] socket 1

[OMPI users] problem with rankfile and openmpi-1.6.4rc3r27923

2013-01-29 Thread Siegmar Gross
Hi today I have installed openmpi-1.6.4rc3r27923. Unfortunately I still have a problem with rankfiles, if I start a process on a remote machine. tyr rankfiles 114 ssh linpc1 ompi_info | grep "Open MPI:" Open MPI: 1.6.4rc3r27923 tyr rankfiles 115 cat rf_linpc1 rank 0=linpc1 slot

Re: [OMPI users] Basic question about MPI

2013-01-29 Thread Jens Glaser
On Jan 29, 2013, at 12:02 AM, Pradeep Jha wrote: > Thank you for your response. That makes it clear. > > A related question. When I run a general program on a machine, say a Internet > browser/Media player to watch a movie by clicking on the icon of the avi file > in the folder (nothing from

Re: [OMPI users] Basic question about MPI

2013-01-29 Thread Pradeep Jha
Thank you for your response. That makes it clear. A related question. When I run a general program on a machine, say a Internet browser/Media player to watch a movie by clicking on the icon of the avi file in the folder (nothing from the terminal), how many cores does it use? In that case also doe

Re: [OMPI users] Basic question about MPI

2013-01-29 Thread Jens Glaser
Hi Pradeep, On Jan 28, 2013, at 11:16 PM, Pradeep Jha wrote: > I have a very basic question about MPI. > > I have a computer with 8 processors (each with 8 cores). What is the > difference between if I run a program simply by "./program" and "mpirun -np 8 > /path/to/program" ? In the first c

[OMPI users] Basic question about MPI

2013-01-29 Thread Pradeep Jha
Hello, I have a very basic question about MPI. I have a computer with 8 processors (each with 8 cores). What is the difference between if I run a program simply by "./program" and "mpirun -np 8 /path/to/program" ? In the first case does the program just use one processor out of the 8? If I