Re: [OMPI users] errors in mpiexec command with "-loadbalance" options

2010-07-16 Thread Ralph Castain
Yes, 1.4.2 has a bug in loadbalance and the npernode options. These are corrected in an upcoming 1.4.3 release. Meantime, you can download a pre-release (but good) snapshot from the OMPI site: http://www.open-mpi.org/nightly/v1.4/ On Jul 16, 2010, at 5:12 PM, Qiyang Hu wrote: > Hello, everyon

[OMPI users] errors in mpiexec command with "-loadbalance" options

2010-07-16 Thread Qiyang Hu
Hello, everyone, We got some "Segmentation fault" errors in running "mpiexec" with "-loadbalance" option (details see below). However, we don't have any problem in using "-bynode" or "-nolocal". We tried in both Intel compiler and GCC 4.1.2. The same type of error appears. Here is the error

Re: [OMPI users] ompi-restart, ompi-ps problem

2010-07-16 Thread Josh Hursey
(Sorry for the late reply) On Jun 7, 2010, at 4:48 AM, Nguyen Kim Son wrote: > Hello, > > I'n trying to get functions like orte-checkpoint, orte-restart,... works but > there are some errors that I don't have any clue about. > > Blcr (0.8.2) works fine apparently and I have installed openmpi

Re: [OMPI users] ompi-restart failed

2010-07-16 Thread Josh Hursey
Open MPI can restart multi-threaded applications on any number of nodes (I do this routinely in testing). If you are still experiencing this problem (sorry for the late reply), can you send me the MCA parameters that you are using, command line, and a backtrace from the corefile generated by th

Re: [OMPI users] How to checkpoint atomic function in OpenMPI

2010-07-16 Thread Josh Hursey
On Jun 14, 2010, at 5:26 AM, Nguyen Toan wrote: > Hi all, > I have a MPI program as follows: > --- > int main(){ >MPI_Init(); >.. >for (i=0; i<1; i++) { > my_atomic_func(); >} >... >MPI_Finalize(); >return 0; > } > >

Re: [OMPI users] Question on checkpoint overhead in Open MPI

2010-07-16 Thread Josh Hursey
The amount of checkpoint overhead is application and system configuration specific. So it is impossible to give you a good answer to how much checkpoint overhead to expect for your application and system setup. BLCR is only used to capture the single process image. The coordination of the distr

Re: [OMPI users] do all processors have to execute MPI_Bcast?

2010-07-16 Thread Prentice Bisbal
Eugene Loh wrote: > Anton Shterenlikht wrote: > >> Will this bit of code work: >> >> if (rank .eq. ) then >> >> *change var* >> >> call MPI_Bcast(var, 1, MPI_INTEGER, rank, & >> MPI_COMM_WORLD, errstat) >> end if >> >> call MPI_Barrier(MPI_COMM_WORLD, errs

Re: [OMPI users] do all processors have to execute MPI_Bcast?

2010-07-16 Thread Eugene Loh
Anton Shterenlikht wrote: Will this bit of code work: if (rank .eq. ) then *change var* call MPI_Bcast(var, 1, MPI_INTEGER, rank, & MPI_COMM_WORLD, errstat) end if call MPI_Barrier(MPI_COMM_WORLD, errstat) I'm u

[OMPI users] do all processors have to execute MPI_Bcast?

2010-07-16 Thread Anton Shterenlikht
Will this bit of code work: if (rank .eq. ) then *change var* call MPI_Bcast(var, 1, MPI_INTEGER, rank, & MPI_COMM_WORLD, errstat) end if call MPI_Barrier(MPI_COMM_WORLD, errstat) I'm using a int variable on all pr

Re: [OMPI users] [openib] segfault when using openib btl

2010-07-16 Thread Eloi Gaudry
Hi Edgar, The only difference I could observed was that the segmentation fault appeared sometimes later during the parallel computation. I'm running out of idea here. I wish I could use the "--mca coll tuned" with "--mca self,sm,tcp" so that I could check that the issue is not somehow limited

Re: [OMPI users] orted unknown option "--daemonize"

2010-07-16 Thread TH Chew
Jeff, An update of what I did. Apparently, one of my lab mates installed another version of OpenMPI manually and it clashed with the OpenMPI I installed from the Ubuntu repository. I manually identified the files installed and deleted them. After I installed OpenMPI from Ubuntu repository, my "mpi

Re: [OMPI users] orted unknown option "--daemonize"

2010-07-16 Thread TH Chew
Jeff, Thanks for the suggestion. Been looking into it and although, I installed the same OpenMPI version. But somehow, another software (Discovery Studio) was installed on birg-desktop-10, causing the mpirun to be messed up (since Discovery Studio also install some kind of mpirun/mpiexec). I type