i have a question about the checkpoint/restart facility of BLCR with OPEN MPI ,
does the checkpoint/restart solution as a whole can be considered as a
coordinated or uncoordinated approach
i am trying to run Blcr with Open mpi on a cluster of 4 nodes
blcr version 0.8.5
when i run the command :
mpirun -np 4 -am ft-enable-cr -hostfile hosts
/home/ubuntu//N/NPB3.3-MPI/bin/bt.B.4
i got this error :
-
It looks like opal_init failed for some reason;
can i run open mpi application on hadoop ?i need to test open mpi over
cloudwhat other cloud alternatives i can use to test open mpi over cloud
thank you
i need to use DMTCP to checkpoint and restart OpenMPI program
which version of Open MPI is working fine with Dmtcp?if any
Any Suggestions
From: basmaabdelaz...@hotmail.com
To: us...@open-mpi.org
Subject: FT problem
Date: Wed, 18 Sep 2013 16:42:29 +0200
i am using openmpi-1.6.1
i need to try checkpoint restart ( self , blcr )
after i installed openmpi i had the following in my installation folder :
bin\ ompi-ch
i am using openmpi-1.6.1
i need to try checkpoint restart ( self , blcr )
after i installed openmpi i had the following in my installation folder :
bin\ ompi-checkpoint
bin\ompi-restart
lib\openmpi\mca_crs_self.la
lib\openmpi\mca_crs_self.so
lib\openmpi
sorry for the trivial question
i am new to open mpi and parallel computing
i installed openmpi-1.6.1 on my pc which has an ubuntu 12.10
also i have nas parallel benchmark , i need to edit the NPB make file
"make.def"
i need to know what is the linker library file for both fortran and c compilers