[OMPI users] High Checkpoint Overhead Ratio

2010-08-30 Thread
Dear OMPI Users, I’m now using BLCR-0.8.2 and OpenMPI-1.5rc5. The problem is that it takes a very long time to checkpoint. BLCR configuration: ./onfigure --prefix=/opt/blcr --enable-static OpenMPi configuration: ./configure --prefix=/opt/ompi --with-ft=cr --with-blcr=/opt/blcr --enable-s

[OMPI users] Checkpoint problem with BLCR + OpenMPI

2010-08-27 Thread
Dear OMPI Users, I have installed BLCR(0.8.2) and OpenMPI(1.4.2) successfully. But now I met a problem when I take a checkpoint. I run CG NPB(NPROCS=16, two nodes: blade02 & blade04, CLASS=C, NFS: $HOME & /opt are shared) BLCR configure: ./configure �Cprefix=/opt/blcr �Cenable-static Open

Re: [OMPI users] OpenMPI with BLCR runtime problem

2010-08-25 Thread
I was so careless. BLCR Admin Guide says: as the root, load the kernel modules in this order: # /sbin/insmod /usr/local/lib/blcr/2.6.12-1.234/blcr_imports.ko # /sbin/insmod /usr/local/lib/blcr/2.6.12-1.234/blcr.ko In the last email, I load the kernel in the wrong order. And I followed the o

Re: [OMPI users] OpenMPI with BLCR runtime problem

2010-08-25 Thread
I really thank you for your advice, Josh. As you say, when check 'lsmod | grep blcr' on blade02, nothing shows. That means no blcr module is inserted on blade02. I think that's the main reason why I can't C/R mpi programs on these two nodes. But here is the problem: I installed blcr under /opt/blcr

[OMPI users] OpenMPI with BLCR runtime problem

2010-08-24 Thread
Dear OMPI users, I configured and installed OpenMPI-1.4.2 and BLCR-0.8.2. (blade01 �C blade10, nfs) BLCR configure script: ./configure �Cprefix=/opt/blcr �Cenable-static After the installation, I can see the ‘blcr’ module loaded correctly (lsmod | grep blcr). And I can also run ‘cr_run’, ‘cr_