[OMPI users] checkpoint problem

2012-07-23 Thread
 Hi all,How can I create ckpt files regularly? I mean, do checkpoint every 100 seconds. Is there any options to do this? Or I have to write a script myself?THANKS,---CHEN SongR&D DepartmentNational Supercomputer Center in TianjinBinhai New Area, Tianjin, China

[OMPI users] 回复: [OMPI users] Fault Tolerant Features in OpenMPI

2012-06-25 Thread
feature or have a strong use case for a set of features, then that is important information for the Open MPI developer community. This will help use as a project prioritize the maintenance of various features in the Open MPI project. Best of luck, Josh On Wed, Jun 20, 2012 at 2:59 AM, 陈松 <chens

[OMPI users] 回复: Re: [OMPI users] 2012/06/18 14:35:07 自动保存草稿

2012-06-20 Thread
not needed so we (read this as the official version of Open MPI) do not support it.However, a group of researchers have been working toward a version of Open MPI that supports the last fault tolerance proposal submitted for consideration to the MPI Forum. You can access it at https://bitbucket.org/jj

[OMPI users] 2012/06/18 14:35:07 自动保存草稿

2012-06-19 Thread
Hi all,Can anyone explain me the fault tolerant features in OpenMPI? I've read the FAQs and some papers about this topic listed in open-mpi.org, but still can't figure out when one node of my supercomputer system fails down during computing, what would happen with the fault tolerant mechanism in