Just to clarify: I am not aware of any MPI that will allow you to relocate a process while it is running. You have to checkpoint the job, terminate it, and then restart the entire thing with the desired process on the new node.
> On Mar 16, 2016, at 3:15 AM, Husen R <hus...@gmail.com> wrote: > > In the case of MPI application (not gromacs), How do I relocate MPI > application from one node to another node while it is running ? > I'm sorry, as far as I know the ompi-restart command is used to restart > application, based on checkpoint file, once the application already > terminated (no longer running). > > Thanks > > regards, > > Husen > > On Wed, Mar 16, 2016 at 4:29 PM, Jeff Hammond <jeff.scie...@gmail.com > <mailto:jeff.scie...@gmail.com>> wrote: > Just checkpoint-restart the app to relocate. The overhead will be lower than > trying to do with MPI. > > Jeff > > > On Wednesday, March 16, 2016, Husen R <hus...@gmail.com > <mailto:hus...@gmail.com>> wrote: > Hi Jeff, > > Thanks for the reply. > > After consulting the Gromacs docs, as you suggested, Gromacs already supports > checkpoint/restart. thanks for the suggestion. > > Previously, I asked about checkpoint/restart in Open MPI because I want to > checkpoint MPI Application and restart/migrate it while it is running. > For the example, I run MPI application in node A,B and C in a cluster and I > want to migrate process running in node A to other node, let's say to node C. > is there a way to do this with open MPI ? thanks. > > Regards, > > Husen > > > > > On Wed, Mar 16, 2016 at 12:37 PM, Jeff Hammond <jeff.scie...@gmail.com <>> > wrote: > Why do you need OpenMPI to do this? Molecular dynamics trajectories are > trivial to checkpoint and restart at the application level. I'm sure Gromacs > already supports this. Please consult the Gromacs docs or user support for > details. > > Jeff > > > On Tuesday, March 15, 2016, Husen R <hus...@gmail.com <>> wrote: > Dear Open MPI Users, > > > Does the current stable release of Open MPI (v1.10 series) support fault > tolerant feature ? > I got the information from Open MPI FAQ that The checkpoint/restart support > was last released as part of the v1.6 series. > I just want to make sure about this. > > and by the way, does Open MPI able to checkpoint or restart mpi > application/GROMACS automatically ? > Please, I really need help. > > Regards, > > > Husen > > > -- > Jeff Hammond > jeff.scie...@gmail.com <> > http://jeffhammond.github.io/ <http://jeffhammond.github.io/> > > _______________________________________________ > users mailing list > us...@open-mpi.org <> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > <http://www.open-mpi.org/mailman/listinfo.cgi/users> > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/03/28705.php > <http://www.open-mpi.org/community/lists/users/2016/03/28705.php> > > > > -- > Jeff Hammond > jeff.scie...@gmail.com <mailto:jeff.scie...@gmail.com> > http://jeffhammond.github.io/ <http://jeffhammond.github.io/> > > _______________________________________________ > users mailing list > us...@open-mpi.org <mailto:us...@open-mpi.org> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > <http://www.open-mpi.org/mailman/listinfo.cgi/users> > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/03/28709.php > <http://www.open-mpi.org/community/lists/users/2016/03/28709.php> > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/03/28710.php