subject:"Re\: job cancelled because of management server restart"

Re: job cancelled because of management server restart

2013-09-04 Thread Chip Childers

On Tue, Sep 03, 2013 at 10:53:14PM +, Kelven Yang wrote: > This is a design issue that we need to improve in general. However, a > simple roll back logic does not solve the problem, since abnormal > terminate can happen at any time, which means it can happen in the middle > of job cancellation

Re: job cancelled because of management server restart

2013-09-03 Thread Kelven Yang

This is a design issue that we need to improve in general. However, a simple roll back logic does not solve the problem, since abnormal terminate can happen at any time, which means it can happen in the middle of job cancellation process as well. Under current architecture, the cleanup work is han

Re: job cancelled because of management server restart

2013-09-03 Thread Wei ZHOU

make sense 2013/9/3 Marcus Sorensen > I'm trying to figure out if/how management and agent restarts are > gracefully handled for long running jobs. My initial testing shows > that maybe they aren't. For example, if I try to migrate a storage > volume, and then restart the management server, I e