On Tue, Sep 03, 2013 at 10:53:14PM +, Kelven Yang wrote:
This is a design issue that we need to improve in general. However,
simple rollback logic does not solve the problem, since abnormal
termination can happen at any time, including in the middle of the
job cancellation process itself.
Under the current architecture, the cleanup work is han
make sense
2013/9/3 Marcus Sorensen
> I'm trying to figure out if/how management and agent restarts are
> gracefully handled for long running jobs. My initial testing shows
> that maybe they aren't. For example, if I try to migrate a storage
> volume, and then restart the management server, I e