I would be happy to try this out with a patched version which has some debugging output.

John

2008/5/31 mabshoff <[EMAIL PROTECTED]>:
>
> Hi,
>
> I *think* the problem is in the sage-cleaner, specifically the
> following code:
>
> def kill_spawned_jobs(file, parent_pid):
>     #print "killing %s's spawned jobs"%parent_pid
>     killed_them_all = True
>     for L in open(file).readlines():
>         #print L
>         i = L.find(' ')
>         pid = L[:i].strip()
>         cmd = L[i+1:].strip()
>         try:
>             #print "SAGE-Cleaner: trying to kill %s with parent %s"%(pid, parent_pid)
>             os.killpg(int(pid), 9)
>         except OSError, msg:
>             try:
>                 os.kill(int(pid), 9)
>             except OSError, msg:
>                 pass
>             pass
>         if is_running(pid):
>             #print "try again later to kill %s"%pid
>             # try again later
>             killed_them_all = False
>     return killed_them_all
>
>
> Instead of "os.killpg(int(pid), 9)" we should probably use the shell's
> kill directly, since when I see jobs running amok on sage.math, for
> example, a "kill -9 $PID" has killed them every time. We might also
> want to re-enable a little bit of debug output in the above file [maybe
> conditionally, in case some env variable is set] so that we can
> collect some data and hopefully see the pattern.
>
> Cheers,
>
> Michael
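
For what it's worth, a patched version with conditional debug output might look roughly like the sketch below. It is only an illustration and not the actual sage-cleaner code: the environment variable name SAGE_CLEANER_DEBUG and the debug() helper are made up for this example, is_running() is a stand-in for the cleaner's own function (which is not shown above), and the code is written for a current Python rather than the one Sage shipped at the time.

```python
import os
import signal
import sys

# Hypothetical environment variable; not an existing Sage setting.
DEBUG_ENV_VAR = "SAGE_CLEANER_DEBUG"


def debug(msg):
    """Write a debug line to stderr only when the (assumed) variable is set."""
    if os.environ.get(DEBUG_ENV_VAR):
        sys.stderr.write("SAGE-Cleaner: %s\n" % msg)


def is_running(pid):
    """Stand-in for the cleaner's is_running(): probe the pid with signal 0."""
    try:
        os.kill(int(pid), 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def kill_spawned_jobs(path, parent_pid):
    """Try to kill every pid listed in `path`; return True if none survive."""
    killed_them_all = True
    with open(path) as f:
        for line in f:
            fields = line.split(None, 1)  # "<pid> <command ...>"
            if not fields:
                continue  # skip blank lines
            pid = fields[0]
            debug("trying to kill %s with parent %s" % (pid, parent_pid))
            try:
                # First try the whole process group, as the original does.
                os.killpg(int(pid), signal.SIGKILL)
            except OSError as err:
                debug("killpg(%s) failed: %s" % (pid, err))
                try:
                    # Fall back to killing the single process.
                    os.kill(int(pid), signal.SIGKILL)
                except OSError as err2:
                    debug("kill(%s) failed: %s" % (pid, err2))
            if is_running(pid):
                debug("try again later to kill %s" % pid)
                killed_them_all = False
    return killed_them_all
```

Running the cleaner with SAGE_CLEANER_DEBUG=1 set would then log which call failed and with which error, which is the kind of data Michael is asking for. One caveat with this stand-in is_running(): a killed child that its parent has not yet reaped still answers the signal-0 probe, so it is reported as running until it is waited on.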