Hi, I *think* the problem is in the sage-cleaner, specifically the following code:

    def kill_spawned_jobs(file, parent_pid):
        #print "killing %s's spawned jobs"%parent_pid
        killed_them_all = True
        for L in open(file).readlines():
            #print L
            i = L.find(' ')
            pid = L[:i].strip()
            cmd = L[i+1:].strip()
            try:
                #print "SAGE-Cleaner: trying to kill %s with parent %s"% (pid, parent_pid)
                os.killpg(int(pid), 9)
            except OSError, msg:
                try:
                    os.kill(int(pid), 9)
                except OSError, msg:
                    pass
                pass
            if is_running(pid):
                #print "try again later to kill %s"%pid
                # try again later
                killed_them_all = False
        return killed_them_all

Instead of "os.killpg(int(pid), 9)" we should probably use the shell's kill directly, since whenever I see jobs running amok, on sage.math for example, a "kill -9 $PID" has killed them every time.

We might also want to re-enable a little bit of the debug output in the above file [maybe conditionally, only if some environment variable is set] so that we can collect some data and hopefully see the pattern.

Cheers,

Michael
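
P.S. To make the two suggestions above concrete, here is a rough sketch, not a tested patch for sage-cleaner. It is written for a current Python rather than the Python 2 style of the quoted code. The environment variable SAGE_CLEANER_DEBUG and the helpers debug() and kill_with_shell() are names I made up for illustration; none of them exist in Sage.

```python
import os
import subprocess
import sys

# Hypothetical switch: the variable name SAGE_CLEANER_DEBUG is made up for
# this sketch and is not an existing Sage setting.
DEBUG = bool(os.environ.get("SAGE_CLEANER_DEBUG"))


def debug(msg):
    """Write a diagnostic line to stderr, but only when debugging is on."""
    if DEBUG:
        sys.stderr.write("SAGE-Cleaner: %s\n" % msg)


def kill_with_shell(pid):
    """Run the shell's `kill -9` on pid and return the shell's exit status."""
    pid = int(pid)  # ensures only an integer is interpolated into the command
    status = subprocess.call("kill -9 %d" % pid, shell=True)
    debug("kill -9 %d exited with status %d" % (pid, status))
    return status
```

Inside kill_spawned_jobs, the try/except around os.killpg could then presumably become a single kill_with_shell(pid) call, with the existing is_running(pid) check left as it is to decide whether to try again later. An exit status of 0 only means the signal was sent, so that check would still be needed.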