We just got bit by https://arc.liv.ac.uk/trac/SGE/ticket/802, and it took me a lot longer to figure out than it should have, in part because there does not appear to be any indication when a job has an array dependency on another job (at least in 6.2u3, which we're using). All holds and dependencies just change the state to hqw; the only exception is -hold_jid, which shows up if you run qstat -j on the job. So does anyone have a technique for determining why a job is held, other than running qstat -j to detect hold_jid and experimentally releasing other holds to see what happens?
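For reference, the qstat -j check could be scripted roughly as below. This is only a sketch: the function names and the keyword list are made up for illustration, and the field names that qstat -j prints differ between Grid Engine versions, so the keywords are a guess you would need to adjust. It only surfaces what qstat -j already reports, so it will not reveal a dependency that qstat -j does not show.

```python
import subprocess

# Assumption: substrings to look for in the field names of `qstat -j` output.
# Field names vary between Grid Engine versions, so adjust these as needed.
DEFAULT_KEYWORDS = ("hold", "predecessor")


def hold_related_lines(qstat_text, keywords=DEFAULT_KEYWORDS):
    """Return (field, value) pairs whose field name contains any keyword."""
    matches = []
    for line in qstat_text.splitlines():
        # Split each line at the first colon into a field name and a value.
        field, sep, value = line.partition(":")
        if not sep:
            continue  # not a "field: value" line
        if any(k in field.lower() for k in keywords):
            matches.append((field.strip(), value.strip()))
    return matches


def why_held(job_id):
    """Run `qstat -j <job_id>` and return the hold-related fields it prints."""
    result = subprocess.run(
        ["qstat", "-j", str(job_id)],
        capture_output=True,
        text=True,
        check=True,
    )
    return hold_related_lines(result.stdout)
```

Calling why_held(12345) on a submit host (the job number is a placeholder) returns a list of matching field/value pairs; an empty list means qstat -j printed nothing matching the keywords, which is exactly the case I'm asking about.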
William
