OK, I'll back off a bit and generalize... where else would I look to see hints of processes dying? Would I see anything anywhere else in the grid engine environment other than the exec host 'messages' file in the sge spool, or the qmaster 'messages' file ? (is there some other sge logging I'm missing, I guess is what I'm asking).
So far, the above file(s) aren't telling me much at all, other than lots of these:: 09/22/2012 16:56:09| main|rome|W|reaping job "22139" ptf complains: Job does not exist _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
