On Nov 29, 2006, at 8:44 AM, Scott Atchley wrote:

My last few runs all completed successfully without hanging. The job
I am currently running just hung one node (can respond to ping,
cannot ssh into it, cannot use any terminals connected to it).

There are no messages in dmesg and vmstat shows that the node is not
swapping (before it hung).

Any ideas where I should look next?

Scott

I just had another job hang at the start of the HPL portion. As before, I do not see anything in dmesg to indicate any problems. vmstat did not show any paging (60% of memory free).

Scott

Reply via email to