Hi Lydia
I would like to say we clean up perfectly, but :-(
The system does try its best. I'm a little surprised here since we usually
clean up when an application process fails. Our only known problems are when
one or more of the orteds fail, usually due to a node rebooting or failing.
We ho
A job which crashes with an floating point underflow (or any IEEE floating point
exception) fails to clean up after itself using
openmpi-1.3a1r12695 ..
Nodes with copies of slaves are sitting there ...
I also noticed that orted are left behind on other crashed jobs ..
Should I have to expect t