Le 29/04/2015 14:53, Noam Bernstein a écrit :
> They’re dual 8-core processor, so the 16 cores are physical ones. lstopo
> output looks identical on nodes where this does happen, and nodes where it
> never does. My next step is to see if I can reproduce the behavior at will -
> I’m still not quite sure if it’s consistent. There are clearly nodes where
> it never happens (most of them), but even on the nodes where it does happen,
> I don’t know if it happens every time.
Thanks. It's indeed normal that OMPI failed to bind to cpuset 0,16 since
16 doesn't exist at all.
Can you run "lstopo foo.xml" on one node where it failed, and send the
foo.xml that got generated? Just want to make sure we don't have invalid
cpusets in there.
Brice