On Wed, 20 Nov 2019 17:38:19 +0000 "Mccall, Kurt E. (MSFC-EV41) via users" <users@lists.open-mpi.org> wrote:
> Hi,
>
> My job is behaving differently on its two nodes, refusing to
> MPI_Comm_spawn() a process on one of them but succeeding on the
> other.
...
> Data for node: n002 Num slots: 3 ... Bound: N/A
> Data for node: n001 Num slots: 3 ... Bound:
> socket 0[core 0[hwt 0]]:[B/././././././././.][./././././././././.]
...
> Why is the Bound output different between n001 and n002?

Without more details (such as the exact Open MPI version and exactly
how you launched the job), you're not likely to get good answers.

But it does seem clear that the pinning of processes/ranks to hardware
cores happened on one node but not the other. This suggests a broken
install, a broken environment, and/or a non-standard launch.

/Peter K
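
For comparing the two Bound fields by script rather than by eye, here is a minimal Python sketch. It assumes the map format is what the quoted output suggests: one [...] group per socket, cores separated by '/', and 'B' marking a bound hardware thread. The function name bound_cores is made up for illustration and is not part of Open MPI.

```python
import re

def bound_cores(bound_field):
    """Return (socket, core) pairs marked 'B' in a binding map string.

    Assumed format, inferred only from the output quoted above:
    one [...] group per socket, cores separated by '/', 'B' = bound.
    Returns [] when the field carries no map (for example 'N/A').
    """
    pairs = []
    # Only groups made up purely of 'B', '.' and '/' are socket maps;
    # the "[core 0[hwt 0]]" prefix does not match and is skipped.
    for socket, group in enumerate(re.findall(r"\[([B./]+)\]", bound_field)):
        for core, hwthreads in enumerate(group.split("/")):
            if "B" in hwthreads:
                pairs.append((socket, core))
    return pairs

n001 = "socket 0[core 0[hwt 0]]:[B/././././././././.][./././././././././.]"
n002 = "N/A"
print("n001:", bound_cores(n001))
print("n002:", bound_cores(n002))
```

An empty list for one node and a non-empty list for the other is the asymmetry described above: binding was applied on one node only.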