Re: [OMPI users] Binding blocks of processes in round-robin fashion

2021-01-29 Thread Ralph Castain via users
Okay, I can't promise when I'll get to it, but I'll try to have it in time for OMPI v5. On Jan 29, 2021, at 1:30 AM, Luis Cebamanos via users mailto:users@lists.open-mpi.org> > wrote: Hi Ralph, It would be great to have it for load balancing issues. Ideally one could do something like --

Re: [OMPI users] Binding blocks of processes in round-robin fashion

2021-01-29 Thread Luis Cebamanos via users
Hi Ralph, It would be great to have it for load balancing issues. Ideally one could do something like --bind-to:N where N is the block size, 4 in this case. mpirun -np 40  --map-by ppr:40:node  --bind-to core:4  I think it would be interesting to have it. Of course, I can always use srun but not

Re: [OMPI users] Binding blocks of processes in round-robin fashion

2021-01-28 Thread Ralph Castain via users
Hmmm...well, the proc distribution is easy as you would just --map-by node. The tricky thing is assigning the ranks in the pattern you desire. We definitely don't have that pattern in our ranking algo today, though it wouldn't be hard to add. However, that wouldn't be available until OMPI v5 wa

Re: [OMPI users] Binding blocks of processes in round-robin fashion

2021-01-28 Thread Luis Cebamanos via users
That's right Ralph! On 28/01/2021 23:13, Ralph Castain via users wrote: Trying to wrap my head around this, so let me try a 2-node example. You want (each rank bound to a single core): ranks 0-3 to be mapped onto node1 ranks 4-7 to be mapped onto node2 ranks 8-11 to be mapped onto node1 ranks

Re: [OMPI users] Binding blocks of processes in round-robin fashion

2021-01-28 Thread Ralph Castain via users
Trying to wrap my head around this, so let me try a 2-node example. You want (each rank bound to a single core): ranks 0-3 to be mapped onto node1 ranks 4-7 to be mapped onto node2 ranks 8-11 to be mapped onto node1 ranks 12-15 to be mapped onto node2 etc.etc. Correct? > On Jan 28, 2021, at 3:0

[OMPI users] Binding blocks of processes in round-robin fashion

2021-01-28 Thread Luis Cebamanos via users
Hello all, What are the options for binding MPI tasks on a blocks of cores per node/socket/numa in a round-robin fashion? Say I want to fully populate 40 core sockets on dual-socket nodes but in a round-robin fashion binding 4 cores on the first node, then 4 cores on the next, and so on.  Woul