Thanks for your answer. Your example addresses one possible situation,
where a parallel application is spawned by a driver with
MPI_Comm_spawn, or multiple parallel applications are spawned at the
same time with MPI_Comm_spawn_multiple, over a set of processors
described in the machinefile. It is
so for my tests, Open MPI did follow the machinefile (see output
further below); however, for each spawn operation it starts from the
very beginning of the machinefile...
The following example spawns 5 child processes (with a single
MPI_Comm_spawn), and each child prints its rank and the hostname.
As far as I know, Open MPI should follow the machinefile for spawn
operations, but every spawn starts again at the beginning of the
machinefile. An info object such as 'lam_sched_round_robin' is
currently not available/implemented. Let me look into this...
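
To make the placement behaviour described above concrete, here is a
small toy model in C. It is not Open MPI code and does not call MPI at
all. The function names (place_restart, place_continue,
print_spawn_restart), the one-slot-per-entry machinefile, and the
wrap-around when children outnumber entries are all assumptions made
for illustration. It only contrasts a mapper that restarts at the first
machinefile entry for every spawn with a hypothetical one that carries
on where the previous spawn stopped.

```c
#include <assert.h>
#include <stdio.h>

/* Toy model only, not Open MPI code. Assumes one slot per machinefile
 * entry and wrap-around when children outnumber entries. */

/* Host of child `child` when every spawn restarts at the first entry. */
const char *place_restart(const char *const hosts[], int nhosts, int child)
{
    assert(nhosts > 0 && child >= 0);
    return hosts[child % nhosts];
}

/* Hypothetical alternative: `already_placed` children from earlier
 * spawns shift the starting entry, so placement continues round-robin. */
const char *place_continue(const char *const hosts[], int nhosts,
                           int already_placed, int child)
{
    assert(nhosts > 0 && already_placed >= 0 && child >= 0);
    return hosts[(already_placed + child) % nhosts];
}

/* Print where one spawn of `nchildren` processes would land under the
 * restart model. */
void print_spawn_restart(const char *const hosts[], int nhosts,
                         int nchildren)
{
    for (int i = 0; i < nchildren; i++)
        printf("child %d -> %s\n", i, place_restart(hosts, nhosts, i));
}
```

With a hypothetical three-entry machinefile (node0, node1, node2) and 5
children per spawn, the restart model gives the same list for every
spawn, beginning at node0 each time. Under place_continue, a second
spawn with already_placed = 5 would instead begin at node2.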
Jean Latour wrote:
Hello,