IIRC, I added relative node syntax to the rankfile mapper a while back. You 
should then be able to specify the rank->node mapping by indicating that (e.g.) 
rank 0 -> first node in file, instead of specifying the node by name.


On Jun 16, 2010, at 2:21 PM, Brock Palen wrote:

> We use Torque and Moab as our resource manager, 
> I have a user who has to mix programming models, threads and MPI, but not on 
> all ranks,
> 
> Example:
> Rank 0 needs 4 cores
> Rank 1 needs 4 cores
> Ranks 2-5 need 1
> 
> There is no way to give this to Torque/Moab in a way that when using TM 
> openMPI correctly starts the layout I need.
> I found I can use a rankfile to do this,
> 
> cat rankfile
> rank 0=nyx0456 slot=0
> rank 1=nyx0457 slot=0
> rank 2=nyx0458 slot=0
> rank 3=nyx0458 slot=1
> rank 4=nyx0458 slot=2
> rank 5=nyx0458 slot=3
> 
> qsub -I -l nodes=4:ppn=4
> [brockp@nyx0456 tmp]$ mpirun -np 6 -rf rankfile hostname | sort | uniq -c
>      1 nyx0456.engin.umich.edu
>      1 nyx0457.engin.umich.edu
>      4 nyx0458.engin.umich.edu
> 
> (ignore the cpu count mismatch, I am not using all 16 cores in this example)
> 
> I could make a script to build this rankfile, I want to know if there is a 
> better way to do this, 
> Also we don't want to use affinity, as we don't know what 'cpus' the bath 
> (pbs/moab) system gave us,  is the above rankfile going to do the right 
> thing? That is the processes will float? Is there a way to tell if affinity 
> is set? Or to say "rank 0 goes to this host use any available cores" 
> 
> Thanks!
> 
> Brock Palen
> www.umich.edu/~brockp
> Center for Advanced Computing
> bro...@umich.edu
> (734)936-1985
> 
> 
> 
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users


Reply via email to