Just to close the loop on this: I added the binding info to --display-map

On Mar 28, 2014, at 6:58 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com> wrote:

> Good information; thanks.
> 
> The short reason for this change in behavior of the affinity options is that 
> when we first created affinity (waaaay back in 1.0 days, no one really cared 
> about it much, and so we just did a first attempt).  Gradually over time, 
> affinity has become much more important.  As such, we have learned much from 
> what our users want and how they want to use affinity.  That has caused a few 
> changes in approaches to how we do affinity -- and because our understanding 
> has grown, sometime it means that the changes we've made have been 
> revolutionary (vs. evolutionary), meaning that CLI options change, behaviors 
> change, etc.
> 
> Sorry about that -- it really reflects how the whole HPC community is 
> evolving its attitude towards affinity over time.
> 
> BTW, you should be aware that Open MPI v1.8 -- i.e., the next stable series 
> -- is scheduled to be released on Monday.  There's additional changes with 
> regards to affinity in 1.8 (compared to the v1.6 series); much of what has 
> been discussed on this thread has been in the context of v1.7.x (which is 
> being renamed to 1.8 on Monday, per our "feature series eventually turns into 
> stable series" versioning philosophy).
> 
> 
> 
> On Mar 28, 2014, at 9:47 AM, "Sasso, John (GE Power & Water, Non-GE)" 
> <john1.sa...@ge.com> wrote:
> 
>> Thanks again!  I tried --display-devel-map and I think it provides a bit too 
>> much info for our needs.  However, it is nice to know.
>> 
>> BTW, some interesting behavior in using "--report-bindings --bind-to-core" 
>> vs "--display-map".  
>> 
>> *  If I use "--report-bindings --bind-to-core" but the MPI tasks on a host 
>> fail to start up, then nothing is reported.  For example, I had a problem 
>> where a job started across 4 hosts but the hosts could not communicate with 
>> one another via TCP/IP.
>> 
>> * If I use "--display-map" then the mapping is shown, even in the failure 
>> case I mentioned in the last bullet.
>> 
>> * What is nice about "--report-bindings --bind-to-core" over "--display-map" 
>> is that it will report the binding of each rank to CPU, whereas the latter 
>> will show you what  ranks are running on a given host.  For our needs, this 
>> may be sufficient, tho it would be nice to have the CPU bindings shown as 
>> well
>> 
>> * If using "--report-bindings --bind-to-core" with OpenMPI 1.4.1 then the 
>> bindings on just the head node are shown.  In 1.6.1, full bindings across 
>> all hosts are shown.  (I'd have to read release notes on this...)
>> 
>> --john
>> 
>> 
>> -----Original Message-----
>> From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain
>> Sent: Thursday, March 27, 2014 7:01 PM
>> To: Open MPI Users
>> Subject: Re: [OMPI users] Mapping ranks to hosts (from MPI error messages)
>> 
>> Oooooooh...it's Jeff's fault!
>> 
>> Fwiw you can get even more detailed mapping info with --display-devel-map
>> 
>> Sent from my iPhone
>> 
>>> On Mar 27, 2014, at 2:58 PM, "Jeff Squyres (jsquyres)" <jsquy...@cisco.com> 
>>> wrote:
>>> 
>>>> On Mar 27, 2014, at 4:06 PM, "Sasso, John (GE Power & Water, Non-GE)" 
>>>> <john1.sa...@ge.com> wrote:
>>>> 
>>>> Yes, I noticed that I could not find --display-map in any of the man 
>>>> pages.  Intentional?
>>> 
>>> Oops; nope.  I'll ask Ralph to add it...
>>> 
>>> -- 
>>> Jeff Squyres
>>> jsquy...@cisco.com
>>> For corporate legal information go to: 
>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>> 
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
> 
> 
> -- 
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to: 
> http://www.cisco.com/web/about/doing_business/legal/cri/
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users

Reply via email to