Re: [OMPI users] Processor affinitiy

2008-04-28 Thread Brian Taylor
Jeff, The utilBindThreadToCPU(x) function is part of a private framework, CHUD.framework, that is installed in /System/Library/PrivateFrameworks. Apple provides but does not support this framework. Any part of it could be changed or removed in future releases of CHUD without any notice or deprec

Re: [OMPI users] blcr_checkpoint_peer: execvp returned -1

2008-04-28 Thread Josh Hursey
I don't think I have ever seen this one before. :( So you are trying to checkpoint the MPI process by hand or a non-MPI process? Can you confirm that you can successfully checkpoint/restart a non-MPI process on these machines? What version of the Open MPI trunk are you using? Have you made

Re: [OMPI users] infiniband

2008-04-28 Thread Jeff Squyres
Open MPI does not register with HCAs / ports in a way visible through OFED command line tools, sorry... On Apr 27, 2008, at 11:19 AM, SLIM H.A. wrote: Is it possible to get information about the usage of hca ports similar to the result of the mx_endpoint_info command for Myrinet boards? Th

Re: [OMPI users] Processor affinitiy

2008-04-28 Thread Jeff Squyres
Just curious -- with regard to utilBindThreadToCPU(x), why does the text say: "That function should never go in a shipping app, but it’s useful for debugging." On Apr 28, 2008, at 4:30 PM, Brian Taylor wrote: Actually, there is an unofficial processor affinity API on Mac OS X, but it

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-28 Thread Josh Hursey
On Apr 25, 2008, at 6:12 PM, Sharon Brunett wrote: Josh, I'm responding to some outstanding questions about the environment I'm trying to ompi-restart in. My answers to your questions are sprinkled below, and include a few more questions based on attempts I've made to get a multi-node restart wo

Re: [OMPI users] Processor affinitiy

2008-04-28 Thread Brian Taylor
Actually, there is an unofficial processor affinity API on Mac OS X, but it is supplied only with the CHUD framework. I suppose as a further barrier to using this API in code outside of Apple, the header files for this API are only available with the standalone CHUD installer. See: http://lists.a

[OMPI users] setting the btl_tcp_eager_limit

2008-04-28 Thread jean-christophe.mig...@ens-lyon.fr
Hi all, We're using a pingpong test to measure the bandwidth and latency available with Open MPI. In our first experiments, done with version 1.1.4, we used the btl_tcp_eager_limit parameter to modify the eager limit. We've upgraded to version 1.2.6 and the limit parameter we

Re: [OMPI users] trouble building on a macbook

2008-04-28 Thread Doug Reeder
Robert, Did you mean to install openmpi-1.2.6 in /usr? That is where the Apple-supplied openmpi-1.2.3 is installed. That doesn't appear to be the problem causing your make install error. Were there any warnings or errors when you ran make? Doug Reeder On Apr 27, 2008, at 1:11 PM, Rober

Re: [OMPI users] blcr_checkpoint_peer: execvp returned -1

2008-04-28 Thread Leonardo Fialho
Changing some parameters (blcr_checkpoint_cmd): [aogrd01:08552] crs:blcr: checkpoint(8552, ---) [aogrd01:08552] crs:blcr: checkpoint_peer(8552, --) [aogrd01:08552] crs:blcr: get_checkpoint_filename(--, 8552) [aogrd01:08552] crs:blcr: checkpoint_cmd(8552) [aogrd01:08552] crs:blcr: blcr_checkpoint_

[OMPI users] blcr_checkpoint_peer: execvp returned -1

2008-04-28 Thread Leonardo Fialho
Hi All, Has anybody experienced this error? [aogrdini:09070] Global) Receive a command message from [[13242,0],0]. ... [aogrd02:07642] Local) Receive a command message. ... [aogrd01:07938] Local) Receive a command message. ... [aogrd01:07941] App) signal_handler: Receive Checkpoint Request. ...

Re: [OMPI users] Message compression in OpenMPI

2008-04-28 Thread Tomas Ukkonen
Aurélien Bouteiller wrote: > From a pretty old experiment I made, compression was giving good > results on 10Mbps network but was actually decreasing RTT on 100Mbs > and more. I played with all the zlib settings from 1 to 9, and > actually even the low compression setting was unable to reach