Re: [OMPI users] v1.3 on Mac OS X with PGI 8.0

2009-03-19 Thread Jeff Squyres
That's fun. I'm afraid I don't have access to PGI 8.0 on OSX -- I can't really track this down any further. Can someone check this platform? On Mar 19, 2009, at 11:10 AM, Tim Campbell wrote: During "make install" of OpenMPI v1.3 on Mac OSX with PGI 8.0 I get the following error. -

Re: [OMPI users] mpirun exit status

2009-03-19 Thread Jeff Squyres
I believe that this was just fixed in OMPI v1.3.1 -- could you try upgrading? On Mar 19, 2009, at 10:58 AM, Cristian KLEIN wrote: Hello everybody, I've been using OpenMPI 1.3's mpirun in Makefiles and observed that the exit status is not always the one I expect. For example, using an inco

[OMPI users] btl_openib_ib_max_inline_data warnings

2009-03-19 Thread Gary Draving
Hi All, I have written a simple ring program that seems to run fine, but I get the following warning even though I am not explicitly defining btl_openib_ib_max_inline_data with an MCA param. I'm only getting the warning on the 3 machines that have the QLE7240, the other 40+ machines with M

Re: [OMPI users] OpenMPI 1.3 and SGE 6.2u1

2009-03-19 Thread Rolf Vandevaart
Your understanding is exactly right. This issue came up earlier today. The suggestion was to add one of the following to your mpirun command: "--mca orte_leave_session_attached 1" or "-leave-session-attached". Here is the thread from earlier: http://www.open-mpi.org/community/lists/users/2009/03/8511.

Re: [OMPI users] OpenMPI 1.3 and SGE 6.2u1

2009-03-19 Thread Ralph Castain
Just add --leave-session-attached to your cmd line. On Mar 19, 2009, at 10:19 AM, Malone, Scott wrote: Since I'm new to OpenMPI I wanted to make sure that I understand this. When the job starts, orted is daemonized, and because of this they are not bound to the sge_shepherd on each node. This

Re: [OMPI users] OpenMPI 1.3 and SGE 6.2u1

2009-03-19 Thread Malone, Scott
Since I'm new to OpenMPI I wanted to make sure that I understand this. When the job starts, orted is daemonized, and because of this they are not bound to the sge_shepherd on each node. This results in the loss of accounting for those processes. I guess that when I start mpirun with debugging, the

Re: [OMPI users] OpenMPI 1.3 and SGE 6.2u1

2009-03-19 Thread Reuti
Hi, On 19.03.2009 at 16:07, Malone, Scott wrote: I am having two problems with the integration of OpenMPI 1.3 and SGE 6.2u1, both of which are new to us. The troubles are getting jobs to suspend/resume and collecting CPU time correctly. For suspend/resume I have added the following to my mp

[OMPI users] v1.3 on Mac OS X with PGI 8.0

2009-03-19 Thread Tim Campbell
During "make install" of OpenMPI v1.3 on Mac OSX with PGI 8.0 I get the following error. -- Making install in mca/paffinity/darwin source='paffinity_darwin_module.c' object='paffinity_darwin_module.lo' libtool=yes \

[OMPI users] OpenMPI 1.3 and SGE 6.2u1

2009-03-19 Thread Malone, Scott
I am having two problems with the integration of OpenMPI 1.3 and SGE 6.2u1, both of which are new to us. The troubles are getting jobs to suspend/resume and collecting CPU time correctly. For suspend/resume I have added the following to my mpirun command: --mca orte_forward_job_control 1 --mca plm_

[OMPI users] mpirun exit status

2009-03-19 Thread Cristian KLEIN
Hello everybody, I've been using OpenMPI 1.3's mpirun in Makefiles and observed that the exit status is not always the one I expect. For example, using an incorrect machinefile makes mpirun return 0, whereas a non-zero value would be expected: --- cut here --- masternode:~/grid/myTests/hellompi$

Re: [OMPI users] v1.3: mca_common_sm_mmap_init error

2009-03-19 Thread Ralph Castain
Or you can just add --leave-session-attached to your mpirun cmd line. On Mar 19, 2009, at 8:10 AM, Rolf Vandevaart wrote: On 03/19/09 09:55, Dave Love wrote: Prentice Bisbal writes: I just installed OpenMPI 1.3 with tight integration for SGE. Version 1.2.8 was working just fine for several m

Re: [OMPI users] selected pml cm, but peer [[2469, 1], 0] on compute-0-0 selected pml ob1

2009-03-19 Thread Gary Draving
Wow, that seems to have worked. fs1 has a Qlogic QLE7240. I got it to work from the command line first, then added "pml = ob1" to /usr/local/etc/openmpi-mca-params.conf, which works as well. Thanks for all your help! Gary Nysal Jan wrote: fs1 is selecting the "cm" PML whereas other nodes

Re: [OMPI users] v1.3: mca_common_sm_mmap_init error

2009-03-19 Thread Rolf Vandevaart
On 03/19/09 09:55, Dave Love wrote: Prentice Bisbal writes: I just installed OpenMPI 1.3 with tight integration for SGE. Version 1.2.8 was working just fine for several months in the same arrangement. Now that I've upgraded to 1.3, I get the following errors in my standard error file: mca_co

Re: [OMPI users] v1.3: mca_common_sm_mmap_init error

2009-03-19 Thread Dave Love
Prentice Bisbal writes: > I just installed OpenMPI 1.3 with tight integration for SGE. Version > 1.2.8 was working just fine for several months in the same arrangement. > > Now that I've upgraded to 1.3, I get the following errors in my standard > error file: > > mca_common_sm_mmap_init: open /tm

[OMPI users] Open MPI v1.3.1 released

2009-03-19 Thread Ralph Castain
The Open MPI Team, representing a consortium of research, academic, and industry partners, is pleased to announce the release of Open MPI version 1.3.1. This release is mainly a bug fix release over the v1.3.0 release, but there are a few new features. We strongly recommend that all users upgrade t

Re: [OMPI users] selected pml cm, but peer [[2469, 1], 0] on compute-0-0 selected pml ob1

2009-03-19 Thread Nysal Jan
fs1 is selecting the "cm" PML whereas other nodes are selecting the "ob1" PML component. You can force ob1 to be used via "--mca pml ob1". What kind of hardware/NIC does fs1 have? --Nysal On Wed, 2009-03-18 at 17:17 -0400, Gary Draving wrote: > Hi all, > > anyone ever seen an error like this? Se