[OMPI users] [Fwd: Re: Configure OpenMPI and SLURM on Debian (Lenny)]

2009-03-26 Thread Jerome BENOIT
Original Message Subject: Re: [OMPI users] Configure OpenMPI and SLURM on Debian (Lenny) List-Post: users@lists.open-mpi.org Date: Fri, 27 Mar 2009 04:36:39 +0800 From: Jerome BENOIT Reply-To: jgmben...@mailsnare.net Organization: none CC: Open MPI Users References: <49c8997

[OMPI users] 'orte_ess_base_select failed'

2009-03-26 Thread Russell McQueeney
I installed OpenMPI 1.3.1, and whenever I or mpirun try to start orted on any of the machines, it shows that message, and --> Returned value Not found (-13) instead of ORTE-SUCCESS Is there anything obvious that I missed? My machines are Intel x86-32, running fedora (10 and 2)

Re: [OMPI users] Same bug in v1.0.6

2009-03-26 Thread Shaun Jackman
Please ignore the following message. It wasn't intended for the Open MPI list. My apologies. Cheers, Shaun Shaun Jackman wrote: Hi Todd, Back to the drawing board for me. The assertion is stating that all the tips should have been eroded in a single pass (and 2654086 tips were), but in a se

Re: [OMPI users] Same bug in v1.0.6

2009-03-26 Thread Shaun Jackman
Hi Todd, Back to the drawing board for me. The assertion is stating that all the tips should have been eroded in a single pass (and 2654086 tips were), but in a second pass it unexpectedly found 2 more tips. As a workaround until I nail this bug, you can downgrade this error to a warning by r

[OMPI users] Bug in MPI_Request_get_status (1.3.1) [PATCH]

2009-03-26 Thread Shaun Jackman
MPI_Request_get_status fails if the status parameter is passed MPI_STATUS_IGNORE. A patch is attached. Cheers, Shaun 2009-03-26 Shaun Jackman * ompi/mpi/c/request_get_status.c (MPI_Request_get_status): Do not fail if the status argument is NULL, because the application may pass

Re: [OMPI users] error polling LP CQ with status RETRY EXCEEDED ERROR

2009-03-26 Thread Ralph Castain
The default retry values are wrong and will be corrected in the next OMPI release. For now, try running with: -mca btl_openib_ib_min_rnr_timer 25 -mca btl_openib_ib_timeout 20 Should work. Ralph On Mar 26, 2009, at 2:16 PM, Gary Draving wrote: Hi Everyone, I'm doing some performance testin

[OMPI users] error polling LP CQ with status RETRY EXCEEDED ERROR

2009-03-26 Thread Gary Draving
Hi Everyone, I'm doing some performance testing using HPL with TCP turned off. My HPL.dat file looks like the following: It seems to work well for lower Ns values but as I increase that value it inevitably fails with "[[13535,1],169][btl_openib_component.c:2905:handle_wc] from compute-0-0.lo

Re: [OMPI users] PML add procs failed --> Returned "Unreachable" (-12) instead of "Success" (0)

2009-03-26 Thread Ralph Castain
On Mar 26, 2009, at 10:59 AM, Alessandro Surace wrote: Hi Ralph, what do you mean to create/define a directly interface? The 3 hosts are network connected and ssh pub key enabled. Every hosts can see the other but they are not all on the same direct connected network . More in detail: gri

Re: [OMPI users] PML add procs failed --> Returned "Unreachable" (-12) instead of "Success" (0)

2009-03-26 Thread Alessandro Surace
Hi Ralph, what do you mean to create/define a directly interface? The 3 hosts are network connected and ssh pub key enabled. Every hosts can see the other but they are not all on the same direct connected network . More in detail: grid01 and grid04 are in the same network grid03 is on different ne

[OMPI users] Linux opteron infiniband sunstudio configure problem

2009-03-26 Thread Kevin McManus
Hi All, As a complete beginner (to OpenMPI) I am attempting to build on a Linux opteron infiniband platform using SunStudio compilers. My build script looks like... #!/bin/sh ../configure x86_64 \ CC=cc CXX=CC F77=f77 FC=f90 \ CFLAGS=-m64 CXXFLAGS=-m64 FFLAGS=-m64 FCFLAGS=-m64

Re: [OMPI users] PML add procs failed --> Returned "Unreachable" (-12) instead of "Success" (0)

2009-03-26 Thread Ralph Castain
Hi Alex The problem is likely to be that there is no connection available between grid03 and grid04. Did you create/define an Ethernet/IP interface directly between those two virtual machines? Otherwise, there is no way for MPI to communicate between them. Ralph On Mar 26, 2009, at 9:00

[OMPI users] PML add procs failed --> Returned "Unreachable" (-12) instead of "Success" (0)

2009-03-26 Thread Alessandro Surace
Hi Guys, I'm really new in mpi. I've created a small grid composed of virtual and real host. grid01 real/Centos/(Open MPI) 1.2.5 grid03 virtual (xen)/Centos/(Open MPI) 1.2.5 grid04 virtual (vmware)/Centos/(Open MPI) 1.2.5 When I try to run a parallel job like this: mpirun -v -H grid01,grid04 -np

Re: [OMPI users] MPI_Test without deallocation

2009-03-26 Thread Richard Treumann
You can use MPI_REQUEST_GET_STATUS as a way to "test" without deallocation. I do not understand the reason you would forward the request (as a request) to another function. The data is already in a specific receive buffer by the time an MPI_Test returns TRUE so calling the function and passing i