Re: [OMPI users] Deprecated parameter: plm_rsh_agent

2010-11-05 Thread Samuel K. Gutierrez
m:name:pls_rsh_agent mca:orte:base:param:orte_rsh_agent:synonym:name:plm_rsh_agent mca:plm:base:param:plm_rsh_agent:synonym_of:name:orte_rsh_agent -- Samuel K. Gutierrez Los Alamos National Laboratory On Nov 5, 2010, at 12:41 PM, Joshua Bernstein wrote: Hello All, When building the examples included with OpenMPI version 1.5 I see a messag

Re: [OMPI users] Segmentation fault with SLURM and non-local nodes

2011-02-02 Thread Samuel K. Gutierrez
Hi, We'll try to reproduce the problem. Thanks, -- Samuel K. Gutierrez Los Alamos National Laboratory On Feb 2, 2011, at 2:55 AM, Michael Curtis wrote: On 28/01/2011, at 8:16 PM, Michael Curtis wrote: On 27/01/2011, at 4:51 PM, Michael Curtis wrote: Some more debugging informatio

Re: [OMPI users] Segmentation fault with SLURM and non-local nodes

2011-02-03 Thread Samuel K. Gutierrez
-nopanasas I'll dig a bit further. Sam On Feb 2, 2011, at 9:53 AM, Samuel K. Gutierrez wrote: Hi, We'll try to reproduce the problem. Thanks, -- Samuel K. Gutierrez Los Alamos National Laboratory On Feb 2, 2011, at 2:55 AM, Michael Curtis wrote: On 28/01/2011, at 8:16 PM, Mich

Re: [OMPI users] Segmentation fault with SLURM and non-local nodes

2011-02-07 Thread Samuel K. Gutierrez
Hi, A detailed backtrace from a core dump may help us debug this. Would you be willing to provide that information for us? Thanks, -- Samuel K. Gutierrez Los Alamos National Laboratory On Feb 6, 2011, at 6:36 PM, Michael Curtis wrote: On 04/02/2011, at 9:35 AM, Samuel K. Gutierrez

Re: [OMPI users] Segmentation fault with SLURM and non-local nodes

2011-02-08 Thread Samuel K. Gutierrez
Hi Michael, You may have tried to send some debug information to the list, but it appears to have been blocked. Compressed text output of the backtrace text is sufficient. Thanks, -- Samuel K. Gutierrez Los Alamos National Laboratory On Feb 7, 2011, at 8:38 AM, Samuel K. Gutierrez wrote

Re: [OMPI users] Segmentation fault with SLURM and non-local nodes

2011-02-09 Thread Samuel K. Gutierrez
https://svn.open-mpi.org/trac/ompi/ticket/2717 Ralph On Feb 8, 2011, at 6:23 PM, Michael Curtis wrote: On 09/02/2011, at 9:16 AM, Ralph Castain wrote: See below On Feb 8, 2011, at 2:44 PM, Michael Curtis wrote: On 09/02/2011, at 2:17 AM, Samuel K. Gutierrez wrote: Hi Michael, You

Re: [OMPI users] OMPI seg fault by a class with weird address.

2011-03-15 Thread Samuel K. Gutierrez
I -think- setting OMPI_MCA_memory_ptmalloc2_disable to 1 will turn off OMPI's memory wrappers without having to rebuild. Someone please correct me if I'm wrong :-). For example (bash-like shell): export OMPI_MCA_memory_ptmalloc2_disable=1 Hope that helps, -- Samuel K. Gut

Re: [OMPI users] mpi problems,

2011-04-04 Thread Samuel K. Gutierrez
Hi, Try prepending the path to your compiler libraries. Example (bash-like): export LD_LIBRARY_PATH=/compiler/prefix/lib:/ompi/prefix/lib: $LD_LIBRARY_PATH -- Samuel K. Gutierrez Los Alamos National Laboratory On Apr 4, 2011, at 1:33 PM, Nehemiah Dacres wrote: altering LD_LIBRARY_PATH

Re: [OMPI users] mpi problems,

2011-04-04 Thread Samuel K. Gutierrez
What does 'ldd ring2' show? How was it compiled? -- Samuel K. Gutierrez Los Alamos National Laboratory On Apr 4, 2011, at 1:58 PM, Nehemiah Dacres wrote: [jian@therock ~]$ echo $LD_LIBRARY_PATH /opt/sun/sunstudio12.1/lib:/opt/vtk/lib:/opt/gridengine/lib/lx26- amd64:/opt/gridengin

Re: [OMPI users] btl_openib_cpc_include rdmacm questions

2011-05-16 Thread Samuel K. Gutierrez
Hi, Just out of curiosity - what happens when you add the following MCA option to your openib runs? -mca btl_openib_flags 305 Thanks, Samuel Gutierrez Los Alamos National Laboratory On May 13, 2011, at 2:38 PM, Brock Palen wrote: > On May 13, 2011, at 4:09 PM, Dave Love wrote: > >> Jeff Squ

Re: [OMPI users] btl_openib_cpc_include rdmacm questions

2011-05-16 Thread Samuel K. Gutierrez
On May 16, 2011, at 8:53 AM, Brock Palen wrote: > > > > On May 16, 2011, at 10:23 AM, Samuel K. Gutierrez wrote: > >> Hi, >> >> Just out of curiosity - what happens when you add the following MCA option >> to your openib runs? >> >> -mc

Re: [OMPI users] Openib with > 32 cores per node

2011-05-19 Thread Samuel K. Gutierrez
Hi, Try the following QP parameters that only use shared receive queues. -mca btl_openib_receive_queues S,12288,128,64,32:S,65536,128,64,32 Samuel K. Gutierrez Los Alamos National Laboratory On May 19, 2011, at 5:28 AM, Robert Horton wrote: > Hi, > > I'm having proble

Re: [OMPI users] Openib with > 32 cores per node

2011-05-19 Thread Samuel K. Gutierrez
Hi, On May 19, 2011, at 9:37 AM, Robert Horton wrote > On Thu, 2011-05-19 at 08:27 -0600, Samuel K. Gutierrez wrote: >> Hi, >> >> Try the following QP parameters that only use shared receive queues. >> >> -mca btl_openib_receive_queues S,12288,128,64,32:S,65

Re: [OMPI users] [ompi-1.4.2] Infiniband issue on smoky @ ornl

2011-06-23 Thread Samuel K. Gutierrez
Hi, What happens when you don't run with per-peer queue pairs? Try: -mca btl_openib_receive_queues S,4096,128:S,12288,128:S,65536,128 -- Samuel K. Gutierrez Los Alamos National Laborator On Jun 23, 2011, at 7:56 AM, Mathieu Gontier wrote: > Hello, > > Thank for the answer.

Re: [OMPI users] [ompi-1.4.2] Infiniband issue on smoky @ ornl

2011-06-23 Thread Samuel K. Gutierrez
Hi, QP = Queue Pair Here are a couple of nice FAQ entries that I find useful. http://www.open-mpi.org/faq/?category=openfabrics And videos: http://www.open-mpi.org/video/?category=openfabrics -- Samuel K. Gutierrez Los Alamos National Laboratory On Jun 23, 2011, at 8:22 AM, Mathieu Gontier

Re: [OMPI users] qp memory allocation problem

2011-09-12 Thread Samuel K. Gutierrez
opers: Maybe we should consider disabling the use of per-peer queue pairs by default. Do they buy us anything? For what it is worth, we have stopped using them on all of our large systems here at LANL. Thanks, Samuel K. Gutierrez Los Alamos National Laboratory On Sep 12, 2011, at 9:23 AM, B

Re: [OMPI users] EXTERNAL: Re: qp memory allocation problem

2011-09-12 Thread Samuel K. Gutierrez
x27; queue pair type identifier corresponds to "Shared queues." The 'P' queue pair type identifier corresponds to "Per-peer queues." Hope that helps, Sam > > Ed > > From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On > Behalf Of Samuel K

Re: [OMPI users] How could OpenMPI (or MVAPICH) affect floating-point results?

2011-09-20 Thread Samuel K. Gutierrez
/j.parco.2011.02.009 Hope that helps, Samuel K. Gutierrez Los Alamos National Laboratory On Sep 20, 2011, at 6:25 AM, Reuti wrote: > Am 20.09.2011 um 13:52 schrieb Tim Prince: > >> On 9/20/2011 7:25 AM, Reuti wrote: >>> Hi, >>> >>> Am 20.09.2011 um 00:41

Re: [OMPI users] Problem building OpenMPI 1.4.4 with PGI 11.7 compilers

2011-11-08 Thread Samuel K. Gutierrez
**should** work, if that's the problem that you are experiencing. I would suggest starting with a fresh source tree, before you try again. Hope that helps, Samuel K. Gutierrez Los Alamos National Laboratory On Nov 8, 2011, at 2:06 PM, Gustavo Correa wrote: > Dear OpenMPI pros >

Re: [OMPI users] profile the performance of a MPI code: how much traffic is being generated?

2009-09-29 Thread Samuel K. Gutierrez
Hi, I'm writing a simple post-mortem profiling tool that provides some of the information that you are looking for. That being said, the tool, Loba, isn't publicly available just yet. In the mean time, take a look at mpiP (http://mpip.sourceforge.net/). -- Samuel K. Gut

Re: [OMPI users] memalign usage in OpenMPI and it's consequences for TotalVIew

2009-10-01 Thread Samuel K. Gutierrez
Hi, I think Jeff has already addressed this problem. https://svn.open-mpi.org/trac/ompi/changeset/21744 -- Samuel K. Gutierrez Los Alamos National Laboratory On Oct 1, 2009, at 11:25 AM, Peter Thompson wrote: We had a question from a user who had turned on memory debugging in TotalView and

Re: [OMPI users] memalign usage in OpenMPI and it's consequencesfor TotalVIew

2009-10-01 Thread Samuel K. Gutierrez
Ticket created (#2040). I hope it's okay ;-). -- Samuel K. Gutierrez Los Alamos National Laboratory On Oct 1, 2009, at 11:58 AM, Jeff Squyres wrote: Did that make it over to the v1.3 branch? On Oct 1, 2009, at 1:39 PM, Samuel K. Gutierrez wrote: Hi, I think Jeff has already addr

Re: [OMPI users] memalign usage in OpenMPI and it's consequencesfor TotalVIew

2009-10-01 Thread Samuel K. Gutierrez
nment error when memory debugging is enabled. -- Samuel K. Gutierrez Los Alamos National Laboratory On Oct 1, 2009, at 12:56 PM, Ashley Pittman wrote: Simple malloc() returns pointers that are at least eight byte aligned anyway, I'm not sure what the reason for calling memalign() with a valu

Re: [OMPI users] memchecker overhead?

2009-10-26 Thread Samuel K. Gutierrez
you run command. --mca mpi_leave_pinned 0 -- Samuel K. Gutierrez Los Alamos National Laboratory On Oct 26, 2009, at 1:41 PM, Jed Brown wrote: Jeff Squyres wrote: Using --enable-debug adds in a whole pile of developer-level run-time checking and whatnot. You probably don't want that on pr

Re: [OMPI users] Problem in using openmpi

2010-03-12 Thread Samuel K. Gutierrez
Open MPI installation is rooted at / home/jess/local/ompi and the libraries are located in /home/jess/local/ ompi/lib64, try (bash-like shell): export LD_LIBRARY_PATH= /home/jess/local/ompi/lib64 Hope this helps, -- Samuel K. Gutierrez Los Alamos National Laboratory On Mar 12, 2010, at 1:

Re: [OMPI users] Problem in using openmpi

2010-03-12 Thread Samuel K. Gutierrez
One more thing. The line should have been: export LD_LIBRARY_PATH=/home/jess/local/ompi/lib64 The space in the previous email will make bash unhappy 8-|. -- Samuel K. Gutierrez Los Alamos National Laboratory On Mar 12, 2010, at 1:56 PM, Samuel K. Gutierrez wrote: Hi, It sounds like you

Re: [OMPI users] Problem in using openmpi

2010-03-12 Thread Samuel K. Gutierrez
Hi, If lib64 isn't there, try lib. That is, export LD_LIBRARY_PATH=/home/jess/local/ompi/lib Referencing the example that I provided earlier. -- Samuel K. Gutierrez Los Alamos National Laboratory On Mar 12, 2010, at 3:31 PM, vaibhav dutt wrote: Hi, I used the export command b

Re: [OMPI users] How do I run OpenMPI safely on a Nehalem standalone machine?

2010-05-06 Thread Samuel K. Gutierrez
? Thanks, -- Samuel K. Gutierrez Los Alamos National Laboratory On May 6, 2010, at 12:01 PM, Gus Correa wrote: Hi Eugene Thanks for the detailed answer. * 1) Now I can see and use the btl_sm_num_fifos component: I had committed already "btl = ^sm" to the openmpi-mca-par

Re: [OMPI users] How do I run OpenMPI safely on a Nehalem standalone machine?

2010-05-06 Thread Samuel K. Gutierrez
Hi Gus, Doh! I didn't see the kernel-related messages after the segfault message. Definitely some weirdness here that is beyond your control... Sorry about that. -- Samuel K. Gutierrez Los Alamos National Laboratory On May 6, 2010, at 3:28 PM, Gus Correa wrote: Hi Samuel Sam