Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Jeff Squyres
If this is the problem, that's good -- it just means that we need a better error detection in the case where we run out of memory, etc. Stay tuned to that thread to see what happens. On Nov 14, 2008, at 1:14 PM, Peter Cebull wrote: Jeff Squyres wrote: Could this issue actually be related

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Peter Cebull
Jeff Squyres wrote: Could this issue actually be related to: http://www.open-mpi.org/community/lists/devel/2008/11/4882.php (read through the thread to get to the error handling stuff) You might be right that this issue is the problem. Our system has diskless nodes, so /tmp uses a ramdisk.

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Jeff Squyres
Could this issue actually be related to: http://www.open-mpi.org/community/lists/devel/2008/11/4882.php (read through the thread to get to the error handling stuff) On Nov 14, 2008, at 7:41 AM, Geraldo Veiga wrote: Thanks Peter. Blocking the shared memory layer did the trick for our p

Re: [OMPI users] avoid usage of ssh on local machine

2008-11-14 Thread Ralph Castain
Hello OMPI doesn't use ssh by default to launch a daemon local to mpirun - instead, we locally fork/exec the orted. The problem here is that OMPI doesn't realize that you are launching on the local machine. This is usually caused by confusion when IP resolving the hostname returned by get

Re: [OMPI users] timing + /usr/bin/time

2008-11-14 Thread Jeff Squyres
FWIW, I *always* report MPI application time in wall-clock seconds time. I know that some people (even among the OMPI developers) disagree with me, but to me, there's nothing else that you can measure that makes sense. Case in point: when using the OpenFabrics network stack, very little

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Geraldo Veiga
Thanks Peter. Blocking the shared memory layer did the trick for our program too. For the record, we also have SGI Propack 6 installed (sgi-propack-release-6-sgi600r3). Is the on-node shared memory support completely blocked? What if the MPI process calls a procedure that uses OpenMP threads, f

[OMPI users] avoid usage of ssh on local machine

2008-11-14 Thread Sun, Yongqi (E F ES EN 72)
Hello, I have two questions about ssh and details follow. Questions: Is there any way to prevent the usage of ssh on my local desktop and launch locally by default? (The FAQ page writes "Also note that if using a launcher that uses a hostfile and no hostfile is specified, all processes are launc

Re: [OMPI users] timing + /usr/bin/time

2008-11-14 Thread Raymond Wan
Hi Reuti, I have to admit that I'm so familiar with SGE, but I'll take a look at it so that I'll learn something. In my current situation, I don't /need/ to report a user time. I was just wondering if it has any meaning and what people mean when they show numbers or a graph and just says "

Re: [OMPI users] timing + /usr/bin/time

2008-11-14 Thread Raymond Wan
Hi Fabian, Thank you for clarifying things and confirming some of the things that I thought. I guess I have a clearer understanding now. Fabian Hänsel wrote: H, I guess user time does not matter since it is real time that we are interested in reducing. Right. Even if we *could*