[OMPI users] Segmentation Fault--libc.so.6(__libc_start_main...

2008-09-19 Thread Shafagh Jafer
Hi every one, I need urgent help plz :-( I am getting the following error when i run my program. OpenMPI compilation was all fine and went well, but now i dont understand the source of this error: [node01:29264] *** Process received signal *** [nod

[OMPI users] Segmentation Fault--libc.so.6(__libc_start_main...

2008-09-19 Thread Shafagh Jafer
Hi every one, I need urgent help plz :-( I am getting the following error when i run my program. OpenMPI compilation was all fine and went well, but now i dont understand the source of this error: [node01:29264] *** Process received signal *** [node01:2

Re: [OMPI users] MPI Finalize

2008-09-19 Thread Tim Mattox
This sounds like you have left a file open when using the MPI-2 I/O. You need to MPI_File_close() any files you have opened. On Fri, Sep 19, 2008 at 6:10 PM, Gabriele Fatigati wrote: > Hi, > i'm developing a C code under OpenMPI 1.2.5 with parallel I/O by MPI-2. > I have a strange problem in the

[OMPI users] TCP BTL in different subnets?

2008-09-19 Thread Jeroen Kleijer
Hi, I'm trying to get an openmpi application running accross different nodes but seem to have hit a snag when the processes are on different nodes, especially when the machines are on different TCP subnets. The orted daemons start up fine but after that application borks with the message [0,1,2][

[OMPI users] Configure and Build ok, but mpi module not recognized?

2008-09-19 Thread Brian Harker
Hi all- New openMPI user here with a question that I cannot find any information on the tubes for: I have configured openMPI to work with the Intel C (icc) and C++ (icpc) compilers, as well as the Intel fortran (ifort) compiler, and built all the single choice buffer fortran 90 bindings: ./confi

Re: [OMPI users] Mpirun don't execute

2008-09-19 Thread Santolo Felaco
Hi, I try to be clearer: osa@libertas:~$ echo $LD_LIBRARY_PATH /usr/local/lib:/home/osa/blcr/lib osa@libertas:~$ echo $PATH /usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/home/osa/blcr/bin I compile the file with mpicc, then: osa@libertas:~/prove/openmpi$ mpirun -np 2 es1

[OMPI users] MPI Finalize

2008-09-19 Thread Gabriele Fatigati
Hi, i'm developing a C code under OpenMPI 1.2.5 with parallel I/O by MPI-2. I have a strange problem in the MPI_Finalize() routine. The code generates message reported below : *** An error occurred in MPI_Barrier *** after MPI was finalized *** MPI_ERRORS_ARE_FATAL (goodbye) In my code, i don't u

Re: [OMPI users] error while loading shared libraries: libmpi_cxx.so.0

2008-09-19 Thread Shafagh Jafer
yes they are installed on all machines by a technician. I can see the openmpi '/lib/' directory on all machines. --- On Fri, 9/19/08, Jeff Squyres wrote: From: Jeff Squyres Subject: Re: [OMPI users] error while loading shared libraries: libmpi_cxx.so.0 To: "Open MPI Users" List-Post: users@li

Re: [OMPI users] error while loading shared libraries: libmpi_cxx.so.0

2008-09-19 Thread Jeff Squyres
You do need to have Open MPI installed on all machines, and have the Open MPI libraries be able to be found on all machines. See these FAQ entries: http://www.open-mpi.org/faq/?category=running#run-prereqs http://www.open-mpi.org/faq/?category=running#adding-ompi-to-path http://w

[OMPI users] error while loading shared libraries: libmpi_cxx.so.0

2008-09-19 Thread Shafagh Jafer
I did the modifications you mentioned and I got it working. Now when I use the mpirun to run an example on one or multiple machines I get the following error: = "error while loading shared libraries: libmpi_cxx.so.0: cannot open shared object fi

Re: [OMPI users] what is inside mpicc/mpic++

2008-09-19 Thread Shafagh Jafer
I did the modifications you mentioned and I got it working. Now when I use the mpirun to run an example on one or multiple machines I get the following error: = "error while loading shared libraries: libmpi_cxx.so.0: cannot open shared object fi

Re: [OMPI users] Newbie doubt.

2008-09-19 Thread Jeff Squyres
Can you narrow down the problem a bit? Have you run through a debugger, perchance, to see if anything turns up? Is your code working (running to completion and producing correct answers) at low process counts, etc.? On Sep 19, 2008, at 5:01 PM, Davi Vercillo C. Garcia (デビッド) wrote:

Re: [OMPI users] Mpirun don't execute

2008-09-19 Thread Jeff Squyres
Could you send a little more detailed explanation of exactly what is going wrong? Remember: the more information you include in your report, the better. E-mails simply stating, "It doesn't work!" are not helpful; we need to know as much information about your environment as possible in orde

Re: [OMPI users] Newbie doubt.

2008-09-19 Thread Davi Vercillo C. Garcia (デビッド)
Hello again, My code evolved and I'm having more problems that I cannot solve. When I execute it, using a hight number of processes, it crashes. And I'm using MPI_Write but there's nothing in output file. My code can be found at: http://pastebin.com/d1ea278f1 -- Davi Vercillo Carneiro Garcia ht

Re: [OMPI users] Mpirun don't execute

2008-09-19 Thread Santolo Felaco
The zip file cointains only ASCII file. 2008/9/19 Santolo Felaco > The attachemente contains: config.log, output file of ompi_info --all and > output file of ipconfig. > I need of version 1.3 because I must use the checkpoint. > Thanks. > > > 2008/9/19 Jeff Squyres > >> Please see http://www.op

Re: [OMPI users] Mpirun don't execute

2008-09-19 Thread Santolo Felaco
The attachemente contains: config.log, output file of ompi_info --all and output file of ipconfig. I need of version 1.3 because I must use the checkpoint. Thanks. 2008/9/19 Jeff Squyres > Please see http://www.open-mpi.org/community/help/; this will help us > understand your problem. > > Also

[OMPI users] proper way to shut down orted

2008-09-19 Thread Timothy Kaiser
Greetings, I have a manager/worker application. The manager is called "t2a" and the workers "w2d" I launch the manager and each worker with its own mpiexec with n=1. The connect using various calls including MPI_Open_port, MPI_Comm_accept, MPI_Comm_connect and MPI_Intercomm_merge. It works fi

Re: [OMPI users] Strange segfault in openmpi

2008-09-19 Thread Robert Kubrick
The line Signal code: Address not mapped (1) indicates that there is probably a mismatch between the runtime library and the linked version. Make sure that you link the program and run it using the same installation base. Are the libraries in / usr/mpi/fsl_openmpi_gcc_1.2.6 the same you use

[OMPI users] Strange segfault in openmpi

2008-09-19 Thread Daniel Hansen
I work for a supercomputing organization and we just installed the latest version of rocks/centos on our cluster. We compiled openmpi from source to customize it for our purposes. Since switching we have receive messages from users about errors, segfaults, etc. that we didn't see before. Here is

Re: [OMPI users] Mpirun don't execute

2008-09-19 Thread Jeff Squyres
Please see http://www.open-mpi.org/community/help/; this will help us understand your problem. Also note that Open MPI v1.3 is not yet released. If you have v1.3, it's a pre-release version. On Sep 19, 2008, at 1:37 PM, Santolo Felaco wrote: Hi, I have installed OpenMpi 1.3, my OS il Li

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-19 Thread Terry Dontje
Hello Sofia, After further reflection I wonder if you have a firewall that is preventing connections to certain ports. --td Terry Dontje wrote: Hello Sofia, Ok, so I really wanted the stack of when you run with "-mca mpi_preconnect_all 1" I believe you'll see that one of the processes wi

[OMPI users] Mpirun don't execute

2008-09-19 Thread Santolo Felaco
Hi, I have installed OpenMpi 1.3, my OS il Linux Ubuntu 8.04. When I invoke the comand mpirun -np 1 es2 the shell is blocked. The comand CTRL+X is not catched. I have set defaut hostfile and I set LD_LIBRARY_PATH with /usr/local/lib. Help me, please

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-19 Thread Terry Dontje
Hello Sofia, Ok, so I really wanted the stack of when you run with "-mca mpi_preconnect_all 1" I believe you'll see that one of the processes will be in init. However, the stack still probably will not help me help you. What needs to happen is to step through the code in dbx while the conn

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-19 Thread Sofia Aparicio Secanellas
Hello Terry, I have installed 1.2.7 and I obtain the same result. I will explain you what I have done. 1. On my computer edu@10.1.10.240 I have added a new user called sofia. This way I have sofia@10.1.10.208 and sofia@10.1.10.240. 2. I have downloaded the openmpi 1.2.7 from the openmpi websit

Re: [OMPI users] Linpack Benchmark and File Descriptor Limits

2008-09-19 Thread John Hearns
2008/9/19 Alex Wolfe > I'm just running it using mpirun from the command line. Thanks for the > reply. > >> > > Have you checked what ulimit -a returns on all the nodes on your cluster, ie when you ssh into them what does ulimit -a give you? I may be on the wrong track here.

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-19 Thread Sofia Aparicio Secanellas
Thanks Terry, I will install 1.2.7. Sofia - Original Message - From: "Terry Dontje" To: Sent: Thursday, September 18, 2008 7:31 PM Subject: Re: [OMPI users] Problem with MPI_Send and MPI_Recv Turns out you debugged the mpirun I was actually wanting you to attach to your program, P

[OMPI users] Re :Re: Linpack Benchmark and File Descriptor Limits

2008-09-19 Thread Neeraj Chourasia
Hello,    With openmpi-1.3,  new mca feature is introduced namely --mca routed binomial. This ensures out of band communication to happen in binomial fashion and reduces the net socket opening and hence solves file open issues.-NeerajOn Thu, 18 Sep 2008 16:46:23 -0700 Open MPI Users wrote I'm

[OMPI users] MPI_Win_unlock error

2008-09-19 Thread Robert Kubrick
I have a crash on a call to PMPI_Win_unlock(). My program runs with openmpi 1.2.7 on Ubuntu. Are there any known issues in 1.2.7 with RMA window calls? Linux local 2.6.24-19-generic #1 SMP Wed Jun 18 14:43:41 UTC 2008 i686 GNU/Linux [local:27767] *** Process received signal *** [local:27767