[OMPI users] ompi-restart issue : ompi-restart doesn't work across nodes

2008-09-30 Thread arun dhakne
Hi all, I had gone through some previous ompi-restart issues but i couldn't find anything similar to this problem. I have installed blcr, and configured open-mpi 'openmpi-1.3a1r19645' i) If the sample mpi program say ( np 4 on single machine that is without any hostfile )is ran and I try to chec

[OMPI users] Running application with MPI_Comm_spawn() in multithreaded environment

2008-09-30 Thread Roberto Fichera
Hi All on the list, I'm trying to execute dynamic MPI applications using MPI_Comm_spawn(). The application I'm using for tests, basically is composed by a master, which spawn a slave in each assigned node in a multithreading fashion. The master is started with a number of jobs to perform and a fil

[OMPI users] possible bug: child stdout not redirected after MPI_Comm_spawn

2008-09-30 Thread André Gaul
Hey all! Last week I observed a strange behaviour of Open MPI when using MPI_Comm_spawn() to create new MPI processes: The child processes are started but after the childs call to MPI_Init() no output to stdout gets redirected to the stdout of the parent/mpirun process. Before the call to MPI_Init

Re: [OMPI users] MPI_ERR_TRUNCATE on MPI_Testsome

2008-09-30 Thread Jeff Squyres
On Sep 26, 2008, at 1:45 PM, Robert Kubrick wrote: I'm not sure how should I interpret this message: [local:17344] *** An error occurred in MPI_Testsome [local:17344] *** on communicator MPI COMMUNICATOR 5 CREATE FROM 0 [local:17344] *** MPI_ERR_TRUNCATE: message truncated [local:17344] *** MPI

Re: [OMPI users] Running application with MPI_Comm_spawn() in multithreaded environment

2008-09-30 Thread Roberto Fichera
Roberto Fichera ha scritto: > Hi All on the list, > > I'm trying to execute dynamic MPI applications using MPI_Comm_spawn(). > The application I'm using for tests, basically is > composed by a master, which spawn a slave in each assigned node in a > multithreading fashion. The master is started wit

Re: [OMPI users] possible bug: child stdout not redirected after MPI_Comm_spawn

2008-09-30 Thread Jeff Squyres
I believe that this issue has been fixed for the upcoming v1.3 series; it will not be fixed in the v1.2 series (we made extensive overhauls to the underlying run-time system for v1.3 which would be extraordinarily difficult to port back to the v1.2 series). On Sep 30, 2008, at 9:35 AM, And

Re: [OMPI users] TCP BTL in different subnets?

2008-09-30 Thread Jeff Squyres
Sorry for the delay in replying -- I thought I had replied to this already, but I guess I hadn't. :-( We've talked about this feature several times, but this specific functionality hasn't made it into the OMPI code base yet. Sorry! :-( (patches would be gladly accepted, but note that we'

Re: [OMPI users] TCP BTL in different subnets?

2008-09-30 Thread Jeroen Kleijer
Hi Jeff, No worries. I've been able to get the most recent (1.3a september 25th) to compile and it does exactly what I need it to do (which is work accross different subnets) and I can basically support that myself. (not quite sure what went wrong first time I tried this though) Strange thing is

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-30 Thread Jeff Squyres
This is quite the odd problem. 1. From prior mails, you do not seem to have iptables running to block any ports -- is there any other port blocking software running, perchance? 2. You do seem to be able to run non-MPI apps properly. 3. I assume that you would be able to run "hello world" k

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-30 Thread Sofia Aparicio Secanellas
Hello, This is quite the odd problem. 1. From prior mails, you do not seem to have iptables running to block any ports -- is there any other port blocking software running, perchance? I do not think so. What can I do to be sure that I do not have any other port blocking software running?

Re: [OMPI users] Execution in multicore machines

2008-09-30 Thread Jeff Squyres
Are these intel-based machines? I have seen similar effects mentioned earlier in this thread where having all 8 cores banging on memory pretty much kills performance on the UMA-style intel 8 core machines. I'm not a hardware expert, but I've stayed away from buying 8-core servers for exac

Re: [OMPI users] Execution in multicore machines

2008-09-30 Thread Torje Henriksen
Hi! If they are 8 core Intel machines, I believe this is the case: *) Each pair of cores share an L2-cache. So using two cores that share cache will probably reduce performance. *) Each Quad core CPU has its own memory bus (Dual independent bus), so using more than one core on a quad CPU can

Re: [OMPI users] Problem with MPI_Send and MPI_Recv

2008-09-30 Thread Jeff Squyres
On Sep 30, 2008, at 12:02 PM, Sofia Aparicio Secanellas wrote: 1. From prior mails, you do not seem to have iptables running to block any ports -- is there any other port blocking software running, perchance? I do not think so. What can I do to be sure that I do not have any other port bl

[OMPI users] 1.2.2 to 1.2.7 differences.

2008-09-30 Thread Joe Griffin
Hello All, My OpenMPI 1.2.2 version works fine. I think OPAL_PREFIX has changed. I install from RPM with --prefix. When I run with 1.2.7 I get the error: [em64t6:23450] [NO-NAME] ORTE_ERROR_LOG: Not found in file runtime/orte_init_stage1.c at line 182 ---

Re: [OMPI users] Running application with MPI_Comm_spawn() in multithreaded environment

2008-09-30 Thread Ralph Castain
Hi Roberto There is something wrong with this cmd line - perhaps it wasn't copied correctly? mpirun --verbose --debug-daemons --mca obl -np 1 -wdir `pwd` testmaster 1 $PBS_NODEFILE Specifically, the following is incomplete: --mca obl I'm not sure if this is the problem or not, but I