Hello Terry,
I do not have an active firewall. I have typed on both computers:
netstat -lnut
I enclose you the results.
I have also written on both computers:
mpirun -np 2 --host 10.1.10.208,10.1.10.240 --mca mpi_preconnect_all
1 --prefix /usr/local -mca btl self,tcp -mca btl_tcp_if_include eth1
./PruebaSumaParalela.out
I enclose you the results.
Thank you.
Sofia
----- Original Message -----
From: "Terry Dontje" <terry.don...@sun.com>
To: <us...@open-mpi.org>
Sent: Friday, September 19, 2008 7:54 PM
Subject: Re: [OMPI users] Problem with MPI_Send and MPI_Recv
Hello Sofia,
After further reflection I wonder if you have a firewall that is
preventing connections to certain ports.
--td
Terry Dontje wrote:
Hello Sofia,
Ok, so I really wanted the stack of when you run with "-mca
mpi_preconnect_all 1" I believe you'll see that one of the processes
will be in init. However, the stack still probably will not help me help
you. What needs to happen is to step through the code in dbx while the
connection is trying to be established. I am hoping you might find the
connect call fails or that we've been given an interface that somehow
cannot reach the other node. However, when you specified "-mca
btl_tcp_if_include eth1" that should have forced things to use the
interface you need. So it really comes down to why are we not connecting
to the eth1 address? Are we failing on routing to that address or is the
connect failing because we are trying to use a port that we are not
really allowed to use or is it something else?
I don't think it is a routing problem since you are able to reach each
node via ssh. Is there someone else on the list that might want to lend
a hand here? I feel like I am missing something obvious going on here.
--td
Date: Fri, 19 Sep 2008 16:09:11 +0200
From: "Sofia Aparicio Secanellas" <sapari...@grpss.ssr.upm.es>
Subject: Re: [OMPI users] Problem with MPI_Send and MPI_Recv
To: "Open MPI Users" <us...@open-mpi.org>
Message-ID: <1BBF50FE29F743B5829CC3785F47CADD@aparicio1>
Content-Type: text/plain; charset="iso-8859-1"; Format="flowed"
Hello Terry,
I have installed 1.2.7 and I obtain the same result.
I will explain you what I have done.
1. On my computer edu@10.1.10.240 I have added a new user called sofia.
This way I have sofia@10.1.10.208 and sofia@10.1.10.240.
2. I have downloaded the openmpi 1.2.7 from the openmpi website on both
computers in /home/sofia/Desktop.
3. I have installed everything using "sudo ./configure", "sudo make" and
"sudo make install".
4. To make ssh not ask me for a password. I have typed in
sofia@10.1.10.208 "ssh-keygen -t dsa", "cd $HOME/.ssh" and "cp
id_dsa.pub authorized_keys". I have copied the directory
"/home/sofia/.ssh" from sofia@10.1.10.208 to /home/sofia/.ssh in
sofia@10.1.10.240. The ssh command without password works on computer
sofia@10.1.10.208 but computer sofia@10.1.10.208 ask me for a
passphrase and for the password. Is it normal?
5. I have created a directory "/home/sofia/programasparalelos" on both
computers and I have given permissions to the directory with "chmod
777".
6. I have copied on both computers in "/home/sofia/programasparalelos"
the program "PruebaSumaParalela.c" (I have changed a little bit the
program, I enclose you the new program) and I have compiled using "mpicc
PruebaSumaParalela.c -o PruebaSumaParalela.out".
7. Now I run the program on both computersusing the command:
mpirun -np2 --host 10.1.10.208,10.1.10.240 --prefix /usr/local
./PruebaSumaParalela.out
When I run the program I obtain 3 PIDs executing on every computer, 2
of "./PruebaSumaParalela.out" and 1 of "mpirun -np2 --host
10.1.10.208,10.1.10.240 --prefix /usr/local ./PruebaSumaParalela.out". I
enclose you the results obtained on every computer for every
"./PruebaSumaParalela.out".
Thank you very much.
Sofia
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
No virus found in this incoming message
Checked by PC Tools AntiVirus (4.0.0.26 - 10.100.007).
http://www.pctools.com/free-antivirus/
No virus found in this outgoing message
Checked by PC Tools AntiVirus (4.0.0.26 - 10.100.007).
http://www.pctools.com/free-antivirus/
current thread: t@3083814576
[1] 0xffffe410(0x0, 0x8051584, 0xb7d1fc29, 0x0, 0x0, 0xb7ec48d0), at 0xffffe410
[2] sigprocmask(0x0, 0x8051584, 0x0, 0xbfa13c40, 0x0, 0xb7ee4458), at 0xb7d1fc29
[3] opal_evsignal_recalc(0x8051584, 0x4, 0x0, 0xb7ee4458, 0x8051568, 0x4), at 0xb7ec47f4
[4] opal_poll_dispatch(0x8051530, 0x8051568, 0xbfa13d40, 0xb7f668b4, 0x80482e8, 0x1), at 0xb7ec5870
[5] opal_event_base_loop(0x8051530, 0x2, 0x0, 0xb7ee4458, 0xb7ec2e69), at 0xb7ec2ae9
[6] opal_event_loop(0x2, 0xb7c301a8, 0x1a9b860, 0x0, 0x1a9dd9c, 0x0), at 0xb7ec2e89
[7] opal_progress(0x820ca00, 0x8049c00, 0x1, 0xbfa13e68, 0xb7fda408, 0xb7fda408), at 0xb7ebd606
[8] mca_pml_ob1_recv(0xbfa13e68, 0x1, 0x8049c00, 0x1, 0x1, 0x8049d40), at 0xb7c25e56
[9] PMPI_Recv(0xbfa13e68, 0x1, 0x8049c00, 0x1, 0x1, 0x8049d40), at 0xb7fb1473
[10] main(0xb7ff7ce0, 0x8048950, 0xbfa13ef8, 0xb7d0b050, 0x1, 0xbfa13f24), at 0x80488f3
current thread: t@3083531968
[1] 0xffffe410(0x0, 0x8051ccc, 0xb7cda1a9, 0x0, 0x0, 0xb7e75b40), at 0xffffe410
[2] sigprocmask(0x0, 0x8051ccc, 0x0, 0xbfed9da0, 0x0, 0xb7e95458), at 0xb7cda1a9
[3] opal_evsignal_recalc(0x8051ccc, 0x4, 0x0, 0xb7e95458, 0x8051cb0, 0x4), at 0xb7e75a64
[4] opal_poll_dispatch(0x8051c78, 0x8051cb0, 0xbfed9ea0, 0xb7e8b114, 0xb7f188b4, 0xbfed9ee4), at 0xb7e76ae0
[5] opal_event_base_loop(0x8051c78, 0x2, 0x0, 0xb7e95458, 0xb7e740d9), at 0xb7e73d59
[6] opal_event_loop(0x2, 0xb7f806fc, 0x989680, 0x0, 0x98c2c2, 0x0), at 0xb7e740f9
[7] opal_progress(0xb7f19370, 0xef, 0xb7f0d000, 0x738ec, 0xb7f8c428, 0xb7f806fc), at 0xb7e6e646
[8] ompi_request_wait_all(0x1, 0x820eef8, 0x0, 0x0, 0x1, 0x8049d38), at 0xb7f3ca7a
[9] ompi_init_do_preconnect(0x2, 0x0, 0x0, 0xb7c6a6c0, 0x0, 0x0), at 0xb7f3edcd
[10] ompi_mpi_init(0x1, 0xbfeda144, 0x0, 0xbfeda038, 0xb7f16864, 0xbfeda0c0), at 0xb7f3d595
[11] PMPI_Init(0xbfeda0c0, 0xbfeda0c4, 0x0, 0x0, 0x0, 0x0), at 0xb7f600cb
[12] main(0xb7fa8ce0, 0x8048950, 0xbfeda118, 0xb7cc5ebc, 0x1, 0xbfeda144), at 0x80487f3
current thread: t@3083126448
=>[1] __sigaction(0x8051584), at 0xb7da5831
[2] opal_poll_recalc(0x8051530, 0x8051568, 0x0, 0xb7ebe8b4, 0x80482e8, 0x1), at 0xb7e1d7d0
[3] opal_event_base_loop(0x8051530, 0x2, 0x0, 0xb7e3c458, 0xb7e1ae69), at 0xb7e1a9b5
[4] opal_event_loop(0x2, 0xb7b881a8, 0x1a9b860, 0x0, 0x1a9dd40, 0x0), at 0xb7e1ae89
[5] opal_progress(0x820ca00, 0x8049c00, 0x1, 0xbfa90c38, 0xb7f32408, 0xb7f32408), at 0xb7e15606
[6] mca_pml_ob1_recv(0xbfa90c38, 0x1, 0x8049c00, 0x1, 0x1, 0x8049d40), at 0xb7b7de56
[7] PMPI_Recv(0xbfa90c38, 0x1, 0x8049c00, 0x1, 0x1, 0x8049d40), at 0xb7f09473
[8] main(0xb7f4fce0, 0x8048950, 0xbfa90cc8, 0xb7c63050, 0x1, 0xbfa90cf4), at 0x80488f3
current thread: t@3083523776
[1] 0xffffe410(0xb7e93458, 0x0, 0xb7d2eec6, 0xb7e71d75, 0xbfe80778, 0x0), at 0xffffe410
[2] __gettimeofday(0x8051c78, 0x2, 0x0, 0xb7e93458, 0xb7e720d9), at 0xb7d2eec6
[3] opal_event_loop(0x2, 0xb7f7e6fc, 0x989680, 0x0, 0x98c2c2, 0x0), at 0xb7e720f9
[4] opal_progress(0xb7f17370, 0xef, 0xb7f0b000, 0x738ec, 0xb7f8a428, 0xb7f7e6fc), at 0xb7e6c646
[5] ompi_request_wait_all(0x1, 0x820eee0, 0x0, 0x0, 0x1, 0x8049d38), at 0xb7f3aa7a
[6] ompi_init_do_preconnect(0x2, 0x0, 0x0, 0xb7c686c0, 0x0, 0x0), at 0xb7f3cdcd
[7] ompi_mpi_init(0x1, 0xbfe80a24, 0x0, 0xbfe80918, 0xb7f14864, 0xbfe809a0), at 0xb7f3b595
[8] PMPI_Init(0xbfe809a0, 0xbfe809a4, 0x0, 0x0, 0x0, 0x0), at 0xb7f5e0cb
[9] main(0xb7fa6ce0, 0x8048950, 0xbfe809f8, 0xb7cc3ebc, 0x1, 0xbfe80a24), at 0x80487f3
Conexiones activas de Internet (solo servidores)
Protocolo Recv-Q Send-Q Dirección Local Dirección Externa Estado
tcp 0 0 0.0.0.0:56325 0.0.0.0:* ESCUCHAR
tcp 0 0 0.0.0.0:55372 0.0.0.0:* ESCUCHAR
tcp 0 0 127.0.0.1:631 0.0.0.0:* ESCUCHAR
tcp 0 0 0.0.0.0:47260 0.0.0.0:* ESCUCHAR
tcp6 0 0 :::22 :::* ESCUCHAR
udp 0 0 0.0.0.0:32768 0.0.0.0:*
udp 0 0 0.0.0.0:68 0.0.0.0:*
udp 0 0 0.0.0.0:5353 0.0.0.0:*
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State
tcp 0 0 0.0.0.0:60160 0.0.0.0:* LISTEN
tcp 0 0 127.0.0.1:2208 0.0.0.0:* LISTEN
tcp 0 0 0.0.0.0:56930 0.0.0.0:* LISTEN
tcp 0 0 0.0.0.0:58761 0.0.0.0:* LISTEN
tcp 0 0 0.0.0.0:59339 0.0.0.0:* LISTEN
tcp 0 0 0.0.0.0:8880 0.0.0.0:* LISTEN
tcp 0 0 0.0.0.0:21 0.0.0.0:* LISTEN
tcp 0 0 127.0.0.1:631 0.0.0.0:* LISTEN
tcp 0 0 127.0.0.1:2207 0.0.0.0:* LISTEN
tcp6 0 0 :::22 :::* LISTEN
udp 0 0 0.0.0.0:32768 0.0.0.0:*
udp 0 0 0.0.0.0:68 0.0.0.0:*
udp 0 0 0.0.0.0:5353 0.0.0.0:*