Hello all,


after the correct configuration, mpirun (v 1.10.2) works fine when all tpc 
ports are open. I can ssh to all hosts without a password.

Then it comes back to my first question: how to specify the ports for MPI 
communication?

I opened the ports 40000-50000 for outgoing traffic, when I run:

mpirun --mca btl_tcp_port_min_v4 40040 --mca btl_tcp_port_range_v4 10 --mca 
oob_tcp_static_ipv4_ports 40020 --host <IP1>,<IP2>  hostname

it works, but not every time. Same as when I run mpirun  --mca 
oob_tcp_static_ipv4_ports 40020 --host <IP1>,<IP2>  hostname

It is strange that sometimes I can get outputs, sometimes it just hangs. Did I 
miss something?



Best,

Ping





Von: users [mailto:users-boun...@open-mpi.org] Im Auftrag von Gilles 
Gouaillardet
Gesendet: Freitag, 3. Juni 2016 00:14
An: Open MPI Users
Betreff: Re: [OMPI users] Firewall settings for MPI communication



The syntax is

configure --enable-mpirun-prefix-by-default --prefix=<path to OpenMPI> ...



all hosts must be able to ssh each other passwordless.

that means you need to generate a user ssh key pair on all hosts, add your 
public keys to the list of authorized keys, and ssh to all hosts in order to 
populate your known hosts

(ssh requires you confirm host public keys the very first time you ssh to a new 
host)

iirc, that can be automated with ssh-keyscan.



when ssh is fully configured, mpirun should work just fine



Cheers,



Gilles


On Friday, June 3, 2016, Ping Wang < <mailto:ping.w...@asc-s.de> 
ping.w...@asc-s.de> wrote:

Hi,



thank you Gilles for your suggestion.  I tried:  mpirun --prefix <path to Open 
MPI>  --host <public IP> hostname, then it works.

I’m sure both IPs are the ones of the VM on which mpirun is running, and they 
are unique.



I also configured Open MPI with --enable-mpirun-prefix-by-default, but I still 
need to add --prefix <path to Open MPI> to get mpirun work.

I used: ./configure --enable-mpirun-prefix-by-default ="<path to Open MPI>  "
             make
             make install

Did I miss something or I misunderstood the way to configure Open MPI?



When I run: ssh < internal/public IP > `which orted`

The output is: Warning: Permanently added < internal/public IP > ' (ECDSA) to 
the list of known hosts.
/usr/local/bin/orted

Is it all right?



Cheers,

Ping





Von: users [mailto: 
<javascript:_e(%7B%7D,'cvml','users-boun...@open-mpi.org');> 
users-boun...@open-mpi.org] Im Auftrag von Gilles Gouaillardet
Gesendet: Donnerstag, 2. Juni 2016 17:06
An: Open MPI Users
Betreff: Re: [OMPI users] Firewall settings for MPI communication



are you saying both IP are the ones of the VM on which mpirun is running ?

orted is only launched on all the machines *except* the one running mpirun.



can you double/triple check the IPs are ok and unique ?

for example, mpirun --host <internal IP> /sbin/ifconfig -a

can you also make sure Open MPI is installed on all your VMs in the same 
directory ?

also make sure Open MPI has all the dependencies on all the VMs

ssh xxx ldd `which orted`

should show no missing dependency



generally speaking, I recommend you configure Open MPI with

--enable-mpirun-prefix-by-default



you can also try to replace

mpirun

with

`which mpirun`

or

mpirun --prefix <path to Open MPI>



Cheers,



Gilles

On Thursday, June 2, 2016, Ping Wang < 
<javascript:_e(%7B%7D,'cvml','ping.w...@asc-s.de');> ping.w...@asc-s.de> wrote:

Hi,

I've installed Open MPI v1.10.2. Every VM on the cloud has two IPs (internal 
IP, public IP).
When I run: mpirun --host <internal IP> hostname, the output is the hostname of 
the VM.
But when I run: mpirun --host <public IP> hostname, the output is

bash: orted: command not found
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
This usually is caused by:

* not finding the required libraries and/or binaries on
  one or more nodes. Please check your PATH and LD_LIBRARY_PATH
  settings, or configure OMPI with --enable-orterun-prefix-by-default

* lack of authority to execute on one or more specified nodes.
  Please verify your allocation and authorities.

* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
  Please check with your sys admin to determine the correct location to use.

*  compilation of the orted with dynamic libraries when static are required
  (e.g., on Cray). Please check your configure cmd line and consider using
  one of the contrib/platform definitions for your system type.

* an inability to create a connection back to mpirun due to a
  lack of common network interfaces and/or no route found between
  them. Please check network connectivity (including firewalls
  and network routing requirements).

Both IPs are the IP of the VM where MPI is running. Did I do something wrong in 
the configuration?

Thanks for any help.

Ping

-----Ursprüngliche Nachricht-----
Von: users [mailto:users-boun...@open-mpi.org] Im Auftrag von Jeff Squyres 
(jsquyres)
Gesendet: Mittwoch, 1. Juni 2016 15:02
An: Open MPI User's List
Betreff: Re: [OMPI users] Firewall settings for MPI communication

In addition, you might want to consider upgrading to Open MPI v1.10.x (v1.6.x 
is fairly ancient).

> On Jun 1, 2016, at 7:46 AM, Gilles Gouaillardet < 
> <mailto:gilles.gouaillar...@gmail.com> gilles.gouaillar...@gmail.com> wrote:
>
> which network are your VMs using for communications ?
> if this is tcp, then you also have to specify a restricted set of
> allowed ports for the tcp btl
>
> that would be something like
> mpirun --mca btl_tcp_dynamic_ports 49990-50010 ...
>
> please double check the Open MPI 1.6.5 parameter and syntax with
> ompi_info --all (or check the archives, I think I posted the correct
> command line a few weeks ago)
>
> Cheers,
>
> Gilles
>
> On Wednesday, June 1, 2016, Ping Wang < <mailto:ping.w...@asc-s.de> 
> ping.w...@asc-s.de> wrote:
> I'm using Open MPI 1.6.5 to run OpenFOAM in parallel on several VMs on
> a cloud. mpirun hangs without any error messages. I think this is a
> firewall issue. Because when I open all the TCP ports(1-65535) in the
> security group of VMs, mpirun works well. However I was suggested to
> open as less ports as possible. So I have to limit MPI to run on a
> range of ports. I opened the port range 49990-50010 for MPI
> communication. And use command
>
>
>
> mpirun --mca oob_tcp_dynamic_ports 49990-50010 -np 4 --hostfile machines 
> simpleFoam –parallel.
>
>
>
> But it still hangs. How can I specify a port range that OpenMPI will use? I 
> appreciate any help you can provide.
>
>
>
> Best,
>
> Ping Wang
>
>
>
> <image001.png>
>
> ------------------------------------------------------
>
> Ping Wang
>
> Automotive Simulation Center Stuttgart e.V.
>
> Nobelstraße 15
>
> D-70569 Stuttgart
>
> Telefon: +49 711 699659-14
>
> Fax: +49 711 699659-29
>
> E-Mail:  <mailto:ping.w...@asc-s.de> ping.w...@asc-s.de
>
> Web:  <http://www.asc-s.de> http://www.asc-s.de
>
> Social Media: <image002.gif>/asc.stuttgart
>
> ------------------------------------------------------
>
>
>
>
>
> _______________________________________________
> users mailing list
>  <mailto:us...@open-mpi.org> us...@open-mpi.org
> Subscription:  <https://www.open-mpi.org/mailman/listinfo.cgi/users> 
> https://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post:
>  <http://www.open-mpi.org/community/lists/users/2016/06/29340.php> 
> http://www.open-mpi.org/community/lists/users/2016/06/29340.php


--
Jeff Squyres
 <mailto:jsquy...@cisco.com> jsquy...@cisco.com
For corporate legal information go to:  
<http://www.cisco.com/web/about/doing_business/legal/cri/> 
http://www.cisco.com/web/about/doing_business/legal/cri/

_______________________________________________
users mailing list
 <mailto:us...@open-mpi.org> us...@open-mpi.org
Subscription:  <https://www.open-mpi.org/mailman/listinfo.cgi/users> 
https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post:  
<http://www.open-mpi.org/community/lists/users/2016/06/29342.php> 
http://www.open-mpi.org/community/lists/users/2016/06/29342.php



_______________________________________________
users mailing list
 <mailto:us...@open-mpi.org> us...@open-mpi.org
Subscription:  <https://www.open-mpi.org/mailman/listinfo.cgi/users> 
https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post:  
<http://www.open-mpi.org/community/lists/users/2016/06/29349.php> 
http://www.open-mpi.org/community/lists/users/2016/06/29349.php



Reply via email to