I believe this is an expected warning in OMPI 4.0.x series as the openib BTL is 
being deprecated (https://www.open-mpi.org/software/ompi/major-changes.php)

Try adding:
        --mca btl_openib_warn_no_device_params_found 0 --mca 
btl_openib_allow_ib true
To suppress this warning.

This issue (https://github.com/open-mpi/ompi/issues/6300) may be relevant.

> -----Original Message-----
> From: users <users-boun...@lists.open-mpi.org> On Behalf Of Prentice
> Bisbal via users
> Sent: Thursday, March 18, 2021 3:28 PM
> To: users@lists.open-mpi.org
> Cc: Prentice Bisbal <pbis...@pppl.gov>
> Subject: Re: [OMPI users] [External] Re: Error intialising an OpenFabrics
> device.
> 
> >   If you disable it with -mtl ^openib the warning goes away.
> And the performance of openib goes away right along with it.
> 
> Prentice
> 
> On 3/13/21 5:43 PM, Heinz, Michael William via users wrote:
> > I’ve begun getting this annoyingly generic warning, too. It appears to be
> coming from the openib provider. If you disable it with -mtl ^openib the
> warning goes away.
> >
> > Sent from my iPad
> >
> >> On Mar 13, 2021, at 3:28 PM, Bob Beattie via users <us...@lists.open-
> mpi.org> wrote:
> >>
> >> Hi everyone,
> >>
> >> To be honest, as an MPI / IB noob, I don't know if this falls under
> OpenMPI or Mellanox....
> >>
> >> Am running a small cluster of HP DL380 G6/G7 machines.
> >> Each runs Ubuntu server 20.04 and has a Mellanox ConnectX-3 card,
> connected by an IS dumb switch.
> >> When I begin my MPI program (snappyHexMesh for OpenFOAM) I get an
> error reported.
> >> The error doesn't stop my programs or appear to cause any problems, so
> this request for help is more about delving into the why.
> >>
> >> OMPI is compiled from source using v4.0.3; which is the default
> >> version for Ubuntu 20.04 This compiles and works.  I did this because I
> wanted to understand the compilation process whilst using a known working
> OMPI version.
> >>
> >> The Infiniband part is the Mellanox MLNXOFED installer v4.9-0.1.7.0
> >> and I install that with --dkms --without-fw-update --hpc
> >> --with-nfsrdma
> >>
> >> The actual error reported is:
> >> Warning: There was an error initialising an OpenFabrics device.
> >>    Local host:     of1
> >>    Local device: mlx4_0
> >>
> >> Then shortly after:
> >> [of1:1015399] 19 more processes have sent help message
> >> help-mpi-btl-openib.txt / error in device init [of1:1015399] Set MCA
> >> parameter "orte_base_help_aggregate" to 0 to see all help / error
> >> messages
> >>
> >> Adding this MCA parameter to the mpirun line simply gives me 20 or so
> copies of the first warning.
> >>
> >> Any ideas anyone ?
> >> Cheers,
> >> Bob.

Reply via email to