Ah, ha!
Yes, that seems to be it. Thanks.
If I might ask a configure-related question: given that we have
these installed with the CentOS 7.6 we are running,
$ yum list installed libevent\*
Loaded plugins: langpacks
Installed Packages
libevent.x86_64 2.0.21-4.el7 @anaconda
libevent-devel.x86_64 2.0.21-4.el7 @os
should we be able to use this?
./configure ... --with-libevent=external --with-hwloc=external
My coworker reported that something was awry using that, and he has instead used
./configure ... --with-libevent=external --with-hwloc=/usr
I believe the problem was that if we did not specify /usr, then
srun and mpirun were unable to find the interfaces. But I also recall
from an earlier thread that specifying /usr is very much not recommended.
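For reference, a rough sketch of how one might check whether the
development headers that an external build would need are visible, and
what an existing build reports. The header paths are assumptions based
on the stock CentOS 7 layout, check_header is a hypothetical helper
name, and the grep pattern assumes the usual "MCA <framework>:" lines
in ompi_info output.

```shell
# Sketch only: report whether the libevent/hwloc development headers are present.
check_header() {
    # $1 = label, $2 = header path (assumed location, adjust as needed)
    if [ -f "$2" ]; then
        echo "$1: found $2"
    else
        echo "$1: missing $2"
    fi
}

# Assumed default locations for the libevent-devel and hwloc-devel packages.
check_header libevent /usr/include/event2/event.h
check_header hwloc /usr/include/hwloc.h

# If an Open MPI build is already on PATH, list the event and hwloc
# components it reports; print nothing if ompi_info is absent.
if command -v ompi_info >/dev/null 2>&1; then
    ompi_info | grep -E 'MCA (event|hwloc):' || true
fi
```

If the hwloc line comes back as missing, that may simply mean the
hwloc development package is not installed, which would be worth
ruling out before comparing the two configure lines.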
We are still struggling with new IB hardware, a new scheduler (Slurm),
PMIx, and OpenMPI, so I am still a bit muddled about how all the moving
pieces work together.
On Sun, Feb 2, 2020 at 4:16 PM Jeff Squyres (jsquyres)
<[email protected]> wrote:
>
> Bennet --
>
> Just curious: is there a reason you're not using UCX?
>
>
> > On Feb 2, 2020, at 4:06 PM, Bennet Fauber via users
> > <[email protected]> wrote:
> >
> > We get these warnings/error from OpenMPI, version 3.1.4 and 4.0.2
> >
> > --------------------------------------------------------------------------
> > WARNING: No preset parameters were found for the device that Open MPI
> > detected:
> >
> > Local host: gl3080
> > Device name: mlx5_0
> > Device vendor ID: 0x02c9
> > Device vendor part ID: 4123
> >
> > Default device parameters will be used, which may result in lower
> > performance. You can edit any of the files specified by the
> > btl_openib_device_param_files MCA parameter to set values for your
> > device.
> >
> > NOTE: You can turn off this warning by setting the MCA parameter
> > btl_openib_warn_no_device_params_found to 0.
> > --------------------------------------------------------------------------
> >
> > --------------------------------------------------------------------------
> > WARNING: There was an error initializing an OpenFabrics device.
> >
> > Local host: gl3080
> > Local device: mlx5_0
> > --------------------------------------------------------------------------
> >
> > Does anyone know how I can find the parameters that should be set in
> > $PREFIX/etc/btl_openib_device_param.conf or other OpenMPI
> > configuration files so that those warnings do not occur?
> >
> > How might I find the cause of the initialization error?
> >
> > Sorry for the ignorance behind this question.
>
>
> --
> Jeff Squyres
> [email protected]
>