They had a port configured for Ethernet and did not exclude it. OpenIB emits a warning about not finding a suitable CPC.
Josh On Thu, May 5, 2016 at 9:06 PM, Andy Riebs <andy.ri...@hpe.com> wrote: > Sorry, my output listing was incomplete -- the program did run after the > "No OpenFabrics" message, but (I presume) ran over Ethernet rather than > InfiniBand. So I can't really say what was causing it to fail. > > Andy > > > On 05/05/2016 06:09 PM, Nathan Hjelm wrote: > > It should work fine with ob1 (the default). Did you determine what was > causing it to fail? > > -Nathan > > On Thu, May 05, 2016 at 06:04:55PM -0400, Andy Riebs wrote: > > For anyone like me who happens to google this in the future, the solution > was to set OMPI_MCA_pml=yalla > > Many thanks Josh! > > On 05/05/2016 12:52 PM, Joshua Ladd wrote: > > We are working with Andy offline. > > Josh > On Thu, May 5, 2016 at 7:32 AM, Andy Riebs <andy.ri...@hpe.com> > <andy.ri...@hpe.com> wrote: > > I've built 1.10.2 with all my favorite configuration options, but I > get messages such as this (one for each rank with > orte_base_help_aggregate=0) when I try to run on a MOFED system: > > $ shmemrun -H hades02,hades03 $PWD/shmem.out > > -------------------------------------------------------------------------- > No OpenFabrics connection schemes reported that they were able to be > used on a specific port. As such, the openib BTL (OpenFabrics > support) will be disabled for this port. > > Local host: hades03 > Local device: mlx4_0 > Local port: 2 > CPCs attempted: rdmacm, udcm > > -------------------------------------------------------------------------- > > My configure options: > config_opts="--prefix=${INSTALL_DIR} \ > --without-mpi-param-check \ > --with-knem=/opt/mellanox/hpcx/knem \ > --with-mxm=/opt/mellanox/mxm \ > --with-mxm-libdir=/opt/mellanox/mxm/lib \ > --with-fca=/opt/mellanox/fca \ > --with-pmi=${INSTALL_ROOT}/slurm \ > --without-psm --disable-dlopen \ > --disable-vt \ > --enable-orterun-prefix-by-default \ > --enable-debug-symbols" > > There aren't any obvious error messages in the build log -- what am I > missing? > > Andy > > -- > Andy Riebs > andy.ri...@hpe.com > Hewlett-Packard Enterprise > High Performance Computing Software Engineering > +1 404 648 9024 > My opinions are not necessarily those of HPE > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/05/29094.php > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/05/29100.php > > _______________________________________________ > users mailing listus...@open-mpi.org > Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/05/29101.php > > > > _______________________________________________ > users mailing listus...@open-mpi.org > Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users > > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/05/29102.php > > > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2016/05/29104.php >