The PR simply disables nvml in hwloc is CUDA is disabled in Open MPI.
it also add cuda directory to CPPFLAGS, so there should be no need to
manually add -I/usr/local/cuda/include to CPPFLAGS.
Siegmar,
could you please post your config.log
also, is there a nvml.h file in /usr/local/cuda/include
last but not least, can you please run
make V=1
and post the output related to the compilation of topology-nvml.lo
Thanks and regards,
Gilles
On 4/21/2017 3:07 AM, r...@open-mpi.org wrote:
This is a known issue due to something in the NVIDIA library and it’s
interactions with hwloc. Your tarball tag indicates you should have
the attempted fix in it, so likely that wasn’t adequate. See
https://github.com/open-mpi/ompi/pull/3283 for the discussion
On Apr 20, 2017, at 8:11 AM, Siegmar Gross
<siegmar.gr...@informatik.hs-fulda.de
<mailto:siegmar.gr...@informatik.hs-fulda.de>> wrote:
Hi,
I tried to install openmpi-master-201704200300-ded63c on my "SUSE Linux
Enterprise Server 12.2 (x86_64)" with Sun C 5.14 and gcc-6.3.0.
Unfortunately, "make" breaks with the following error for gcc. I've had
no problems with cc.
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 136 grep
topology log.make.Linux.x86_64.64_gcc
CC topology.lo
CC topology-noos.lo
CC topology-synthetic.lo
CC topology-custom.lo
CC topology-xml.lo
CC topology-xml-nolibxml.lo
CC topology-pci.lo
CC topology-nvml.lo
../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-nvml.c:14:18:
fatal error: nvml.h: No such file or directory
Makefile:2181: recipe for target 'topology-nvml.lo' failed
make[4]: *** [topology-nvml.lo] Error 1
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 137
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 137 grep
topology
../openmpi-master-201704200300-ded63c5-Linux.x86_64.64_cc/log.make.Linux.x86_64.64_cc
CC topology.lo
CC topology-noos.lo
CC topology-synthetic.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-synthetic.c",
line 851: warning: initializer will be sign-extended: -1
CC topology-custom.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-custom.c",
line 88: warning: initializer will be sign-extended: -1
CC topology-xml.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-xml.c",
line 1815: warning: initializer will be sign-extended: -1
CC topology-xml-nolibxml.lo
CC topology-pci.lo
CC topology-nvml.lo
CC topology-linux.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-linux.c",
line 2919: warning: initializer will be sign-extended: -1
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-linux.c",
line 2919: warning: initializer will be sign-extended: -1
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-linux.c",
line 2919: warning: initializer will be sign-extended: -1
CC topology-hardwired.lo
CC topology-x86.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-x86.c",
line 122: warning: initializer will be sign-extended: -1
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 138
I used the following commands to configure the package.
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 145 head
-7 config.log |tail -1
$ ../openmpi-master-201704200300-ded63c5/configure
--prefix=/usr/local/openmpi-master_64_gcc
--libdir=/usr/local/openmpi-master_64_gcc/lib64
--with-jdk-bindir=/usr/local/jdk1.8.0_66/bin
--with-jdk-headers=/usr/local/jdk1.8.0_66/include
JAVA_HOME=/usr/local/jdk1.8.0_66 LDFLAGS=-m64 CC=gcc CXX=g++
FC=gfortran CFLAGS=-m64 CXXFLAGS=-m64 FCFLAGS=-m64 CPP=cpp CXXCPP=cpp
--enable-mpi-cxx --enable-cxx-exceptions --enable-mpi-java
--with-cuda=/usr/local/cuda --with-valgrind=/usr/local/valgrind
--with-hwloc=internal --without-verbs --with-wrapper-cflags=-std=c11
-m64 --with-wrapper-cxxflags=-m64 --with-wrapper-fcflags=-m64
--enable-debug
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 146
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 146 head
-7
../openmpi-master-201704200300-ded63c5-Linux.x86_64.64_cc/config.log
| tail -1
$ ../openmpi-master-201704200300-ded63c5/configure
--prefix=/usr/local/openmpi-master_64_cc
--libdir=/usr/local/openmpi-master_64_cc/lib64
--with-jdk-bindir=/usr/local/jdk1.8.0_66/bin
--with-jdk-headers=/usr/local/jdk1.8.0_66/include
JAVA_HOME=/usr/local/jdk1.8.0_66 LDFLAGS=-m64 -mt -Wl,-z
-Wl,noexecstack -L/usr/local/lib64 -L/usr/local/cuda/lib64 CC=cc
CXX=CC FC=f95 CFLAGS=-m64 -mt -I/usr/local/include
-I/usr/local/cuda/include CXXFLAGS=-m64 -I/usr/local/include
-I/usr/local/cuda/include FCFLAGS=-m64 CPP=cpp -I/usr/local/include
-I/usr/local/cuda/include CXXCPP=cpp -I/usr/local/include
-I/usr/local/cuda/include --enable-mpi-cxx --enable-cxx-exceptions
--enable-mpi-java --with-cuda=/usr/local/cuda
--with-valgrind=/usr/local/valgrind --with-hwloc=internal
--without-verbs --with-wrapper-cflags=-m64 -mt
--with-wrapper-cxxflags=-m64 --with-wrapper-fcflags=-m64
--with-wrapper-ldflags=-mt --enable-debug
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 147
I would be grateful, if somebody can fix the problem. Do you need
anything
else? Thank you very much for any help in advance.
Kind regards
Siegmar
_______________________________________________
users mailing list
users@lists.open-mpi.org <mailto:users@lists.open-mpi.org>
https://rfd.newmexicoconsortium.org/mailman/listinfo/users
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users