The PR simply disables nvml in hwloc is CUDA is disabled in Open MPI.

it also add cuda directory to CPPFLAGS, so there should be no need to manually add -I/usr/local/cuda/include to CPPFLAGS.


Siegmar,

could you please post your config.log

also, is there a nvml.h file in /usr/local/cuda/include

last but not least, can you please run

make V=1

and post the output related to the compilation of topology-nvml.lo


Thanks and regards,


Gilles


On 4/21/2017 3:07 AM, r...@open-mpi.org wrote:
This is a known issue due to something in the NVIDIA library and it’s interactions with hwloc. Your tarball tag indicates you should have the attempted fix in it, so likely that wasn’t adequate. See https://github.com/open-mpi/ompi/pull/3283 for the discussion


On Apr 20, 2017, at 8:11 AM, Siegmar Gross <siegmar.gr...@informatik.hs-fulda.de <mailto:siegmar.gr...@informatik.hs-fulda.de>> wrote:

Hi,

I tried to install openmpi-master-201704200300-ded63c on my "SUSE Linux
Enterprise Server 12.2 (x86_64)" with Sun C 5.14 and gcc-6.3.0.
Unfortunately, "make" breaks with the following error for gcc. I've had
no problems with cc.


loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 136 grep topology log.make.Linux.x86_64.64_gcc
 CC       topology.lo
 CC       topology-noos.lo
 CC       topology-synthetic.lo
 CC       topology-custom.lo
 CC       topology-xml.lo
 CC       topology-xml-nolibxml.lo
 CC       topology-pci.lo
 CC       topology-nvml.lo
../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-nvml.c:14:18: fatal error: nvml.h: No such file or directory
Makefile:2181: recipe for target 'topology-nvml.lo' failed
make[4]: *** [topology-nvml.lo] Error 1
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 137





loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 137 grep topology ../openmpi-master-201704200300-ded63c5-Linux.x86_64.64_cc/log.make.Linux.x86_64.64_cc
 CC       topology.lo
 CC       topology-noos.lo
 CC       topology-synthetic.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-synthetic.c", line 851: warning: initializer will be sign-extended: -1
 CC       topology-custom.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-custom.c", line 88: warning: initializer will be sign-extended: -1
 CC       topology-xml.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-xml.c", line 1815: warning: initializer will be sign-extended: -1
 CC       topology-xml-nolibxml.lo
 CC       topology-pci.lo
 CC       topology-nvml.lo
 CC       topology-linux.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-linux.c", line 2919: warning: initializer will be sign-extended: -1 "../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-linux.c", line 2919: warning: initializer will be sign-extended: -1 "../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-linux.c", line 2919: warning: initializer will be sign-extended: -1
 CC       topology-hardwired.lo
 CC       topology-x86.lo
"../../../../../../../openmpi-master-201704200300-ded63c5/opal/mca/hwloc/hwloc1116/hwloc/src/topology-x86.c", line 122: warning: initializer will be sign-extended: -1
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 138




I used the following commands to configure the package.

loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 145 head -7 config.log |tail -1 $ ../openmpi-master-201704200300-ded63c5/configure --prefix=/usr/local/openmpi-master_64_gcc --libdir=/usr/local/openmpi-master_64_gcc/lib64 --with-jdk-bindir=/usr/local/jdk1.8.0_66/bin --with-jdk-headers=/usr/local/jdk1.8.0_66/include JAVA_HOME=/usr/local/jdk1.8.0_66 LDFLAGS=-m64 CC=gcc CXX=g++ FC=gfortran CFLAGS=-m64 CXXFLAGS=-m64 FCFLAGS=-m64 CPP=cpp CXXCPP=cpp --enable-mpi-cxx --enable-cxx-exceptions --enable-mpi-java --with-cuda=/usr/local/cuda --with-valgrind=/usr/local/valgrind --with-hwloc=internal --without-verbs --with-wrapper-cflags=-std=c11 -m64 --with-wrapper-cxxflags=-m64 --with-wrapper-fcflags=-m64 --enable-debug
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 146




loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 146 head -7 ../openmpi-master-201704200300-ded63c5-Linux.x86_64.64_cc/config.log | tail -1 $ ../openmpi-master-201704200300-ded63c5/configure --prefix=/usr/local/openmpi-master_64_cc --libdir=/usr/local/openmpi-master_64_cc/lib64 --with-jdk-bindir=/usr/local/jdk1.8.0_66/bin --with-jdk-headers=/usr/local/jdk1.8.0_66/include JAVA_HOME=/usr/local/jdk1.8.0_66 LDFLAGS=-m64 -mt -Wl,-z -Wl,noexecstack -L/usr/local/lib64 -L/usr/local/cuda/lib64 CC=cc CXX=CC FC=f95 CFLAGS=-m64 -mt -I/usr/local/include -I/usr/local/cuda/include CXXFLAGS=-m64 -I/usr/local/include -I/usr/local/cuda/include FCFLAGS=-m64 CPP=cpp -I/usr/local/include -I/usr/local/cuda/include CXXCPP=cpp -I/usr/local/include -I/usr/local/cuda/include --enable-mpi-cxx --enable-cxx-exceptions --enable-mpi-java --with-cuda=/usr/local/cuda --with-valgrind=/usr/local/valgrind --with-hwloc=internal --without-verbs --with-wrapper-cflags=-m64 -mt --with-wrapper-cxxflags=-m64 --with-wrapper-fcflags=-m64 --with-wrapper-ldflags=-mt --enable-debug
loki openmpi-master-201704200300-ded63c5-Linux.x86_64.64_gcc 147



I would be grateful, if somebody can fix the problem. Do you need anything
else? Thank you very much for any help in advance.


Kind regards

Siegmar
_______________________________________________
users mailing list
users@lists.open-mpi.org <mailto:users@lists.open-mpi.org>
https://rfd.newmexicoconsortium.org/mailman/listinfo/users



_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users

_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users

Reply via email to