I solved the problem. For some reason the
OMPI_MCA_btl_openib_cpc_include environment variable was set to udcm
during the tests. Setting it to rdmacm instead solved the
issue.
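
In case it helps anyone who hits the same thing, here is a minimal sketch of how the variable can be checked and pinned in a job script. It assumes a bash-like shell; only the variable name and the two values (udcm, rdmacm) come from my setup, the rest is illustrative.

```shell
# Show the value inherited from the environment, or "<unset>" if it is absent.
echo "before: ${OMPI_MCA_btl_openib_cpc_include:-<unset>}"

# Select the RDMA connection manager for the openib BTL instead of udcm.
export OMPI_MCA_btl_openib_cpc_include=rdmacm

# Confirm the value that launched processes will inherit.
echo "after: ${OMPI_MCA_btl_openib_cpc_include}"
```

The same MCA parameter can also be given per run on the mpirun command line as "--mca btl_openib_cpc_include rdmacm", which avoids depending on whatever the environment happens to contain.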
Thanks anyway!
Davide
On Thu, 2016-03-03 at 16:40 -0600, Davide Vanzo wrote:
Hi all,
In our cluster the nodes are interconnected with RoCE and I want to set
up OpenMPI to run on it via SLURM.
I initially compiled OpenMPI 1.10.2 with IB verbs support only, and I
had no problem making it run over RoCE.
I then successfully built it with SLURM support as follows:
./config