Now that my PR to autodetect user namespaces has been merged in Open MPI (thanks everyone for the help!) I tried running containers on UCX enabled installation. The whole UCX setup confuses me a lot.
Is it possible with UCX enabled installation to tell Open MPI to use vader for shared memory and not UCX? Because UCX seems to have similar assumptions for its shared memory communication as vader, that processes can talk to each other somehow: mm_posix.c:445 UCX ERROR Error returned from open in attach. Permission denied. File name is: /proc/24149/fd/16 mm_ep.c:75 UCX ERROR failed to connect to remote peer with mm. remote mm_id: 103719165231238 pml_ucx.c:383 Error: ucp_ep_create(proc=6) failed: Shared memory error If I disable UCX '--mca pml ^ucx', shared memory communication works again, but network based communication is not happening for also unknown reasons. I tried to configure UCX with environment variables based on https://github.com/openucx/ucx/wiki/UCX-environment-parameters but that did not work. So my question is, how can I use Open MPI with UCX, but vader for local communication? Everything I am doing uses user namespace based containers for every process. Adrian _______________________________________________ users mailing list users@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/users