Now that my PR to autodetect user namespaces has been merged in Open MPI
(thanks everyone for the help!), I tried running containers on a
UCX-enabled installation. The whole UCX setup confuses me a lot.

With a UCX-enabled installation, is it possible to tell Open MPI to use
vader for shared memory instead of UCX? UCX seems to make the same kind
of assumption for its shared memory communication as vader does, namely
that processes can somehow talk to each other directly:

mm_posix.c:445  UCX  ERROR Error returned from open in attach. Permission denied. File name is: /proc/24149/fd/16
   mm_ep.c:75   UCX  ERROR failed to connect to remote peer with mm. remote mm_id: 103719165231238
pml_ucx.c:383  Error: ucp_ep_create(proc=6) failed: Shared memory error

If I disable UCX with '--mca pml ^ucx', shared memory communication works
again, but network-based communication does not happen, for reasons I
also do not understand.

I tried to configure UCX with environment variables based on
https://github.com/openucx/ucx/wiki/UCX-environment-parameters
but that did not work.

So my question is, how can I use Open MPI with UCX, but vader for
local communication?

In everything I am doing, every process runs in its own user namespace
based container.

                Adrian