Hello,
I'm having a communication problem between two processes (one being
spawned by the other, on the *same* physical machine). Everything works
as expected when a network interface such as eth0 or wlo1 is up, but
as soon as they are all down, I get errors (such as « At least one pair of
MPI processes are unable to reach each other for MPI communications […] »).
I tried specifying a set of MCA parameters, including "self" in the btl
parameter and the lo interface in the btl_tcp_if_include list, as
advised by https://www.open-mpi.org/faq/?category=tcp, but I could not
get this code to work with the "external" network interfaces down.
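
To illustrate the kind of settings described above, here is a minimal
sketch using Open MPI's OMPI_MCA_<name> environment variables. It is an
assumption about the general shape of the configuration, not the exact
parameters from the failing run (those are in the full log), and the
executable name ./parent is a placeholder that does not come from the
example tarball.

```shell
# Sketch only: restrict Open MPI to the "self" and "tcp" BTLs and tell
# the TCP BTL to use the loopback interface. The values are assumptions.
export OMPI_MCA_btl=self,tcp
export OMPI_MCA_btl_tcp_if_include=lo

# Show which MCA parameters are now set in the environment.
env | grep '^OMPI_MCA_'

# Equivalent command-line form (./parent is a hypothetical name):
#   mpirun --mca btl self,tcp --mca btl_tcp_if_include lo -np 1 ./parent
```

Environment variables are inherited by the launched processes, which may
make them more convenient than command-line flags when one process
spawns another.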

Any idea what I might be doing wrong?

Example code that triggers the problem: https://ptpb.pw/YOjr.tar.gz
ompi_info output: https://ptpb.pw/Vt_V.txt
Full log: https://ptpb.pw/JCXn.txt

Rémy

