On May 15, 2018, at 1:39 AM, Max Mellette <wmell...@ucsd.edu> wrote:
>
> Thanks everyone for all your assistance. The problem seems to be resolved
> now, although I'm not entirely sure why these changes made a difference.
> There were two things I changed:
>
> (1) I had some additional `export ...` lines in .bashrc before the `export
> PATH=...` and `export LD_LIBRARY_PATH=...` lines. When I removed those lines
> (and then later added them back in below the PATH and LD_LIBRARY_PATH lines)
> mpirun worked. But only b09-30 was able to execute code on b09-32 and not the
> other way around.

It depends on what those "export ..." lines were, and whether you moved them below the point where non-interactive shells exit your .bashrc.

> (2) I passed IP addresses to mpirun instead of the hostnames (this didn't
> work previously), and now mpirun works in both directions (b09-30 -> b09-32
> and b09-32 -> b09-30). I added a 3rd host in the rack and mpirun still works
> when passing IP addresses. For some reason using the host name doesn't work
> despite the fact that I can use it to ssh.

FWIW, that *shouldn't* matter. Gus pointed out that you can use /etc/hosts, but Open MPI should be fully able to use names instead of IP addresses. If you're having problems with this, it makes me think that there may still be something weird in your environment, but hey, if you're OK using IP addresses and that's working, it might be good enough. :-)

--
Jeff Squyres
jsquy...@cisco.com
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users
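
To illustrate the point about non-interactive shells exiting .bashrc early, here is a minimal sketch. It assumes your .bashrc contains the kind of interactive-shell guard that many distributions ship by default, which may not match your actual file. The rc file is a temporary stand-in, and the variable names ABOVE_GUARD and BELOW_GUARD are hypothetical, chosen only for the demo.

```shell
#!/usr/bin/env bash
# Sketch: show that a non-interactive shell only sees exports placed
# ABOVE an interactive-shell guard. The rc file and variable names are
# hypothetical stand-ins, not taken from the original poster's setup.

rc="$(mktemp)"

cat > "$rc" <<'EOF'
# Exported before the guard: visible to every shell that reads this file.
export ABOVE_GUARD=1

# A guard of this shape appears in many default .bashrc files.
case $- in
    *i*) ;;        # interactive shell: keep reading
    *) return ;;   # non-interactive shell: stop here
esac

# Exported after the guard: only interactive shells reach this line.
export BELOW_GUARD=1
EOF

# "bash -c" starts a non-interactive shell, so $- contains no "i".
result="$(bash -c '. "$1"; echo "above=${ABOVE_GUARD:-unset} below=${BELOW_GUARD:-unset}"' _ "$rc")"
echo "$result"

rm -f "$rc"
```

In a non-interactive shell, only the export above the guard should survive. That is why settings such as PATH and LD_LIBRARY_PATH need to sit above any such guard if processes launched over ssh are to see them.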