One very, very stupid question here. This arose over on the Slurm list actually. Those hostnames look like quite generic names, ie they are part of an HPC cluster? Do they happen to have independednt home directories for your userid? Could that possibly make a difference to the MPI launcher?
On 14 May 2018 at 06:44, Max Mellette <wmell...@ucsd.edu> wrote: > Hi Gilles, > > Thanks for the suggestions; the results are below. Any ideas where to go > from here? > > ----- Seems that selinux is not installed: > > user@b09-30:~$ sestatus > The program 'sestatus' is currently not installed. You can install it by > typing: > sudo apt install policycoreutils > > ----- Output from orted: > > user@b09-30:~$ /usr/bin/ssh -x b09-32 orted > [b09-32:197698] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file > ess_env_module.c at line 147 > [b09-32:197698] [[INVALID],INVALID] ORTE_ERROR_LOG: Bad parameter in file > util/session_dir.c at line 106 > [b09-32:197698] [[INVALID],INVALID] ORTE_ERROR_LOG: Bad parameter in file > util/session_dir.c at line 345 > [b09-32:197698] [[INVALID],INVALID] ORTE_ERROR_LOG: Bad parameter in file > base/ess_base_std_orted.c at line 270 > -------------------------------------------------------------------------- > It looks like orte_init failed for some reason; your parallel process is > likely to abort. There are many reasons that a parallel process can > fail during orte_init; some of which are due to configuration or > environment problems. This failure appears to be an internal failure; > here's some additional information (which may only be relevant to an > Open MPI developer): > > orte_session_dir failed > --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS > -------------------------------------------------------------------------- > > ----- iptables rules: > > user@b09-30:~$ sudo iptables -L > Chain INPUT (policy ACCEPT) > target prot opt source destination > ufw-before-logging-input all -- anywhere anywhere > ufw-before-input all -- anywhere anywhere > ufw-after-input all -- anywhere anywhere > ufw-after-logging-input all -- anywhere anywhere > ufw-reject-input all -- anywhere anywhere > ufw-track-input all -- anywhere anywhere > > Chain FORWARD (policy ACCEPT) > target prot opt source destination > ufw-before-logging-forward all -- anywhere anywhere > ufw-before-forward all -- anywhere anywhere > ufw-after-forward all -- anywhere anywhere > ufw-after-logging-forward all -- anywhere anywhere > ufw-reject-forward all -- anywhere anywhere > ufw-track-forward all -- anywhere anywhere > > Chain OUTPUT (policy ACCEPT) > target prot opt source destination > ufw-before-logging-output all -- anywhere anywhere > ufw-before-output all -- anywhere anywhere > ufw-after-output all -- anywhere anywhere > ufw-after-logging-output all -- anywhere anywhere > ufw-reject-output all -- anywhere anywhere > ufw-track-output all -- anywhere anywhere > > Chain ufw-after-forward (1 references) > target prot opt source destination > > Chain ufw-after-input (1 references) > target prot opt source destination > > Chain ufw-after-logging-forward (1 references) > target prot opt source destination > > Chain ufw-after-logging-input (1 references) > target prot opt source destination > > Chain ufw-after-logging-output (1 references) > target prot opt source destination > > Chain ufw-after-output (1 references) > target prot opt source destination > > Chain ufw-before-forward (1 references) > target prot opt source destination > > Chain ufw-before-input (1 references) > target prot opt source destination > > Chain ufw-before-logging-forward (1 references) > target prot opt source destination > > Chain ufw-before-logging-input (1 references) > target prot opt source destination > > Chain ufw-before-logging-output (1 references) > target prot opt source destination > > Chain ufw-before-output (1 references) > target prot opt source destination > > Chain ufw-reject-forward (1 references) > target prot opt source destination > > Chain ufw-reject-input (1 references) > target prot opt source destination > > Chain ufw-reject-output (1 references) > target prot opt source destination > > Chain ufw-track-forward (1 references) > target prot opt source destination > > Chain ufw-track-input (1 references) > target prot opt source destination > > Chain ufw-track-output (1 references) > target prot opt source destination > > > Thanks, > Max > > _______________________________________________ > users mailing list > users@lists.open-mpi.org > https://lists.open-mpi.org/mailman/listinfo/users >
_______________________________________________ users mailing list users@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/users