Hello guys,

I have an HPC cluster on which I recently updated OFED, firmware, etc.

After the reboot, running

  mpirun -machinefile nodes8 -n 128 /home/HPL/run_hpl/xhpl

shows the following error:

bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
This usually is caused by:

* not finding the required libraries and/or binaries on
  one or more nodes. Please check your PATH and LD_LIBRARY_PATH
  settings, or configure OMPI with --enable-orterun-prefix-by-default

* lack of authority to execute on one or more specified nodes.
  Please verify your allocation and authorities.

* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
  Please check with your sys admin to determine the correct location to use.

*  compilation of the orted with dynamic libraries when static are required
  (e.g., on Cray). Please check your configure cmd line and consider using
  one of the contrib/platform definitions for your system type.

* an inability to create a connection back to mpirun due to a
  lack of common network interfaces and/or no route found between
  them. Please check network connectivity (including firewalls
  and network routing requirements).



Before the update I had version 1.6.4, and the cluster showed no errors when
I ran mpirun.

I changed the environment variables (PATH and LD_LIBRARY_PATH), but the error
persists.
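
In case it helps narrow things down, a check along the following lines could show on which nodes orted is actually missing. This is only a sketch, not something taken from the run above: it assumes passwordless ssh to every host in nodes8, and a machinefile with one host per line, optionally followed by slots=N. The function names list_hosts and check_orted are made up for illustration.

```shell
#!/bin/sh
# Sketch: report whether orted exists under a given Open MPI prefix on each node.
# Assumptions (not from the original post): passwordless ssh works, and the
# machinefile lists one host per line, optionally followed by "slots=N";
# '#' starts a comment.

# Print the unique host names found in a machinefile.
list_hosts() {
    sed -e 's/#.*//' "$1" | awk 'NF { print $1 }' | sort -u
}

# Test for an executable <prefix>/bin/orted on every host in the machinefile.
# A host that cannot be reached over ssh is reported the same way as a host
# where the binary is missing.
check_orted() {
    machinefile=$1
    prefix=$2
    for host in $(list_hosts "$machinefile"); do
        if ssh -o BatchMode=yes "$host" "test -x '$prefix/bin/orted'" </dev/null; then
            echo "$host: ok"
        else
            echo "$host: orted not found (or host unreachable)"
        fi
    done
}

# Example call, using the path from the error message:
#   check_orted nodes8 /usr/mpi/gcc/openmpi-1.8.8
```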

Could anyone who has resolved this issue comment on how to fix it?

Regards

Sebastian Antunez
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users
