Hi,

Just tried upgrading from 2.0.1 to 2.0.2 and I'm getting error messages that look like openmpi is using ssh to login to remote nodes instead of qrsh (see below). Has anyone else noticed gridengine integration being broken, or am I being dumb?

I built with "./configure --prefix=/apps/developers/libraries/openmpi/2.0.2/1/intel-17.0.1 --with-sge --with-io-romio-flags=--with-file-system=lustre+ufs --enable-mpi-cxx --with-cma"

Can see the gridengine component via:

$ ompi_info -a | grep gridengine
                 MCA ras: gridengine (MCA v2.1.0, API v2.0.0, Component v2.0.2)
      MCA ras gridengine: ---------------------------------------------------
      MCA ras gridengine: parameter "ras_gridengine_priority" (current value: 
"100", data source: default, level: 9 dev/all, type: int)
                          Priority of the gridengine ras component
      MCA ras gridengine: parameter "ras_gridengine_verbose" (current value: 
"0", data source: default, level: 9 dev/all, type: int)
                          Enable verbose output for the gridengine ras component
      MCA ras gridengine: parameter "ras_gridengine_show_jobid" (current value: 
"false", data source: default, level: 9 dev/all, type: bool)

Cheers,

Mark

ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
Permission denied, please try again.
ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
Permission denied, please try again.
ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
Permission denied (publickey,gssapi-keyex,gssapi-with-mic,password,hostbased).
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
This usually is caused by:

* not finding the required libraries and/or binaries on
  one or more nodes. Please check your PATH and LD_LIBRARY_PATH
  settings, or configure OMPI with --enable-orterun-prefix-by-default

* lack of authority to execute on one or more specified nodes.
  Please verify your allocation and authorities.

* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
  Please check with your sys admin to determine the correct location to use.

*  compilation of the orted with dynamic libraries when static are required
  (e.g., on Cray). Please check your configure cmd line and consider using
  one of the contrib/platform definitions for your system type.

* an inability to create a connection back to mpirun due to a
  lack of common network interfaces and/or no route found between
  them. Please check network connectivity (including firewalls
  and network routing requirements).
--------------------------------------------------------------------------

_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users

Reply via email to