On a Cray XC (requiring aprun launcher to get from batch node to compute
node), 4.0.5 works but 4.1.1 and 4.1.6 do not (even on a single node). The
newer ones throw this:
--------------------------------------------------------------------------
An ORTE daemon has unexpectedly failed after launch and before
communicating back to mpirun. This could be caused by a number
of factors, including an inability to create a connection back
to mpirun due to a lack of common network interfaces and/or no
route found between them. Please check network connectivity
(including firewalls and network routing requirements).
--------------------------------------------------------------------------

On all 3 when I add -d to mpirun, they show aprun is being called. However,
the 2 newer versions add an invalid flag: -L. Doesn't matter if the -L is
followed by a batch node name or a compute node name.

4.0.5:
[batch7:78642] plm:alps:     aprun -n 1 -N 1 -cc none -e
PMI_NO_PREINITIALIZE=1 -e PMI_NO_FORK=1 -e OMPI_NO_USE_CRAY_PMI=1
orted -mca orte_debug 1 -mca ess_base_jobid 3787849728 -mca ess_base_vpid 1
-mca ess_base_num_procs 2 -mca orte_node_regex batch[1:7],[3:132]@0(2) -mca
orte_hnp_uri 3787849728.0;tcp://10.128.13.251:34149

4.1.1:
[batch7:75094] plm:alps:     aprun -n 1 -N 1 -cc none -e
PMI_NO_PREINITIALIZE=1 -e PMI_NO_FORK=1 -e OMPI_NO_USE_CRAY_PMI=1 -L batch7
orted -mca orte_debug 1 -mca ess_base_jobid 4154589184 -mca ess_base_vpid 1
-mca ess_base_num_procs 2 -mca orte_node_regex mpirun,batch[1:7]@0(2) -mca
orte_hnp_uri 4154589184.0;tcp://10.128.13.251:56589
aprun: -L node_list contains an invalid entry

4.1.6:
[batch20:43065] plm:alps:     aprun -n 1 -N 1 -cc none -e
PMI_NO_PREINITIALIZE=1 -e PMI_NO_FORK=1 -e OMPI_NO_USE_CRAY_PMI=1 -L
nid00140 orted -mca orte_debug 1 -mca ess_base_jobid 115474432 -mca
ess_base_vpid 1 -mca ess_base_num_procs 2 -mca orte_node_regex
batch[2:20],nid[5:140]@0(2) -mca orte_hnp_uri
115474432.0;tcp://10.128.1.39:51455
aprun: -L node_list contains an invalid entry

How can I get this -L argument removed?

Thanks, Chris

Attachment: smime.p7s
Description: S/MIME cryptographic signature

Reply via email to