Hi

today I have installed openmpi-1.6.4rc3r27923. Unfortunately I
still have a problem with rankfiles, if I start a process on a
remote machine.


tyr rankfiles 114  ssh linpc1 ompi_info | grep "Open MPI:"
                Open MPI: 1.6.4rc3r27923

tyr rankfiles 115 cat rf_linpc1
rank 0=linpc1 slot=0:0-1,1:0-1

tyr rankfiles 116 mpiexec -report-bindings -np 1 \
  -rf rf_linpc1 hostname
------------------------------------------------------------------
All nodes which are allocated for this job are already filled.
------------------------------------------------------------------


The following command still works.

tyr rankfiles 119 mpiexec -report-bindings -np 1 -host linpc1 \
  -cpus-per-proc 4 -bycore -bind-to-core hostname
[linpc1:32262] MCW rank 0 bound to socket 0[core 0-1]
  socket 1[core 0-1]: [B B][B B]
linpc1
tyr rankfiles 120 


Everything is fine, if I use the rankfile on the local machine.

linpc1 rankfiles 103 ompi_info | grep "Open MPI:"                               
               
 Open MPI: 1.6.4rc3r27923

linpc1 rankfiles 104 cat rf_linpc1
rank 0=linpc1 slot=0:0-1,1:0-1

linpc1 rankfiles 105 mpiexec -report-bindings -np 1 \
  -rf rf_linpc1 hostname
[linpc1:32385] MCW rank 0 bound to socket 0[core 0-1]
  socket 1[core 0-1]: [B B][B B] (slot list 0:0-1,1:0-1)
linpc1
linpc1 rankfiles 106


In my opinion it should also work if I start a process on a
remote machine. Can somebody look once more into this issue?
Thank you very much for your help in advance.


Kind regards

Siegmar

Reply via email to