On Feb 7, 2014, at 11:33 PM, Siegmar Gross 
<siegmar.gr...@informatik.hs-fulda.de> wrote:

> Hi,
> 
> today I tested rankfiles once more. The good news first: openmpi-1.7.4
> now supports my Sun M4000 server with Sparc VII processors on the
> command line.
> 
> rs0 openmpi_1.7.x_or_newer 104 mpiexec --report-bindings -np 4 \
>  --bind-to hwthread hostname
> [rs0.informatik.hs-fulda.de:06051] MCW rank 1 bound to
>  socket 0[core 1[hwt 0]]: [../B./../..][../../../..]
> [rs0.informatik.hs-fulda.de:06051] MCW rank 2 bound to
>  socket 1[core 4[hwt 0]]: [../../../..][B./../../..]
> [rs0.informatik.hs-fulda.de:06051] MCW rank 3 bound to
>  socket 1[core 5[hwt 0]]: [../../../..][../B./../..]
> [rs0.informatik.hs-fulda.de:06051] MCW rank 0 bound to
>  socket 0[core 0[hwt 0]]: [B./../../..][../../../..]
> rs0.informatik.hs-fulda.de
> rs0.informatik.hs-fulda.de
> rs0.informatik.hs-fulda.de
> rs0.informatik.hs-fulda.de
> rs0 openmpi_1.7.x_or_newer 105 
> 
> Thank you very much for solving this problem. Unfortunately I still
> have a problem with a rankfile. Contents of my rankfile:
> 
> rank 0=rs0 slot=0:0-7
> rank 1=rs0 slot=1
> rank 2=rs1 slot=0
> rank 3=rs1 slot=1
> 


Here's your problem - you told us socket 0, cores 0-7. However, if you look at 
your topology, you only have *4* cores in socket 0


> 
> rs0 openmpi_1.7.x_or_newer 105 mpiexec --report-bindings \
>  --use-hwthread-cpus -np 4 -rf rf_rs0_rs1 hostname
> [rs0.informatik.hs-fulda.de:06060] [[7659,0],0] ORTE_ERROR_LOG: Not
>  found in file
>  .../openmpi-1.7.4/orte/mca/rmaps/rank_file/rmaps_rank_file.c
>  at line 283
> [rs0.informatik.hs-fulda.de:06060] [[7659,0],0] ORTE_ERROR_LOG: Not
>  found in file
>  .../openmpi-1.7.4/orte/mca/rmaps/base/rmaps_base_map_job.c
>  at line 284
> rs0 openmpi_1.7.x_or_newer 106 
> 
> 
> rs0 openmpi_1.7.x_or_newer 110 mpiexec --report-bindings \
>  --display-allocation --mca rmaps_base_verbose_100 \
>  --use-hwthread-cpus -np 4 -rf rf_rs0_rs1 hostname
> 
> ======================   ALLOCATED NODES   ======================
>        rs0: slots=2 max_slots=0 slots_inuse=0
>        rs1: slots=2 max_slots=0 slots_inuse=0
> =================================================================
> [rs0.informatik.hs-fulda.de:06074] [[7677,0],0] ORTE_ERROR_LOG: Not found in 
> file 
> ../../../../../openmpi-1.7.4/orte/mca/rmaps/rank_file/rmaps_rank_file.c at 
> line 283
> [rs0.informatik.hs-fulda.de:06074] [[7677,0],0] ORTE_ERROR_LOG: Not found in 
> file 
> ../../../../openmpi-1.7.4/orte/mca/rmaps/base/rmaps_base_map_job.c at line 284
> rs0 openmpi_1.7.x_or_newer 111 
> 
> 
> rs0 openmpi_1.7.x_or_newer 111 mpiexec --report-bindings --display-allocation 
> --mca ess_base_verbose 5 --use-hwthread-cpus -np 
> 4 -rf rf_rs0_rs1 hostname
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Querying component 
> [env]
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Skipping component 
> [env]. Query failed to return a module
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Querying component 
> [hnp]
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Query of component 
> [hnp] set priority to 100
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Querying component 
> [singleton]
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Skipping component 
> [singleton]. Query failed to return a module
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Querying component 
> [tool]
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Skipping component 
> [tool]. Query failed to return a module
> [rs0.informatik.hs-fulda.de:06078] mca:base:select:(  ess) Selected component 
> [hnp]
> [rs0.informatik.hs-fulda.de:06078] [[INVALID],INVALID] Topology Info:
> [rs0.informatik.hs-fulda.de:06078] Type: Machine Number of child objects: 1
>        Name=NULL
>        total=33554432KB
>        Backend=Solaris
>        OSName=SunOS
>        OSRelease=5.10
>        OSVersion=Generic_150400-04
>        Architecture=sun4u
>        Cpuset:  0x0000ffff
>        Online:  0x0000ffff
>        Allowed: 0x0000ffff
>        Bind CPU proc:   TRUE
>        Bind CPU thread: TRUE
>        Bind MEM proc:   TRUE
>        Bind MEM thread: TRUE
>        Type: NUMANode Number of child objects: 2
>                Name=NULL
>                local=33554432KB
>                total=33554432KB
>                Cpuset:  0x0000ffff
>                Online:  0x0000ffff
>                Allowed: 0x0000ffff
>                Type: Socket Number of child objects: 4
>                        Name=NULL
>                        CPUType=sparcv9
>                        CPUModel=SPARC64_VII
>                        Cpuset:  0x000000ff
>                        Online:  0x000000ff
>                        Allowed: 0x000000ff
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000003
>                                Online:  0x00000003
>                                Allowed: 0x00000003
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000001
>                                        Online:  0x00000001
>                                        Allowed: 0x00000001
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000002
>                                        Online:  0x00000002
>                                        Allowed: 0x00000002
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x0000000c
>                                Online:  0x0000000c
>                                Allowed: 0x0000000c
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000004
>                                        Online:  0x00000004
>                                        Allowed: 0x00000004
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000008
>                                        Online:  0x00000008
>                                        Allowed: 0x00000008
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000030
>                                Online:  0x00000030
>                                Allowed: 0x00000030
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000010
>                                        Online:  0x00000010
>                                        Allowed: 0x00000010
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000020
>                                        Online:  0x00000020
>                                        Allowed: 0x00000020
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x000000c0
>                                Online:  0x000000c0
>                                Allowed: 0x000000c0
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000040
>                                        Online:  0x00000040
>                                        Allowed: 0x00000040
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000080
>                                        Online:  0x00000080
>                                        Allowed: 0x00000080
>                Type: Socket Number of child objects: 4
>                        Name=NULL
>                        CPUType=sparcv9
>                        CPUModel=SPARC64_VII
>                        Cpuset:  0x0000ff00
>                        Online:  0x0000ff00
>                        Allowed: 0x0000ff00
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000300
>                                Online:  0x00000300
>                                Allowed: 0x00000300
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000100
>                                        Online:  0x00000100
>                                        Allowed: 0x00000100
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000200
>                                        Online:  0x00000200
>                                        Allowed: 0x00000200
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000c00
>                                Online:  0x00000c00
>                                Allowed: 0x00000c00
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000400
>                                        Online:  0x00000400
>                                        Allowed: 0x00000400
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000800
>                                        Online:  0x00000800
>                                        Allowed: 0x00000800
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00003000
>                                Online:  0x00003000
>                                Allowed: 0x00003000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00001000
>                                        Online:  0x00001000
>                                        Allowed: 0x00001000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00002000
>                                        Online:  0x00002000
>                                        Allowed: 0x00002000
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x0000c000
>                                Online:  0x0000c000
>                                Allowed: 0x0000c000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00004000
>                                        Online:  0x00004000
>                                        Allowed: 0x00004000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00008000
>                                        Online:  0x00008000
>                                        Allowed: 0x00008000
> [rs1.informatik.hs-fulda.de:09657] mca:base:select:(  ess) Querying component 
> [env]
> [rs1.informatik.hs-fulda.de:09657] mca:base:select:(  ess) Query of component 
> [env] set priority to 20
> [rs1.informatik.hs-fulda.de:09657] mca:base:select:(  ess) Selected component 
> [env]
> [rs1.informatik.hs-fulda.de:09657] ess:env set name to [[7673,0],1]
> [rs1.informatik.hs-fulda.de:09657] [[7673,0],1] Topology Info:
> [rs1.informatik.hs-fulda.de:09657] Type: Machine Number of child objects: 1
>        Name=NULL
>        total=33554432KB
>        Backend=Solaris
>        OSName=SunOS
>        OSRelease=5.10
>        OSVersion=Generic_150400-04
>        Architecture=sun4u
>        Cpuset:  0x0000ffff
>        Online:  0x0000ffff
>        Allowed: 0x0000ffff
>        Bind CPU proc:   TRUE
>        Bind CPU thread: TRUE
>        Bind MEM proc:   TRUE
>        Bind MEM thread: TRUE
>        Type: NUMANode Number of child objects: 2
>                Name=NULL
>                local=33554432KB
>                total=33554432KB
>                Cpuset:  0x0000ffff
>                Online:  0x0000ffff
>                Allowed: 0x0000ffff
>                Type: Socket Number of child objects: 4
>                        Name=NULL
>                        CPUType=sparcv9
>                        CPUModel=SPARC64_VII
>                        Cpuset:  0x000000ff
>                        Online:  0x000000ff
>                        Allowed: 0x000000ff
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000003
>                                Online:  0x00000003
>                                Allowed: 0x00000003
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000001
>                                        Online:  0x00000001
>                                        Allowed: 0x00000001
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000002
>                                        Online:  0x00000002
>                                        Allowed: 0x00000002
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x0000000c
>                                Online:  0x0000000c
>                                Allowed: 0x0000000c
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000004
>                                        Online:  0x00000004
>                                        Allowed: 0x00000004
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000008
>                                        Online:  0x00000008
>                                        Allowed: 0x00000008
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000030
>                                Online:  0x00000030
>                                Allowed: 0x00000030
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000010
>                                        Online:  0x00000010
>                                        Allowed: 0x00000010
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000020
>                                        Online:  0x00000020
>                                        Allowed: 0x00000020
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x000000c0
>                                Online:  0x000000c0
>                                Allowed: 0x000000c0
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000040
>                                        Online:  0x00000040
>                                        Allowed: 0x00000040
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000080
>                                        Online:  0x00000080
>                                        Allowed: 0x00000080
>                Type: Socket Number of child objects: 4
>                        Name=NULL
>                        CPUType=sparcv9
>                        CPUModel=SPARC64_VII
>                        Cpuset:  0x0000ff00
>                        Online:  0x0000ff00
>                        Allowed: 0x0000ff00
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000300
>                                Online:  0x00000300
>                                Allowed: 0x00000300
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000100
>                                        Online:  0x00000100
>                                        Allowed: 0x00000100
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000200
>                                        Online:  0x00000200
>                                        Allowed: 0x00000200
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000c00
>                                Online:  0x00000c00
>                                Allowed: 0x00000c00
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000400
>                                        Online:  0x00000400
>                                        Allowed: 0x00000400
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000800
>                                        Online:  0x00000800
>                                        Allowed: 0x00000800
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00003000
>                                Online:  0x00003000
>                                Allowed: 0x00003000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00001000
>                                        Online:  0x00001000
>                                        Allowed: 0x00001000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00002000
>                                        Online:  0x00002000
>                                        Allowed: 0x00002000
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x0000c000
>                                Online:  0x0000c000
>                                Allowed: 0x0000c000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00004000
>                                        Online:  0x00004000
>                                        Allowed: 0x00004000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00008000
>                                        Online:  0x00008000
>                                        Allowed: 0x00008000
> 
> ======================   ALLOCATED NODES   ======================
>        rs0: slots=2 max_slots=0 slots_inuse=0
>        rs1: slots=2 max_slots=0 slots_inuse=0
> =================================================================
> [rs0.informatik.hs-fulda.de:06078] [[7673,0],0] ORTE_ERROR_LOG: Not found in 
> file 
> ../../../../../openmpi-1.7.4/orte/mca/rmaps/rank_file/rmaps_rank_file.c at 
> line 283
> [rs0.informatik.hs-fulda.de:06078] [[7673,0],0] ORTE_ERROR_LOG: Not found in 
> file 
> ../../../../openmpi-1.7.4/orte/mca/rmaps/base/rmaps_base_map_job.c at line 284
> [rs1.informatik.hs-fulda.de:09657] [[7673,0],1] setting up session dir with
>        tmpdir: UNDEF
>        host rs1
> rs0 openmpi_1.7.x_or_newer 112 
> 
> 
> 
> 
> rs0 openmpi_1.7.x_or_newer 113 mpiexec --report-bindings --display-allocation 
> --mca plm_base_verbose 100 --use-hwthread-cpus 
> -np 4 -rf rf_rs0_rs1 hostname
> [rs0.informatik.hs-fulda.de:06088] mca: base: components_register: 
> registering plm components
> [rs0.informatik.hs-fulda.de:06088] mca: base: components_register: found 
> loaded component rsh
> [rs0.informatik.hs-fulda.de:06088] mca: base: components_register: component 
> rsh register function successful
> [rs0.informatik.hs-fulda.de:06088] mca: base: components_open: opening plm 
> components
> [rs0.informatik.hs-fulda.de:06088] mca: base: components_open: found loaded 
> component rsh
> [rs0.informatik.hs-fulda.de:06088] mca: base: components_open: component rsh 
> open function successful
> [rs0.informatik.hs-fulda.de:06088] mca:base:select: Auto-selecting plm 
> components
> [rs0.informatik.hs-fulda.de:06088] mca:base:select:(  plm) Querying component 
> [rsh]
> [rs0.informatik.hs-fulda.de:06088] [[INVALID],INVALID] plm:rsh_lookup on 
> agent ssh : rsh path NULL
> [rs0.informatik.hs-fulda.de:06088] mca:base:select:(  plm) Query of component 
> [rsh] set priority to 10
> [rs0.informatik.hs-fulda.de:06088] mca:base:select:(  plm) Selected component 
> [rsh]
> [rs0.informatik.hs-fulda.de:06088] plm:base:set_hnp_name: initial bias 6088 
> nodename hash 3909477186
> [rs0.informatik.hs-fulda.de:06088] plm:base:set_hnp_name: final jobfam 7567
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh_setup on agent ssh : 
> rsh path NULL
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:receive start comm
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:setup_job
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:setup_vm
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:setup_vm creating map
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] setup:vm: working unmanaged 
> allocation
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] using rankfile rf_rs0_rs1
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] checking node rs0
> 
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] ignoring myself
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] checking node rs1
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:setup_vm add new 
> daemon [[7567,0],1]
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:setup_vm assigning 
> new daemon [[7567,0],1] to node rs1
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: launching vm
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: local shell: 2 (tcsh)
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: assuming same remote 
> shell as local shell
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: remote shell: 2 
> (tcsh)
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: final template argv:
>        /usr/local/bin/ssh <template>  orted -mca orte_report_bindings 1 -mca 
> ess env -mca orte_ess_jobid 495910912 -mca 
> orte_ess_vpid <template> -mca orte_ess_num_procs 2 -mca orte_hnp_uri 
> "495910912.0;tcp://193.174.26.198,192.168.128.1,10.1.1.2:43810" --tree-spawn 
> --mca plm_base_verbose 100 -mca plm rsh -mca 
> orte_rankfile rf_rs0_rs1 -mca hwloc_base_use_hwthreads_as_cpus 1 -mca 
> orte_display_alloc 1 -mca hwloc_base_report_bindings 1
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh:launch daemon 0 not a 
> child of mine
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: adding node rs1 to 
> launch list
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: activating launch 
> event
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: recording launch of 
> daemon [[7567,0],1]
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:rsh: executing: 
> (/usr/local/bin/ssh) [/usr/local/bin/ssh rs1  orted -mca 
> orte_report_bindings 1 -mca ess env -mca orte_ess_jobid 495910912 -mca 
> orte_ess_vpid 1 -mca orte_ess_num_procs 2 -mca 
> orte_hnp_uri "495910912.0;tcp://193.174.26.198,192.168.128.1,10.1.1.2:43810" 
> --tree-spawn --mca plm_base_verbose 100 -mca plm 
> rsh -mca orte_rankfile rf_rs0_rs1 -mca hwloc_base_use_hwthreads_as_cpus 1 
> -mca orte_display_alloc 1 -mca 
> hwloc_base_report_bindings 1]
> Warning: untrusted X11 forwarding setup failed: xauth key data not generated
> Warning: No xauth data; using fake authentication data for X11 forwarding.
> [rs1.informatik.hs-fulda.de:09721] mca: base: components_register: 
> registering plm components
> [rs1.informatik.hs-fulda.de:09721] mca: base: components_register: found 
> loaded component rsh
> [rs1.informatik.hs-fulda.de:09721] mca: base: components_register: component 
> rsh register function successful
> [rs1.informatik.hs-fulda.de:09721] mca: base: components_open: opening plm 
> components
> [rs1.informatik.hs-fulda.de:09721] mca: base: components_open: found loaded 
> component rsh
> [rs1.informatik.hs-fulda.de:09721] mca: base: components_open: component rsh 
> open function successful
> [rs1.informatik.hs-fulda.de:09721] mca:base:select: Auto-selecting plm 
> components
> [rs1.informatik.hs-fulda.de:09721] mca:base:select:(  plm) Querying component 
> [rsh]
> [rs1.informatik.hs-fulda.de:09721] [[7567,0],1] plm:rsh_lookup on agent ssh : 
> rsh path NULL
> [rs1.informatik.hs-fulda.de:09721] mca:base:select:(  plm) Query of component 
> [rsh] set priority to 10
> [rs1.informatik.hs-fulda.de:09721] mca:base:select:(  plm) Selected component 
> [rsh]
> [rs1.informatik.hs-fulda.de:09721] [[7567,0],1] plm:rsh_setup on agent ssh : 
> rsh path NULL
> [rs1.informatik.hs-fulda.de:09721] [[7567,0],1] plm:base:receive start comm
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:orted_report_launch 
> from daemon [[7567,0],1]
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:orted_report_launch 
> from daemon [[7567,0],1] on node rs1
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] RECEIVED TOPOLOGY FROM NODE 
> rs1
> [rs0.informatik.hs-fulda.de:06088] Type: Machine Number of child objects: 1
>        Name=NULL
>        total=33554432KB
>        Backend=Solaris
>        OSName=SunOS
>        OSRelease=5.10
>        OSVersion=Generic_150400-04
>        Architecture=sun4u
>        Cpuset:  0x0000ffff
>        Online:  0x0000ffff
>        Allowed: 0x0000ffff
>        Bind CPU proc:   TRUE
>        Bind CPU thread: TRUE
>        Bind MEM proc:   TRUE
>        Bind MEM thread: TRUE
>        Type: NUMANode Number of child objects: 2
>                Name=NULL
>                local=33554432KB
>                total=33554432KB
>                Cpuset:  0x0000ffff
>                Online:  0x0000ffff
>                Allowed: 0x0000ffff
>                Type: Socket Number of child objects: 4
>                        Name=NULL
>                        CPUType=sparcv9
>                        CPUModel=SPARC64_VII
>                        Cpuset:  0x000000ff
>                        Online:  0x000000ff
>                        Allowed: 0x000000ff
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000003
>                                Online:  0x00000003
>                                Allowed: 0x00000003
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000001
>                                        Online:  0x00000001
>                                        Allowed: 0x00000001
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000002
>                                        Online:  0x00000002
>                                        Allowed: 0x00000002
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x0000000c
>                                Online:  0x0000000c
>                                Allowed: 0x0000000c
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000004
>                                        Online:  0x00000004
>                                        Allowed: 0x00000004
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000008
>                                        Online:  0x00000008
>                                        Allowed: 0x00000008
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000030
>                                Online:  0x00000030
>                                Allowed: 0x00000030
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000010
>                                        Online:  0x00000010
>                                        Allowed: 0x00000010
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000020
>                                        Online:  0x00000020
>                                        Allowed: 0x00000020
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x000000c0
>                                Online:  0x000000c0
>                                Allowed: 0x000000c0
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000040
>                                        Online:  0x00000040
>                                        Allowed: 0x00000040
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000080
>                                        Online:  0x00000080
>                                        Allowed: 0x00000080
>                Type: Socket Number of child objects: 4
>                        Name=NULL
>                        CPUType=sparcv9
>                        CPUModel=SPARC64_VII
>                        Cpuset:  0x0000ff00
>                        Online:  0x0000ff00
>                        Allowed: 0x0000ff00
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000300
>                                Online:  0x00000300
>                                Allowed: 0x00000300
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000100
>                                        Online:  0x00000100
>                                        Allowed: 0x00000100
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000200
>                                        Online:  0x00000200
>                                        Allowed: 0x00000200
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00000c00
>                                Online:  0x00000c00
>                                Allowed: 0x00000c00
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000400
>                                        Online:  0x00000400
>                                        Allowed: 0x00000400
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00000800
>                                        Online:  0x00000800
>                                        Allowed: 0x00000800
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x00003000
>                                Online:  0x00003000
>                                Allowed: 0x00003000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00001000
>                                        Online:  0x00001000
>                                        Allowed: 0x00001000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00002000
>                                        Online:  0x00002000
>                                        Allowed: 0x00002000
>                        Type: Core Number of child objects: 2
>                                Name=NULL
>                                Cpuset:  0x0000c000
>                                Online:  0x0000c000
>                                Allowed: 0x0000c000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00004000
>                                        Online:  0x00004000
>                                        Allowed: 0x00004000
>                                Type: PU Number of child objects: 0
>                                        Name=NULL
>                                        Cpuset:  0x00008000
>                                        Online:  0x00008000
>                                        Allowed: 0x00008000
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] TOPOLOGY MATCHES - DISCARDING
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:orted_report_launch 
> completed for daemon [[7567,0],1] at contact 
> 495910912.1;tcp://193.174.26.199,192.168.128.2,10.1.1.2:37231
> 
> ======================   ALLOCATED NODES   ======================
>        rs0: slots=2 max_slots=0 slots_inuse=0
>        rs1: slots=2 max_slots=0 slots_inuse=0
> =================================================================
> [rs1.informatik.hs-fulda.de:09721] [[7567,0],1] plm:rsh: remote spawn called
> [rs1.informatik.hs-fulda.de:09721] [[7567,0],1] plm:rsh: remote spawn - have 
> no children!
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] ORTE_ERROR_LOG: Not found in 
> file 
> ../../../../../openmpi-1.7.4/orte/mca/rmaps/rank_file/rmaps_rank_file.c at 
> line 283
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] ORTE_ERROR_LOG: Not found in 
> file 
> ../../../../openmpi-1.7.4/orte/mca/rmaps/base/rmaps_base_map_job.c at line 284
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:orted_cmd sending 
> orted_exit commands
> [rs0.informatik.hs-fulda.de:06088] [[7567,0],0] plm:base:receive stop comm
> [rs0.informatik.hs-fulda.de:06088] mca: base: close: component rsh closed
> [rs0.informatik.hs-fulda.de:06088] mca: base: close: unloading component rsh
> [rs1.informatik.hs-fulda.de:09721] [[7567,0],1] plm:base:receive stop comm
> [rs1.informatik.hs-fulda.de:09721] mca: base: close: component rsh closed
> [rs1.informatik.hs-fulda.de:09721] mca: base: close: unloading component rsh
> rs0 openmpi_1.7.x_or_newer 114 
> 
> 
> 
> 
> I still have the problem that I get no output if I mix little and
> big endian machines, which works for openmpi-1.6.x.
> 
> linpc1 openmpi_1.7.x_or_newer 112 mpiexec -report-bindings -np 4 \
>  -rf rf_linpc_sunpc_tyr hostname
> linpc1 openmpi_1.7.x_or_newer 113 
> 
> 
> 
> linpc1 openmpi_1.7.x_or_newer 188 mpiexec -report-bindings 
> --display-allocation --mca plm_base_verbose 100 -np 1 -rf 
> rf_linpc_sunpc_tyr hostname
> [linpc1:20650] mca: base: components_register: registering plm components
> [linpc1:20650] mca: base: components_register: found loaded component rsh
> [linpc1:20650] mca: base: components_register: component rsh register 
> function successful
> [linpc1:20650] mca: base: components_register: found loaded component slurm
> [linpc1:20650] mca: base: components_register: component slurm register 
> function successful
> [linpc1:20650] mca: base: components_open: opening plm components
> [linpc1:20650] mca: base: components_open: found loaded component rsh
> [linpc1:20650] mca: base: components_open: component rsh open function 
> successful
> [linpc1:20650] mca: base: components_open: found loaded component slurm
> [linpc1:20650] mca: base: components_open: component slurm open function 
> successful
> [linpc1:20650] mca:base:select: Auto-selecting plm components
> [linpc1:20650] mca:base:select:(  plm) Querying component [rsh]
> [linpc1:20650] [[INVALID],INVALID] plm:rsh_lookup on agent ssh : rsh path NULL
> [linpc1:20650] mca:base:select:(  plm) Query of component [rsh] set priority 
> to 10
> [linpc1:20650] mca:base:select:(  plm) Querying component [slurm]
> [linpc1:20650] mca:base:select:(  plm) Skipping component [slurm]. Query 
> failed to return a module
> [linpc1:20650] mca:base:select:(  plm) Selected component [rsh]
> [linpc1:20650] mca: base: close: component slurm closed
> [linpc1:20650] mca: base: close: unloading component slurm
> [linpc1:20650] plm:base:set_hnp_name: initial bias 20650 nodename hash 
> 3902177415
> [linpc1:20650] plm:base:set_hnp_name: final jobfam 14523
> [linpc1:20650] [[14523,0],0] plm:rsh_setup on agent ssh : rsh path NULL
> [linpc1:20650] [[14523,0],0] plm:base:receive start comm
> [linpc1:20650] [[14523,0],0] plm:base:setup_job
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm creating map
> [linpc1:20650] [[14523,0],0] setup:vm: working unmanaged allocation
> [linpc1:20650] [[14523,0],0] using rankfile rf_linpc_sunpc_tyr
> [linpc1:20650] [[14523,0],0] checking node linpc0
> [linpc1:20650] [[14523,0],0] checking node linpc1
> [linpc1:20650] [[14523,0],0] ignoring myself
> [linpc1:20650] [[14523,0],0] checking node sunpc1
> [linpc1:20650] [[14523,0],0] checking node tyr
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm add new daemon [[14523,0],1]
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm assigning new daemon 
> [[14523,0],1] to node linpc0
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm add new daemon [[14523,0],2]
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm assigning new daemon 
> [[14523,0],2] to node sunpc1
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm add new daemon [[14523,0],3]
> [linpc1:20650] [[14523,0],0] plm:base:setup_vm assigning new daemon 
> [[14523,0],3] to node tyr
> [linpc1:20650] [[14523,0],0] plm:rsh: launching vm
> [linpc1:20650] [[14523,0],0] plm:rsh: local shell: 2 (tcsh)
> [linpc1:20650] [[14523,0],0] plm:rsh: assuming same remote shell as local 
> shell
> [linpc1:20650] [[14523,0],0] plm:rsh: remote shell: 2 (tcsh)
> [linpc1:20650] [[14523,0],0] plm:rsh: final template argv:
>        /usr/local/bin/ssh <template>  orted -mca orte_report_bindings 1 -mca 
> ess env -mca orte_ess_jobid 951779328 -mca 
> orte_ess_vpid <template> -mca orte_ess_num_procs 4 -mca orte_hnp_uri 
> "951779328.0;tcp://193.174.26.208:46876" --tree-spawn 
> --mca plm_base_verbose 100 -mca plm rsh -mca hwloc_base_report_bindings 1 
> -mca orte_display_alloc 1 -mca orte_rankfile 
> rf_linpc_sunpc_tyr
> [linpc1:20650] [[14523,0],0] plm:rsh:launch daemon 0 not a child of mine
> [linpc1:20650] [[14523,0],0] plm:rsh: adding node linpc0 to launch list
> [linpc1:20650] [[14523,0],0] plm:rsh: adding node sunpc1 to launch list
> [linpc1:20650] [[14523,0],0] plm:rsh:launch daemon 3 not a child of mine
> [linpc1:20650] [[14523,0],0] plm:rsh: activating launch event
> [linpc1:20650] [[14523,0],0] plm:rsh: recording launch of daemon [[14523,0],1]
> [linpc1:20650] [[14523,0],0] plm:rsh: recording launch of daemon [[14523,0],2]
> [linpc1:20650] [[14523,0],0] plm:rsh: executing: (/usr/local/bin/ssh) 
> [/usr/local/bin/ssh sunpc1  orted -mca 
> orte_report_bindings 1 -mca ess env -mca orte_ess_jobid 951779328 -mca 
> orte_ess_vpid 2 -mca orte_ess_num_procs 4 -mca 
> orte_hnp_uri "951779328.0;tcp://193.174.26.208:46876" --tree-spawn --mca 
> plm_base_verbose 100 -mca plm rsh -mca 
> hwloc_base_report_bindings 1 -mca orte_display_alloc 1 -mca orte_rankfile 
> rf_linpc_sunpc_tyr]
> [linpc1:20650] [[14523,0],0] plm:rsh: executing: (/usr/local/bin/ssh) 
> [/usr/local/bin/ssh linpc0  orted -mca 
> orte_report_bindings 1 -mca ess env -mca orte_ess_jobid 951779328 -mca 
> orte_ess_vpid 1 -mca orte_ess_num_procs 4 -mca 
> orte_hnp_uri "951779328.0;tcp://193.174.26.208:46876" --tree-spawn --mca 
> plm_base_verbose 100 -mca plm rsh -mca 
> hwloc_base_report_bindings 1 -mca orte_display_alloc 1 -mca orte_rankfile 
> rf_linpc_sunpc_tyr]
> Warning: untrusted X11 forwarding setup failed: xauth key data not generated
> Warning: No xauth data; using fake authentication data for X11 forwarding.
> X11 forwarding request failed on channel 0
> Warning: untrusted X11 forwarding setup failed: xauth key data not generated
> Warning: No xauth data; using fake authentication data for X11 forwarding.
> [sunpc1:09408] mca: base: components_register: registering plm components
> [sunpc1:09408] mca: base: components_register: found loaded component rsh
> [sunpc1:09408] mca: base: components_register: component rsh register 
> function successful
> [sunpc1:09408] mca: base: components_open: opening plm components
> [sunpc1:09408] mca: base: components_open: found loaded component rsh
> [sunpc1:09408] mca: base: components_open: component rsh open function 
> successful
> [sunpc1:09408] mca:base:select: Auto-selecting plm components
> [sunpc1:09408] mca:base:select:(  plm) Querying component [rsh]
> [sunpc1:09408] [[14523,0],2] plm:rsh_lookup on agent ssh : rsh path NULL
> [sunpc1:09408] mca:base:select:(  plm) Query of component [rsh] set priority 
> to 10
> [sunpc1:09408] mca:base:select:(  plm) Selected component [rsh]
> [sunpc1:09408] [[14523,0],2] plm:rsh_setup on agent ssh : rsh path NULL
> [sunpc1:09408] [[14523,0],2] plm:base:receive start comm
> [linpc1:20650] [[14523,0],0] plm:base:orted_report_launch from daemon 
> [[14523,0],2]
> [linpc1:20650] [[14523,0],0] plm:base:orted_report_launch from daemon 
> [[14523,0],2] on node sunpc1
> [linpc1:20650] [[14523,0],0] plm:base:orted_report_launch completed for 
> daemon [[14523,0],2] at contact 
> 951779328.2;tcp://193.174.26.210:33215
> [sunpc1:09408] [[14523,0],2] plm:rsh: remote spawn called
> [sunpc1:09408] [[14523,0],2] plm:rsh: remote spawn - have no children!
> [linpc0:32306] mca: base: components_register: registering plm components
> [linpc0:32306] mca: base: components_register: found loaded component rsh
> [linpc0:32306] mca: base: components_register: component rsh register 
> function successful
> [linpc0:32306] mca: base: components_open: opening plm components
> [linpc0:32306] mca: base: components_open: found loaded component rsh
> [linpc0:32306] mca: base: components_open: component rsh open function 
> successful
> [linpc0:32306] mca:base:select: Auto-selecting plm components
> [linpc0:32306] mca:base:select:(  plm) Querying component [rsh]
> [linpc0:32306] [[14523,0],1] plm:rsh_lookup on agent ssh : rsh path NULL
> [linpc0:32306] mca:base:select:(  plm) Query of component [rsh] set priority 
> to 10
> [linpc0:32306] mca:base:select:(  plm) Selected component [rsh]
> [linpc0:32306] [[14523,0],1] plm:rsh_setup on agent ssh : rsh path NULL
> [linpc0:32306] [[14523,0],1] plm:base:receive start comm
> [linpc1:20650] [[14523,0],0] plm:base:orted_report_launch from daemon 
> [[14523,0],1]
> [linpc1:20650] [[14523,0],0] plm:base:orted_report_launch from daemon 
> [[14523,0],1] on node linpc0
> [linpc1:20650] [[14523,0],0] RECEIVED TOPOLOGY FROM NODE linpc0
> [linpc1:20650] Type: Machine Number of child objects: 2
>        Name=NULL
>        total=8387048KB
>        DMIProductName="Sun Ultra 40 Workstation"
>        DMIProductVersion=11
>        DMIBoardVendor="Sun Microsystems"
>        DMIBoardName="Sun Ultra 40 Workstation"
>        DMIBoardVersion=50
>        DMIBoardAssetTag=
>        DMIChassisVendor="Sun Microsystems"
>        DMIChassisType=17
>        DMIChassisVersion=01
>        DMIChassisAssetTag=
>        DMIBIOSVendor="Phoenix Technologies Ltd."
>        DMIBIOSVersion="1.70  "
>        DMIBIOSDate=02/15/2008
>        DMISysVendor="Sun Microsystems"
>        Backend=Linux
>        OSName=Linux
>        OSRelease=3.1.10-1.16-desktop
>        OSVersion="#1 SMP PREEMPT Wed Jun 27 05:21:40 UTC 2012 (d016078)"
>        Architecture=x86_64
>        Cpuset:  0x0000000f
>        Online:  0x0000000f
>        Allowed: 0x0000000f
>        Bind CPU proc:   TRUE
>        Bind CPU thread: TRUE
>        Bind MEM proc:   FALSE
>        Bind MEM thread: TRUE
>        Type: NUMANode Number of child objects: 2
>                Name=NULL
>                local=4192744KB
>                total=4192744KB
>                Cpuset:  0x00000003
>                Online:  0x00000003
>                Allowed: 0x00000003
>                Type: Socket Number of child objects: 2
>                        Name=NULL
>                        CPUModel="Dual Core AMD Opteron(tm) Processor 280"
>                        Cpuset:  0x00000003
>                        Online:  0x00000003
>                        Allowed: 0x00000003
>                        Type: L2Cache Number of child objects: 1
>                                Name=NULL
>                                size=1024KB
>                                linesize=64
>                                ways=16
>                                Cpuset:  0x00000001
>                                Online:  0x00000001
>                                Allowed: 0x00000001
>                                Type: L1dCache Number of child objects: 1
>                                        Name=NULL
>                                        size=64KB
>                                        linesize=64
>                                        ways=2
>                                        Cpuset:  0x00000001
>                                        Online:  0x00000001
>                                        Allowed: 0x00000001
>                                        Type: Core Number of child objects: 1
>                                                Name=NULL
>                                                Cpuset:  0x00000001
>                                                Online:  0x00000001
>                                                Allowed: 0x00000001
>                                                Type: PU Number of child 
> objects: 0
>                                                        Name=NULL
>                                                        Cpuset:  0x00000001
>                                                        Online:  0x00000001
>                                                        Allowed: 0x00000001
>                        Type: L2Cache Number of child objects: 1
>                                Name=NULL
>                                size=1024KB
>                                linesize=64
>                                ways=16
>                                Cpuset:  0x00000002
>                                Online:  0x00000002
>                                Allowed: 0x00000002
>                                Type: L1dCache Number of child objects: 1
>                                        Name=NULL
>                                        size=64KB
>                                        linesize=64
>                                        ways=2
>                                        Cpuset:  0x00000002
>                                        Online:  0x00000002
>                                        Allowed: 0x00000002
>                                        Type: Core Number of child objects: 1
>                                                Name=NULL
>                                                Cpuset:  0x00000002
>                                                Online:  0x00000002
>                                                Allowed: 0x00000002
>                                                Type: PU Number of child 
> objects: 0
>                                                        Name=NULL
>                                                        Cpuset:  0x00000002
>                                                        Online:  0x00000002
>                                                        Allowed: 0x00000002
>                Type: Bridge Host->PCI Number of child objects: 4
>                        Name=NULL
>                        buses=0000:[00-03]
>                        Type: PCI 10de:0053 Number of child objects: 1
>                                Name=nVidia Corporation CK804 IDE
>                                busid=0000:00:06.0
>                                class=0101(IDE)
>                                PCIVendor="nVidia Corporation"
>                                PCIDevice="CK804 IDE"
>                                Type: Block Number of child objects: 0
>                                        Name=sr0
>                        Type: PCI 10de:0055 Number of child objects: 1
>                                Name=nVidia Corporation CK804 Serial ATA 
> Controller
>                                busid=0000:00:07.0
>                                class=0101(IDE)
>                                PCIVendor="nVidia Corporation"
>                                PCIDevice="CK804 Serial ATA Controller"
>                                Type: Block Number of child objects: 0
>                                        Name=sda
>                        Type: PCI 10de:0054 Number of child objects: 0
>                                Name=nVidia Corporation CK804 Serial ATA 
> Controller
>                                busid=0000:00:08.0
>                                class=0101(IDE)
>                                PCIVendor="nVidia Corporation"
>                                PCIDevice="CK804 Serial ATA Controller"
>                        Type: PCI 10de:029d Number of child objects: 2
>                                Name=nVidia Corporation G71GL [Quadro FX 3500]
>                                busid=0000:03:00.0
>                                class=0300(VGA)
>                                PCIVendor="nVidia Corporation"
>                                PCIDevice="G71GL [Quadro FX 3500]"
>                                Type: GPU Number of child objects: 0
>                                        Name=controlD64
>                                Type: GPU Number of child objects: 0
>                                        Name=card0
>        Type: NUMANode Number of child objects: 2
>                Name=NULL
>                local=4194304KB
>                total=4194304KB
>                Cpuset:  0x0000000c
>                Online:  0x0000000c
>                Allowed: 0x0000000c
>                Type: Socket Number of child objects: 2
>                        Name=NULL
>                        CPUModel="Dual Core AMD Opteron(tm) Processor 280"
>                        Cpuset:  0x0000000c
>                        Online:  0x0000000c
>                        Allowed: 0x0000000c
>                        Type: L2Cache Number of child objects: 1
>                                Name=NULL
>                                size=1024KB
>                                linesize=64
>                                ways=16
>                                Cpuset:  0x00000004
>                                Online:  0x00000004
>                                Allowed: 0x00000004
>                                Type: L1dCache Number of child objects: 1
>                                        Name=NULL
>                                        size=64KB
>                                        linesize=64
>                                        ways=2
>                                        Cpuset:  0x00000004
>                                        Online:  0x00000004
>                                        Allowed: 0x00000004
>                                        Type: Core Number of child objects: 1
>                                                Name=NULL
>                                                Cpuset:  0x00000004
>                                                Online:  0x00000004
>                                                Allowed: 0x00000004
>                                                Type: PU Number of child 
> objects: 0
>                                                        Name=NULL
>                                                        Cpuset:  0x00000004
>                                                        Online:  0x00000004
>                                                        Allowed: 0x00000004
>                        Type: L2Cache Number of child objects: 1
>                                Name=NULL
>                                size=1024KB
>                                linesize=64
>                                ways=16
>                                Cpuset:  0x00000008
>                                Online:  0x00000008
>                                Allowed: 0x00000008
>                                Type: L1dCache Number of child objects: 1
>                                        Name=NULL
>                                        size=64KB
>                                        linesize=64
>                                        ways=2
>                                        Cpuset:  0x00000008
>                                        Online:  0x00000008
>                                        Allowed: 0x00000008
>                                        Type: Core Number of child objects: 1
>                                                Name=NULL
>                                                Cpuset:  0x00000008
>                                                Online:  0x00000008
>                                                Allowed: 0x00000008
>                                                Type: PU Number of child 
> objects: 0
>                                                        Name=NULL
>                                                        Cpuset:  0x00000008
>                                                        Online:  0x00000008
>                                                        Allowed: 0x00000008
>                Type: Bridge Host->PCI Number of child objects: 2
>                        Name=NULL
>                        buses=0000:[80-82]
>                        Type: PCI 10de:0054 Number of child objects: 0
>                                Name=nVidia Corporation CK804 Serial ATA 
> Controller
>                                busid=0000:80:07.0
>                                class=0101(IDE)
>                                PCIVendor="nVidia Corporation"
>                                PCIDevice="CK804 Serial ATA Controller"
>                        Type: PCI 10de:0055 Number of child objects: 0
>                                Name=nVidia Corporation CK804 Serial ATA 
> Controller
>                                busid=0000:80:08.0
>                                class=0101(IDE)
>                                PCIVendor="nVidia Corporation"
>                                PCIDevice="CK804 Serial ATA Controller"
> [linpc1:20650] [[14523,0],0] NEW TOPOLOGY - ADDING
> [linpc1:20650] [[14523,0],0] plm:base:orted_report_launch completed for 
> daemon [[14523,0],1] at contact 
> 951779328.1;tcp://193.174.26.214,192.168.1.1:57891
> [linpc0:32306] [[14523,0],1] plm:rsh: remote spawn called
> [linpc0:32306] [[14523,0],1] plm:rsh: local shell: 2 (tcsh)
> [linpc0:32306] [[14523,0],1] plm:rsh: assuming same remote shell as local 
> shell
> [linpc0:32306] [[14523,0],1] plm:rsh: remote shell: 2 (tcsh)
> [linpc0:32306] [[14523,0],1] plm:rsh: final template argv:
>        /usr/local/bin/ssh <template>  orted -mca orte_report_bindings 1 -mca 
> ess env -mca orte_ess_jobid 951779328 -mca 
> orte_ess_vpid <template> -mca orte_ess_num_procs 4 -mca orte_parent_uri 
> "951779328.1;tcp://193.174.26.214,192.168.1.1:57891" 
> -mca orte_hnp_uri "951779328.0;tcp://193.174.26.208:46876" --mca 
> plm_base_verbose 100 -mca hwloc_base_report_bindings 1 -mca 
> orte_display_alloc 1 -mca orte_rankfile rf_linpc_sunpc_tyr -mca plm rsh
> [linpc0:32306] [[14523,0],1] plm:rsh: activating launch event
> [linpc0:32306] [[14523,0],1] plm:rsh: recording launch of daemon [[14523,0],3]
> [linpc0:32306] [[14523,0],1] plm:rsh: executing: (/usr/local/bin/ssh) 
> [/usr/local/bin/ssh tyr  orted -mca orte_report_bindings 
> 1 -mca ess env -mca orte_ess_jobid 951779328 -mca orte_ess_vpid 3 -mca 
> orte_ess_num_procs 4 -mca orte_parent_uri 
> "951779328.1;tcp://193.174.26.214,192.168.1.1:57891" -mca orte_hnp_uri 
> "951779328.0;tcp://193.174.26.208:46876" --mca 
> plm_base_verbose 100 -mca hwloc_base_report_bindings 1 -mca 
> orte_display_alloc 1 -mca orte_rankfile rf_linpc_sunpc_tyr -mca 
> plm rsh --tree-spawn]
> Warning: untrusted X11 forwarding setup failed: xauth key data not generated
> Warning: No xauth data; using fake authentication data for X11 forwarding.
> [tyr.informatik.hs-fulda.de:23227] mca: base: components_register: 
> registering plm components
> [tyr.informatik.hs-fulda.de:23227] mca: base: components_register: found 
> loaded component rsh
> [tyr.informatik.hs-fulda.de:23227] mca: base: components_register: component 
> rsh register function successful
> [tyr.informatik.hs-fulda.de:23227] mca: base: components_open: opening plm 
> components
> [tyr.informatik.hs-fulda.de:23227] mca: base: components_open: found loaded 
> component rsh
> [tyr.informatik.hs-fulda.de:23227] mca: base: components_open: component rsh 
> open function successful
> [tyr.informatik.hs-fulda.de:23227] mca:base:select: Auto-selecting plm 
> components
> [tyr.informatik.hs-fulda.de:23227] mca:base:select:(  plm) Querying component 
> [rsh]
> [tyr.informatik.hs-fulda.de:23227] [[14523,0],3] plm:rsh_lookup on agent ssh 
> : rsh path NULL
> [tyr.informatik.hs-fulda.de:23227] mca:base:select:(  plm) Query of component 
> [rsh] set priority to 10
> [tyr.informatik.hs-fulda.de:23227] mca:base:select:(  plm) Selected component 
> [rsh]
> [tyr.informatik.hs-fulda.de:23227] [[14523,0],3] plm:rsh_setup on agent ssh : 
> rsh path NULL
> [tyr.informatik.hs-fulda.de:23227] [[14523,0],3] plm:base:receive start comm
> [tyr.informatik.hs-fulda.de:23227] [[14523,0],3] plm:base:receive stop comm
> [tyr.informatik.hs-fulda.de:23227] mca: base: close: component rsh closed
> [tyr.informatik.hs-fulda.de:23227] mca: base: close: unloading component rsh
> [linpc0:32306] [[14523,0],1] daemon 3 failed with status 1
> [linpc1:20650] [[14523,0],0] plm:base:orted_cmd sending orted_exit commands
> [linpc1:20650] [[14523,0],0] plm:base:receive stop comm
> [linpc1:20650] mca: base: close: component rsh closed
> [linpc1:20650] mca: base: close: unloading component rsh
> linpc1 openmpi_1.7.x_or_newer 189 [sunpc1:09408] [[14523,0],2] 
> plm:base:receive stop comm
> [sunpc1:09408] mca: base: close: component rsh closed
> [sunpc1:09408] mca: base: close: unloading component rsh
> [linpc0:32306] [[14523,0],1] plm:base:receive stop comm
> [linpc0:32306] mca: base: close: component rsh closed
> [linpc0:32306] mca: base: close: unloading component rsh
> 
> linpc1 openmpi_1.7.x_or_newer 189 
> 
> 
> 
> linpc1 openmpi_1.7.x_or_newer 189 mpiexec -report-bindings 
> --display-allocation --mca rmaps_base_verbose_100 -np 1 -rf 
> rf_linpc_sunpc_tyr hostname
> 
> ======================   ALLOCATED NODES   ======================
>        linpc1: slots=1 max_slots=0 slots_inuse=0
> =================================================================
> --------------------------------------------------------------------------
> mpiexec was unable to find the specified executable file, and therefore
> did not launch the job.  This error was first reported for process
> rank 0; it may have occurred for other processes as well.
> 
> NOTE: A common cause for this error is misspelling a mpiexec command
>      line parameter option (remember that mpiexec interprets the first
>      unrecognized command line token as the executable).
> 
> Node:       linpc1
> Executable: 1
> --------------------------------------------------------------------------
> linpc1 openmpi_1.7.x_or_newer 190 
> 
> 
> 
> 
> Kind regards
> 
> Siegmar
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users

Reply via email to