Hi Ralph,
Am 21.04.2016 um 00:18 schrieb Ralph Castain:
Could you please rerun these test and add “-mca pmix_base_verbose 10
-mca pmix_server_verbose 5” to your cmd line? I need to see why the
pmix components failed.
tyr spawn 111 mpiexec -np 1 --host tyr,sunpc1,linpc1,ruester -mca
pmix_base_verbose 10 -mca pmix_server_verbose 5 spawn_multiple_master
[tyr.informatik.hs-fulda.de:26652] mca: base: components_register: registering
framework pmix components
[tyr.informatik.hs-fulda.de:26652] mca: base: components_open: opening pmix
components
[tyr.informatik.hs-fulda.de:26652] mca:base:select: Auto-selecting pmix
components
[tyr.informatik.hs-fulda.de:26652] mca:base:select:( pmix) No component
selected!
[tyr.informatik.hs-fulda.de:26652] [[52794,0],0] ORTE_ERROR_LOG: Not found in
file
../../../../../openmpi-v2.x-dev-1280-gc110ae8/orte/mca/ess/hnp/ess_hnp_module.c
at line 638
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
opal_pmix_base_select failed
--> Returned value Not found (-13) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
tyr spawn 112
tyr hello_1 116 mpiexec -np 1 --host tyr,sunpc1,linpc1,ruester -mca
pmix_base_verbose 10 -mca pmix_server_verbose 5 hello_1_mpi
[tyr.informatik.hs-fulda.de:27261] mca: base: components_register: registering
framework pmix components
[tyr.informatik.hs-fulda.de:27261] mca: base: components_open: opening pmix
components
[tyr.informatik.hs-fulda.de:27261] mca:base:select: Auto-selecting pmix
components
[tyr.informatik.hs-fulda.de:27261] mca:base:select:( pmix) No component
selected!
[tyr.informatik.hs-fulda.de:27261] [[52315,0],0] ORTE_ERROR_LOG: Not found in
file
../../../../../openmpi-v2.x-dev-1280-gc110ae8/orte/mca/ess/hnp/ess_hnp_module.c
at line 638
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
opal_pmix_base_select failed
--> Returned value Not found (-13) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
tyr hello_1 117
Thank you very much for your help.
Kind regards
Siegmar
Thanks
Ralph
On Apr 20, 2016, at 10:12 AM, Siegmar Gross
<siegmar.gr...@informatik.hs-fulda.de> wrote:
Hi,
I have built openmpi-v2.x-dev-1280-gc110ae8 on my machines
(Solaris 10 Sparc, Solaris 10 x86_64, and openSUSE Linux
12.1 x86_64) with gcc-5.1.0 and Sun C 5.13. Unfortunately I get
runtime errors for some programs.
Sun C 5.13:
===========
For all my test programs I get the same error on Solaris Sparc and
Solaris x86_64, while the programs work fine on Linux.
tyr hello_1 115 mpiexec -np 2 hello_1_mpi
[tyr.informatik.hs-fulda.de:22373] [[61763,0],0] ORTE_ERROR_LOG: Not found in
file
../../../../../openmpi-v2.x-dev-1280-gc110ae8/orte/mca/ess/hnp/ess_hnp_module.c
at line 638
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
opal_pmix_base_select failed
--> Returned value Not found (-13) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
tyr hello_1 116
GCC-5.1.0:
==========
tyr spawn 121 mpiexec -np 1 --host tyr,sunpc1,linpc1,ruester
spawn_multiple_master
Parent process 0 running on tyr.informatik.hs-fulda.de
I create 3 slave processes.
[tyr.informatik.hs-fulda.de:25366] PMIX ERROR: UNPACK-PAST-END in file
../../../../../../openmpi-v2.x-dev-1280-gc110ae8/opal/mca/pmix/pmix112/pmix/src/server/pmix_server_ops.c
at line 829
[tyr.informatik.hs-fulda.de:25366] PMIX ERROR: UNPACK-PAST-END in file
../../../../../../openmpi-v2.x-dev-1280-gc110ae8/opal/mca/pmix/pmix112/pmix/src/server/pmix_server.c
at line 2176
[tyr:25377] *** An error occurred in MPI_Comm_spawn_multiple
[tyr:25377] *** reported by process [3308257281,0]
[tyr:25377] *** on communicator MPI_COMM_WORLD
[tyr:25377] *** MPI_ERR_SPAWN: could not spawn processes
[tyr:25377] *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now
abort,
[tyr:25377] *** and potentially your MPI job)
tyr spawn 122
I would be grateful if somebody can fix the problems. Thank you very
much for any help in advance.
Kind regards
Siegmar
<hello_1_mpi.c><spawn_multiple_master.c>_______________________________________________
users mailing list
us...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post:
http://www.open-mpi.org/community/lists/users/2016/04/28983.php
_______________________________________________
users mailing list
us...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post:
http://www.open-mpi.org/community/lists/users/2016/04/28986.php