Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Siegmar Gross
Hi Ralph, Siegmar: have you tried the latest release candidate? Yes, it is still broken. Kind regards and thank you very much for your help Siegmar

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
https://github.com/open-mpi/ompi-release/pull/1117 > On Apr 29, 2016, at 7:38 AM, Ralph Castain wrote: > > Ah, okay - I can fix that line. Thanks for pointing it out. > > Given that the rest of the code uses the app[i] syntax, I’d rather le

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
Ah, okay - I can fix that line. Thanks for pointing it out. Given that the rest of the code uses the app[i] syntax, I’d rather leave that alone. > On Apr 29, 2016, at 7:27 AM, Gilles Gouaillardet > wrote: > > the second for loop is incorrect > > it reads > for (j=0; j < app->argc; j++) > bu

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Gilles Gouaillardet
the second for loop is incorrect it reads for (j=0; j < app->argc; j++) but should be for (j=0; j < app[i].argc; j++) as a matter of taste, I'd rather replace all app[i]. with app-> and app++; at the end (or in the for) of the outermost loop Cheers, Gilles On Friday, April 29, 2016, Ralph Cas

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
Hmmm…well, I may have to wait and let Gilles fix this. So far as I can see, the code in the current OMPI 2.x tarball (and upstream) is correct: int pmix_bfrop_pack_app(pmix_buffer_t *buffer, const void *src, int32_t num_vals, pmix_data_type_t type) { pmix_app_t *app;

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Siegmar Gross
Hi Gilles, thank you very much for identifying the reason for the problem and fixing it. Have a nice weekend Siegmar Am 29.04.2016 um 03:38 schrieb Gilles Gouaillardet: Siegmar, in pmix_bfrop_pack_app, app->argc must be replaced with app[i].argc I will PR to pmix, ompi and ompi-release when

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
Ouch - thanks for finding this, Gilles! I’ll take care of it on Friday. > On Apr 28, 2016, at 6:38 PM, Gilles Gouaillardet > wrote: > > Siegmar, > > in pmix_bfrop_pack_app, > app->argc > must be replaced with > app[i].argc > > I will PR to pmix, ompi and ompi-release when I am back at work on

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-28 Thread Gilles Gouaillardet
Siegmar, in pmix_bfrop_pack_app, app->argc must be replaced with app[i].argc I will PR to pmix, ompi and ompi-release when I am back at work on Monday Cheers, Gilles On Thursday, April 28, 2016, Gilles Gouaillardet wrote: > Siegmar, > > > can you please also post the source of spawn_slave ?

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-28 Thread Siegmar Gross
Hi Gilles, I'm sorry that I forgot the file. Kind regards Siegmar Am 28.04.2016 um 03:34 schrieb Gilles Gouaillardet: Siegmar, can you please also post the source of spawn_slave ? Cheers, Gilles On 4/28/2016 1:17 AM, Siegmar Gross wrote: Hi Gilles, it is not necessary to have a hete

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Gilles Gouaillardet
Siegmar, can you please also post the source of spawn_slave ? Cheers, Gilles On 4/28/2016 1:17 AM, Siegmar Gross wrote: Hi Gilles, it is not necessary to have a heterogeneous environment to reproduce the error as you can see below. All machines are 64 bit. tyr spawn 119 ompi_info | grep

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Siegmar Gross
Hi Gilles, it is not necessary to have a heterogeneous environment to reproduce the error as you can see below. All machines are 64 bit. tyr spawn 119 ompi_info | grep -e "OPAL repo revision" -e "C compiler absolute" OPAL repo revision: v2.x-dev-1290-gbd0e4e1 C compiler absolute: /usr

[OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Gilles Gouaillardet
Siegmar, please add this to your CFLAGS for the time being. configure tries to detect which flags must be added for C99 support, and it seems the test is not working for Solaris 10 and Oracle compilers. this is no more a widely used environment, and I am not sure I can find the time to fix this i

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Siegmar Gross
Hi Gilles, adding "-std=c99" to CFLAGS solves the problem with the missing library. Shall I add it permanently to my configure command or will you add it, so that I will not run into problems if you need the C11 standard later? "spawn_multiple_master" breaks with the same error that I reported y

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Gilles Gouaillardet
Siegmar, here is the error : configure:17969: cc -o conftest -m64 -D_REENTRANT -g -g -I/export2/src/openmpi-2.0.0/openmpi-v2.x-dev-1290-gbd0e4e1 -I/export2/src/openmpi-2.0.0/openmpi-v2.x-dev-1290-gbd0e4e1-SunOS.sparc.64_cc -I/export2/src/openmpi-2.0.0/openmpi-v2.x-dev-1290-gbd0e4e1/opal/inc

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-26 Thread Siegmar Gross
Hi Gilles and Ralph, I was able to sort out my mess. In my last email I compared the files from "SunOS_sparc/openmpi-2.0.0_64_gcc/lib64/openmpi" from the attachment of my email to Ralph with the files from "SunOS_sparc/openmpi-2.0.0_64_cc/lib64/openmpi" from my current file system. That's the rea

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-23 Thread Siegmar Gross
Hi Gilles, I don't know what happened, but the files are not available now and they were definitely available when I answered the email from Ralph. The files also have a different timestamp now. This is an extract from my email to Ralph for Solaris Sparc. -rwxr-xr-x 1 root root 977 Apr 19 19

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-22 Thread Gilles Gouaillardet
Siegmar, I will try to reproduce this on my solaris11 x86_64 vm In the mean time, can you please double check mca_pmix_pmix_pmix112.so is a 64 bits library ? (E.g, confirm "-m64" was correctly passed to pmix) Cheers, Gilles On Friday, April 22, 2016, Siegmar Gross < siegmar.gr...@informatik.hs

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Then I must surrender - I cannot understand your output in that context. Perhaps Gilles or someone with access to Solaris or Sparc can take a look as this makes no sense to me. Sorry! Ralph > On Apr 21, 2016, at 9:30 AM, Siegmar Gross > wrote: > > Hi Ralph, > > I've already used "-enable-d

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, I've already used "-enable-debug". "SYSTEM_ENV" is "SunOS" or "Linux" and "MACHINE_ENV" is "sparc" or "x86_84". mkdir openmpi-v2.x-dev-1280-gc110ae8-${SYSTEM_ENV}.${MACHINE_ENV}.64_gcc cd openmpi-v2.x-dev-1280-gc110ae8-${SYSTEM_ENV}.${MACHINE_ENV}.64_gcc ../openmpi-v2.x-dev-1280-gc110

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Can you please rebuild OMPI with -enable-debug in the configure cmd? It will let us see more error output > On Apr 21, 2016, at 8:52 AM, Siegmar Gross > wrote: > > Hi Ralph, > > I don't see any additional information. > > tyr hello_1 108 mpiexec -np 4 --host tyr,sunpc1,linpc1,ruester -mca

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, I don't see any additional information. tyr hello_1 108 mpiexec -np 4 --host tyr,sunpc1,linpc1,ruester -mca mca_base_component_show_load_errors 1 hello_1_mpi [tyr.informatik.hs-fulda.de:06211] [[48741,0],0] ORTE_ERROR_LOG: Not found in file ../../../../../openmpi-v2.x-dev-1280-gc110

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Hmmm…it looks like you built the right components, but they are not being picked up. Can you run your mpiexec command again, adding “-mca mca_base_component_show_load_errors 1” to the cmd line? > On Apr 21, 2016, at 8:16 AM, Siegmar Gross > wrote: > > Hi Ralph, > > I have attached ompi_info

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, I have attached ompi_info output for both compilers from my sparc machine and the listings for both compilers from the /lib/openmpi directories. Hopefully that helps to find the problem. hermes tmp 3 tar zvft openmpi-2.x_info.tar.gz -rw-r--r-- root/root 10969 2016-04-21 17:06 ompi_

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Odd - it would appear that none of the pmix components built? Can you send along the output from ompi_info? Or just send a listing of the files in the /lib/openmpi directory? > On Apr 21, 2016, at 1:27 AM, Siegmar Gross > wrote: > > Hi Ralph, > > Am 21.04.2016 um 00:18 schrieb Ralph Castain

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, Am 21.04.2016 um 00:18 schrieb Ralph Castain: Could you please rerun these test and add “-mca pmix_base_verbose 10 -mca pmix_server_verbose 5” to your cmd line? I need to see why the pmix components failed. tyr spawn 111 mpiexec -np 1 --host tyr,sunpc1,linpc1,ruester -mca pmix_base

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-20 Thread Ralph Castain
Could you please rerun these test and add “-mca pmix_base_verbose 10 -mca pmix_server_verbose 5” to your cmd line? I need to see why the pmix components failed. Thanks Ralph > On Apr 20, 2016, at 10:12 AM, Siegmar Gross > wrote: > > Hi, > > I have built openmpi-v2.x-dev-1280-gc110ae8 on my

[OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-20 Thread Siegmar Gross
Hi, I have built openmpi-v2.x-dev-1280-gc110ae8 on my machines (Solaris 10 Sparc, Solaris 10 x86_64, and openSUSE Linux 12.1 x86_64) with gcc-5.1.0 and Sun C 5.13. Unfortunately I get runtime errors for some programs. Sun C 5.13: === For all my test programs I get the same error on Sol