Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Siegmar Gross
Hi Ralph, > Siegmar: have you tried the latest release candidate? Yes, it is still broken. Kind regards, and thank you very much for your help. Siegmar

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
https://github.com/open-mpi/ompi-release/pull/1117 > On Apr 29, 2016, at 7:38 AM, Ralph Castain wrote: > > Ah, okay - I can fix that line. Thanks for pointing it out. > > Given that the rest of the code uses the app[i] syntax, I’d rather le

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
Ah, okay - I can fix that line. Thanks for pointing it out. Given that the rest of the code uses the app[i] syntax, I’d rather leave that alone. > On Apr 29, 2016, at 7:27 AM, Gilles Gouaillardet > wrote: > > the second for loop is incorrect > > it reads > for (j=0; j < app->argc; j++) > bu

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Gilles Gouaillardet
The second for loop is incorrect. It reads: for (j=0; j < app->argc; j++) but should be: for (j=0; j < app[i].argc; j++) As a matter of taste, I'd rather replace all app[i]. with app-> and put app++; at the end (or in the for) of the outermost loop. Cheers, Gilles On Friday, April 29, 2016, Ralph Cas
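A minimal sketch of the indexing problem described in this message, assuming a simplified stand-in struct. demo_app_t and the count_visited_* functions are hypothetical names for illustration and are not taken from the PMIx sources. The functions only count how many argv slots the inner loop would visit, so the wrong loop bound can be observed safely.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for pmix_app_t; only the field needed here. */
typedef struct {
    int argc;
} demo_app_t;

/* Buggy bound: app->argc is always the argc of the FIRST element,
 * no matter which app[i] the outer loop is currently handling. */
int count_visited_buggy(const demo_app_t *app, int num_vals)
{
    int i, j, visited = 0;
    assert(app != NULL || num_vals == 0);
    for (i = 0; i < num_vals; i++) {
        for (j = 0; j < app->argc; j++) {
            visited++;
        }
    }
    return visited;
}

/* Fixed bound: use the argc of the element being handled. */
int count_visited_fixed(const demo_app_t *app, int num_vals)
{
    int i, j, visited = 0;
    assert(app != NULL || num_vals == 0);
    for (i = 0; i < num_vals; i++) {
        for (j = 0; j < app[i].argc; j++) {
            visited++;
        }
    }
    return visited;
}

/* Alternative style: advance the pointer in the outer loop,
 * so that app-> always refers to the current element. */
int count_visited_ptr(const demo_app_t *app, int num_vals)
{
    int i, j, visited = 0;
    assert(app != NULL || num_vals == 0);
    for (i = 0; i < num_vals; i++, app++) {
        for (j = 0; j < app->argc; j++) {
            visited++;
        }
    }
    return visited;
}
```

With two apps whose argc values are 2 and 3, the buggy variant visits 4 slots while the other two visit 5. When the first app has the larger argc, the buggy bound would instead run past the end of a later app's argv, which is presumably why the failure only showed up with multiple app contexts.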

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
Hmmm…well, I may have to wait and let Gilles fix this. So far as I can see, the code in the current OMPI 2.x tarball (and upstream) is correct: int pmix_bfrop_pack_app(pmix_buffer_t *buffer, const void *src, int32_t num_vals, pmix_data_type_t type) { pmix_app_t *app;

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Siegmar Gross
Hi Gilles, thank you very much for identifying the reason for the problem and fixing it. Have a nice weekend Siegmar On 29.04.2016 at 03:38, Gilles Gouaillardet wrote: Siegmar, in pmix_bfrop_pack_app, app->argc must be replaced with app[i].argc I will PR to pmix, ompi and ompi-release when

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-29 Thread Ralph Castain
Ouch - thanks for finding this, Gilles! I’ll take care of it on Friday. > On Apr 28, 2016, at 6:38 PM, Gilles Gouaillardet > wrote: > > Siegmar, > > in pmix_bfrop_pack_app, > app->argc > must be replaced with > app[i].argc > > I will PR to pmix, ompi and ompi-release when I am back at work on

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-28 Thread Gilles Gouaillardet
Siegmar, in pmix_bfrop_pack_app, app->argc must be replaced with app[i].argc I will PR to pmix, ompi and ompi-release when I am back at work on Monday Cheers, Gilles On Thursday, April 28, 2016, Gilles Gouaillardet wrote: > Siegmar, > > > can you please also post the source of spawn_slave ?

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-28 Thread Siegmar Gross
Hi Gilles, I'm sorry that I forgot the file. Kind regards Siegmar On 28.04.2016 at 03:34, Gilles Gouaillardet wrote: Siegmar, can you please also post the source of spawn_slave? Cheers, Gilles On 4/28/2016 1:17 AM, Siegmar Gross wrote: Hi Gilles, it is not necessary to have a hete

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Gilles Gouaillardet
Siegmar, can you please also post the source of spawn_slave ? Cheers, Gilles On 4/28/2016 1:17 AM, Siegmar Gross wrote: Hi Gilles, it is not necessary to have a heterogeneous environment to reproduce the error as you can see below. All machines are 64 bit. tyr spawn 119 ompi_info | grep

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Siegmar Gross
Hi Gilles, it is not necessary to have a heterogeneous environment to reproduce the error as you can see below. All machines are 64 bit. tyr spawn 119 ompi_info | grep -e "OPAL repo revision" -e "C compiler absolute" OPAL repo revision: v2.x-dev-1290-gbd0e4e1 C compiler absolute: /usr

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Siegmar Gross
Hi Gilles, adding "-std=c99" to CFLAGS solves the problem with the missing library. Shall I add it permanently to my configure command or will you add it, so that I will not run into problems if you need the C11 standard later? "spawn_multiple_master" breaks with the same error that I reported y

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-27 Thread Gilles Gouaillardet
Siegmar, here is the error : configure:17969: cc -o conftest -m64 -D_REENTRANT -g -g -I/export2/src/openmpi-2.0.0/openmpi-v2.x-dev-1290-gbd0e4e1 -I/export2/src/openmpi-2.0.0/openmpi-v2.x-dev-1290-gbd0e4e1-SunOS.sparc.64_cc -I/export2/src/openmpi-2.0.0/openmpi-v2.x-dev-1290-gbd0e4e1/opal/inc

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-26 Thread Siegmar Gross
Hi Gilles and Ralph, I was able to sort out my mess. In my last email I compared the files from "SunOS_sparc/openmpi-2.0.0_64_gcc/lib64/openmpi" from the attachment of my email to Ralph with the files from "SunOS_sparc/openmpi-2.0.0_64_cc/lib64/openmpi" from my current file system. That's the rea

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-23 Thread Siegmar Gross
Hi Gilles, I don't know what happened, but the files are not available now and they were definitely available when I answered the email from Ralph. The files also have a different timestamp now. This is an extract from my email to Ralph for Solaris Sparc. -rwxr-xr-x 1 root root 977 Apr 19 19

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-22 Thread Gilles Gouaillardet
Siegmar, I will try to reproduce this on my solaris11 x86_64 vm. In the meantime, can you please double check that mca_pmix_pmix_pmix112.so is a 64-bit library? (e.g., confirm "-m64" was correctly passed to pmix) Cheers, Gilles On Friday, April 22, 2016, Siegmar Gross < siegmar.gr...@informatik.hs
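One way to check whether a shared object is a 64-bit build is to read the class byte in its ELF header. The following is a rough sketch, not part of Open MPI or PMIx, and the function name elf_class_bits is made up for this illustration. It relies only on the ELF layout: the file starts with the four magic bytes 0x7f 'E' 'L' 'F', followed by a class byte that is 1 for 32-bit and 2 for 64-bit objects.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: returns 64 or 32 for an ELF file,
 * 0 if the file is not a recognizable ELF object,
 * and -1 if the file cannot be opened. */
int elf_class_bits(const char *path)
{
    unsigned char ident[5];
    size_t n;
    FILE *fp;

    assert(path != NULL);
    fp = fopen(path, "rb");
    if (fp == NULL) {
        return -1;
    }
    n = fread(ident, 1, sizeof ident, fp);
    fclose(fp);

    /* Too short, or missing the ELF magic number. */
    if (n != sizeof ident || memcmp(ident, "\177ELF", 4) != 0) {
        return 0;
    }
    /* ident[4] is the class byte: 1 = 32-bit, 2 = 64-bit. */
    if (ident[4] == 1) {
        return 32;
    }
    if (ident[4] == 2) {
        return 64;
    }
    return 0;
}
```

In practice, running the file utility on the shared object usually reports the same information without any extra code.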

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Then I must surrender - I cannot understand your output in that context. Perhaps Gilles or someone with access to Solaris or Sparc can take a look as this makes no sense to me. Sorry! Ralph > On Apr 21, 2016, at 9:30 AM, Siegmar Gross > wrote: > > Hi Ralph, > > I've already used "-enable-d

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, I've already used "-enable-debug". "SYSTEM_ENV" is "SunOS" or "Linux" and "MACHINE_ENV" is "sparc" or "x86_64". mkdir openmpi-v2.x-dev-1280-gc110ae8-${SYSTEM_ENV}.${MACHINE_ENV}.64_gcc cd openmpi-v2.x-dev-1280-gc110ae8-${SYSTEM_ENV}.${MACHINE_ENV}.64_gcc ../openmpi-v2.x-dev-1280-gc110

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Can you please rebuild OMPI with -enable-debug in the configure cmd? It will let us see more error output. > On Apr 21, 2016, at 8:52 AM, Siegmar Gross > wrote: > > Hi Ralph, > > I don't see any additional information. > > tyr hello_1 108 mpiexec -np 4 --host tyr,sunpc1,linpc1,ruester -mca

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, I don't see any additional information. tyr hello_1 108 mpiexec -np 4 --host tyr,sunpc1,linpc1,ruester -mca mca_base_component_show_load_errors 1 hello_1_mpi [tyr.informatik.hs-fulda.de:06211] [[48741,0],0] ORTE_ERROR_LOG: Not found in file ../../../../../openmpi-v2.x-dev-1280-gc110

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Hmmm…it looks like you built the right components, but they are not being picked up. Can you run your mpiexec command again, adding “-mca mca_base_component_show_load_errors 1” to the cmd line? > On Apr 21, 2016, at 8:16 AM, Siegmar Gross > wrote: > > Hi Ralph, > > I have attached ompi_info

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, I have attached ompi_info output for both compilers from my sparc machine and the listings for both compilers from the /lib/openmpi directories. Hopefully that helps to find the problem. hermes tmp 3 tar zvft openmpi-2.x_info.tar.gz -rw-r--r-- root/root 10969 2016-04-21 17:06 ompi_

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Ralph Castain
Odd - it would appear that none of the pmix components built? Can you send along the output from ompi_info? Or just send a listing of the files in the /lib/openmpi directory? > On Apr 21, 2016, at 1:27 AM, Siegmar Gross > wrote: > > Hi Ralph, > > On 21.04.2016 at 00:18, Ralph Castain wrote

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-21 Thread Siegmar Gross
Hi Ralph, On 21.04.2016 at 00:18, Ralph Castain wrote: Could you please rerun these tests and add “-mca pmix_base_verbose 10 -mca pmix_server_verbose 5” to your cmd line? I need to see why the pmix components failed. tyr spawn 111 mpiexec -np 1 --host tyr,sunpc1,linpc1,ruester -mca pmix_base

Re: [OMPI users] runtime errors for openmpi-v2.x-dev-1280-gc110ae8

2016-04-20 Thread Ralph Castain
Could you please rerun these tests and add “-mca pmix_base_verbose 10 -mca pmix_server_verbose 5” to your cmd line? I need to see why the pmix components failed. Thanks Ralph > On Apr 20, 2016, at 10:12 AM, Siegmar Gross > wrote: > > Hi, > > I have built openmpi-v2.x-dev-1280-gc110ae8 on my