Hi Siegmar,

The attached JRE log shows very important information.

When JRE loads the MPI class, JNI_OnLoad function in
libmpi_java.so (Open MPI library; written in C) is called.
And probably mca_base_var_cache_files function passes NULL
to asprintf function. I don't know how this situation occurs.
You may be able to track this by inserting debug printf in
Open MPI code shown in the stack trace, or by using gdb or
something.

hs_err_pid13080.log:
----------------------------------------------------------------
siginfo:si_signo=SIGSEGV: si_errno=0, si_code=1 (SEGV_MAPERR), 
si_addr=0x0000000000000000

Stack: [0xffffffff7b400000,0xffffffff7b500000],  sp=0xffffffff7b4fc730,  free 
space=1009k
Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code)
C  [libc.so.1+0x3c7f0]  strlen+0x50
C  [libc.so.1+0xaf640]  vsnprintf+0x84
C  [libc.so.1+0xaadb4]  vasprintf+0x20
C  [libc.so.1+0xaaf04]  asprintf+0x28
C  [libopen-pal.so.0.0.0+0xaf3cc]  mca_base_var_cache_files+0x160
C  [libopen-pal.so.0.0.0+0xaed90]  mca_base_var_init+0x4e8
C  [libopen-pal.so.0.0.0+0xb260c]  register_variable+0x214
C  [libopen-pal.so.0.0.0+0xb36a0]  mca_base_var_register+0x104
C  [libmpi_java.so.0.0.0+0x221e8]  JNI_OnLoad+0x128
C  [libjava.so+0x10860]  Java_java_lang_ClassLoader_00024NativeLibrary_load+0xb8
j  java.lang.ClassLoader$NativeLibrary.load(Ljava/lang/String;Z)V+-665819
j  java.lang.ClassLoader$NativeLibrary.load(Ljava/lang/String;Z)V+0
j  java.lang.ClassLoader.loadLibrary0(Ljava/lang/Class;Ljava/io/File;)Z+328
j  java.lang.ClassLoader.loadLibrary(Ljava/lang/Class;Ljava/lang/String;Z)V+290
j  java.lang.Runtime.loadLibrary0(Ljava/lang/Class;Ljava/lang/String;)V+54
j  java.lang.System.loadLibrary(Ljava/lang/String;)V+7
j  mpi.MPI.<clinit>()V+28
----------------------------------------------------------------

Regards,
Takahiro

> Hello Oscar,
> 
> do you have time to look into my problem? Probably Takahiro has a
> point and gdb behaves differently on Solaris and Linux, so that
> the differing outputs have no meaning. I tried to debug my Java
> program, but without success so far, because I wasn't able to get
> into the Java program to set a breakpoint or to see the code. Have
> you succeeded to debug a mpiJava program? If so, how must I call
> gdb (I normally use "gdb mipexec" and then "run -np 1 java ...")?
> What can I do to get helpful information to track the error down?
> I have attached the error log file. Perhaps you can see if something
> is going wrong with the Java interface. Thank you very much for your
> help and any hints for the usage of gdb with mpiJava in advance.
> Please let me know if I can provide anything else.
> 
> 
> Kind regards
> 
> Siegmar
> 
> 
> > > I think that it must have to do with MPI, because everything
> > > works fine on Linux and my Java program works fine with an older
> > > MPI version (openmpi-1.8.2a1r31804) as well.
> > 
> > Yes. I also think it must have to do with MPI.
> > But java process side, not mpiexec process side.
> > 
> > When you run Java MPI program via mpiexec, a mpiexec process
> > process launch a java process. When the java process (your
> > Java program) calls a MPI method, native part (written in C/C++)
> > of the MPI library is called. It runs in java process, not in
> > mpiexec process. I suspect that part.
> > 
> > > On Solaris things are different.
> > 
> > Are you saying the following difference?
> > After this line,
> > > 881             ORTE_ACTIVATE_JOB_STATE(jdata, ORTE_JOB_STATE_INIT);
> > Linux shows
> > > orte_job_state_to_str (state=1)
> > >     at ../../openmpi-dev-124-g91e9686/orte/util/error_strings.c:217
> > > 217         switch(state) {
> > but Solaris shows
> > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:122
> > > 122         if (NULL == name) {
> > 
> > Each macro is defined as:
> > 
> > #define ORTE_ACTIVATE_JOB_STATE(j, s)                                   \
> >     do {                                                                \
> >         orte_job_t *shadow=(j);                                         \
> >         opal_output_verbose(1, orte_state_base_framework.framework_output, \
> >                             "%s ACTIVATE JOB %s STATE %s AT %s:%d", \
> >                             ORTE_NAME_PRINT(ORTE_PROC_MY_NAME),         \
> >                             (NULL == shadow) ? "NULL" :                 \
> >                             ORTE_JOBID_PRINT(shadow->jobid),                
> > \
> >                             orte_job_state_to_str((s)),                 \
> >                             __FILE__, __LINE__);                    \
> >         orte_state.activate_job_state(shadow, (s));                     \
> >     } while(0);
> > 
> > #define ORTE_NAME_PRINT(n) \
> >     orte_util_print_name_args(n)
> > 
> > #define ORTE_JOBID_PRINT(n) \
> >     orte_util_print_jobids(n)
> > 
> > I'm not sure, but I think the gdb on Solaris steps into
> > orte_util_print_name_args, but gdb on Linux doesn't step into
> > orte_util_print_name_args and orte_util_print_jobids for some
> > reason, or orte_job_state_to_str is evaluated before them.
> > 
> > So I think it's not an important difference.
> > 
> > You showed the following lines.
> > > > > orterun (argc=5, argv=0xffffffff7fffe0d8)
> > > > >     at 
> ../../../../openmpi-dev-124-g91e9686/orte/tools/orterun/orterun.c:1084
> > > > > 1084        while (orte_event_base_active) {
> > > > > (gdb) 
> > > > > 1085            opal_event_loop(orte_event_base, OPAL_EVLOOP_ONCE);
> > > > > (gdb) 
> > 
> > I'm not familiar with this code but I think this part (in mpiexec
> > process) is only waiting the java process to terminate (normally
> > or abnormally). So I think the problem is not in a mpiexec process
> > but in a java process.
> > 
> > Regards,
> > Takahiro
> > 
> > > Hi Takahiro,
> > > 
> > > > mpiexec and java run as distinct processes. Your JRE message
> > > > says java process raises SEGV. So you should trace the java
> > > > process, not the mpiexec process. And more, your JRE message
> > > > says the crash happened outside the Java Virtual Machine in
> > > > native code. So usual Java program debugger is useless.
> > > > You should trace native code part of the java process.
> > > > Unfortunately I don't know how to debug such one.
> > > 
> > > I think that it must have to do with MPI, because everything
> > > works fine on Linux and my Java program works fine with an older
> > > MPI version (openmpi-1.8.2a1r31804) as well.
> > > 
> > > linpc1 x 112 mpiexec -np 1 java InitFinalizeMain
> > > Hello!
> > > linpc1 x 113 
> > > 
> > > Therefore I single stepped through the program on Linux as well
> > > and found a difference launching the process. On Linux I get the
> > > following sequence.
> > > 
> > > Breakpoint 1, rsh_launch (jdata=0x614aa0)
> > >     at 
> ../../../../../openmpi-dev-124-g91e9686/orte/mca/plm/rsh/plm_rsh_module.c:876
> > > 876         if (ORTE_FLAG_TEST(jdata, ORTE_JOB_FLAG_RESTART)) {
> > > (gdb) s
> > > 881             ORTE_ACTIVATE_JOB_STATE(jdata, ORTE_JOB_STATE_INIT);
> > > (gdb) s
> > > orte_job_state_to_str (state=1)
> > >     at ../../openmpi-dev-124-g91e9686/orte/util/error_strings.c:217
> > > 217         switch(state) {
> > > (gdb) 
> > > 221             return "PENDING INIT";
> > > (gdb) 
> > > 317     }
> > > (gdb) 
> > > orte_util_print_jobids (job=4294967295)
> > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:170
> > > 170         ptr = get_print_name_buffer();
> > > (gdb) 
> > > 
> > > 
> > > 
> > > On Solaris things are different.
> > > 
> > > Breakpoint 1, rsh_launch (jdata=0x100125250)
> > >     at 
> ../../../../../openmpi-dev-124-g91e9686/orte/mca/plm/rsh/plm_rsh_module.c:876
> > > 876         if (ORTE_FLAG_TEST(jdata, ORTE_JOB_FLAG_RESTART)) {
> > > (gdb) s
> > > 881             ORTE_ACTIVATE_JOB_STATE(jdata, ORTE_JOB_STATE_INIT);
> > > (gdb) s
> > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:122
> > > 122         if (NULL == name) {
> > > (gdb) 
> > > 142         job = orte_util_print_jobids(name->jobid);
> > > (gdb) 
> > > orte_util_print_jobids (job=2673410048)
> > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:170
> > > 170         ptr = get_print_name_buffer();
> > > (gdb) 
> > > 
> > > 
> > > 
> > > Is this normal or is it the reason for the crash on Solaris?
> > > 
> > > 
> > > Kind regards
> > > 
> > > Siegmar
> > > 
> > > 
> > > 
> > > 
> > > 
> > > 
> > > 
> > > > The log file output by JRE may help you.
> > > > > # An error report file with more information is saved as:
> > > > > # 
> /home/fd1026/work/skripte/master/parallel/prog/mpi/java/hs_err_pid13080.log
> > > > 
> > > > Regards,
> > > > Takahiro
> > > > 
> > > > > Hi,
> > > > > 
> > > > > I installed openmpi-dev-124-g91e9686 on Solaris 10 Sparc with
> > > > > gcc-4.9.1 to track down the error with my small Java program.
> > > > > I started single stepping in orterun.c at line 1081 and
> > > > > continued until I got the segmentation fault. I get
> > > > > "jdata = 0x0" in version openmpi-1.8.2a1r31804, which is the
> > > > > last one which works with Java in my environment, while I get
> > > > > "jdata = 0x100125250" in this version. Unfortunately I don't
> > > > > know which files or variables are important to look at. Perhaps
> > > > > somebody can look at the following lines of code and tell me,
> > > > > which information I should provide to solve the problem. I know
> > > > > that Solaris isn't any longer on your list of supported systems,
> > > > > but perhaps we can get it working again, if you tell me what
> > > > > you need and I do the debugging.
> > > > > 
> > > > > /usr/local/gdb-7.6.1_64_gcc/bin/gdb mpiexec
> > > > > GNU gdb (GDB) 7.6.1
> > > > > ...
> > > > > (gdb) run -np 1 java InitFinalizeMain 
> > > > > Starting program: /usr/local/openmpi-1.9.0_64_gcc/bin/mpiexec \
> > > > >   -np 1 java InitFinalizeMain
> > > > > [Thread debugging using libthread_db enabled]
> > > > > [New Thread 1 (LWP 1)]
> > > > > [New LWP    2        ]
> > > > > #
> > > > > # A fatal error has been detected by the Java Runtime Environment:
> > > > > #
> > > > > #  SIGSEGV (0xb) at pc=0xffffffff7ea3c7f0, pid=13064, tid=2
> > > > > ...
> > > > > [LWP    2         exited]
> > > > > [New Thread 2        ]
> > > > > [Switching to Thread 1 (LWP 1)]
> > > > > sol_thread_fetch_registers: td_ta_map_id2thr: no thread can be
> > > > >   found to satisfy query
> > > > > (gdb) thread 1
> > > > > [Switching to thread 1 (LWP    1        )]
> > > > > #0  0xffffffff7f6173d0 in rtld_db_dlactivity () from 
> /usr/lib/sparcv9/ld.so.1
> > > > > (gdb) b orterun.c:1081
> > > > > Breakpoint 1 at 0x1000070dc: file 
> > > > > ../../../../openmpi-dev-124-g91e9686/orte/tools/orterun/orterun.c, 
> > > > > line 
> 1081.
> > > > > (gdb) r
> > > > > The program being debugged has been started already.
> > > > > Start it from the beginning? (y or n) y
> > > > > 
> > > > > Starting program: /usr/local/openmpi-1.9.0_64_gcc/bin/mpiexec -np 1 
> > > > > java 
> > > > > InitFinalizeMain
> > > > > [Thread debugging using libthread_db enabled]
> > > > > [New Thread 1 (LWP 1)]
> > > > > [New LWP    2        ]
> > > > > [Switching to Thread 1 (LWP 1)]
> > > > > 
> > > > > Breakpoint 1, orterun (argc=5, argv=0xffffffff7fffe0d8)
> > > > >     at 
> ../../../../openmpi-dev-124-g91e9686/orte/tools/orterun/orterun.c:1081
> > > > > 1081        rc = orte_plm.spawn(jdata);
> > > > > (gdb) print jdata
> > > > > $1 = (orte_job_t *) 0x100125250
> > > > > (gdb) s
> > > > > rsh_launch (jdata=0x100125250)
> > > > >     at 
> > > > > 
> ../../../../../openmpi-dev-124-g91e9686/orte/mca/plm/rsh/plm_rsh_module.c:876
> > > > > 876         if (ORTE_FLAG_TEST(jdata, ORTE_JOB_FLAG_RESTART)) {
> > > > > (gdb) s    
> > > > > 881             ORTE_ACTIVATE_JOB_STATE(jdata, ORTE_JOB_STATE_INIT);
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:122
> > > > > 122         if (NULL == name) {
> > > > > (gdb) 
> > > > > 142         job = orte_util_print_jobids(name->jobid);
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=2502885376) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:170
> > > > > 170         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffd990)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=2502885376) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:172
> > > > > 172         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 178         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 182         if (ORTE_JOBID_INVALID == job) {
> > > > > (gdb) 
> > > > > 184         } else if (ORTE_JOBID_WILDCARD == job) {
> > > > > (gdb) 
> > > > > 187             tmp1 = ORTE_JOB_FAMILY((unsigned long)job);
> > > > > (gdb) 
> > > > > 188             tmp2 = ORTE_LOCAL_JOBID((unsigned long)job);
> > > > > (gdb) 
> > > > > 189             snprintf(ptr->buffers[ptr->cntr++], 
> > > > > (gdb) 
> > > > > 193         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 194     }
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:143
> > > > > 143         vpid = orte_util_print_vpids(name->vpid);
> > > > > (gdb) 
> > > > > orte_util_print_vpids (vpid=0) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:260
> > > > > 260         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffd9a0)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_vpids (vpid=0) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:262
> > > > > 262         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 268         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 272         if (ORTE_VPID_INVALID == vpid) {
> > > > > (gdb) 
> > > > > 274         } else if (ORTE_VPID_WILDCARD == vpid) {
> > > > > (gdb) 
> > > > > 277             snprintf(ptr->buffers[ptr->cntr++], 
> > > > > (gdb) 
> > > > > 281         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 282     }
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:146
> > > > > 146         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffda60)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:148
> > > > > 148         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 154         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 158         snprintf(ptr->buffers[ptr->cntr++], 
> > > > > (gdb) 
> > > > > 162         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 163     }
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=4294967295) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:170
> > > > > 170         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffda60)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=4294967295) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:172
> > > > > 172         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 178         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 182         if (ORTE_JOBID_INVALID == job) {
> > > > > (gdb) 
> > > > > 183             snprintf(ptr->buffers[ptr->cntr++], 
> > > > > ORTE_PRINT_NAME_ARGS_MAX_SIZE, "[INVALID]");
> > > > > (gdb) 
> > > > > 193         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 194     }
> > > > > (gdb) 
> > > > > orte_job_state_to_str (state=1) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/error_strings.c:217
> > > > > 217         switch(state) {
> > > > > (gdb) 
> > > > > 221             return "PENDING INIT";
> > > > > (gdb) 
> > > > > 317     }
> > > > > (gdb) 
> > > > > opal_output_verbose (level=1, output_id=0, 
> > > > >     format=0xffffffff7f14dd98 <orte_job_states> 
> > > > > "\336\257\276\355\336\257\276\355")
> > > > >     at ../../../openmpi-dev-124-g91e9686/opal/util/output.c:373
> > > > > 373             va_start(arglist, format);
> > > > > (gdb) 
> > > > > 369     {
> > > > > (gdb) 
> > > > > 370         if (output_id >= 0 && output_id < OPAL_OUTPUT_MAX_STREAMS 
> > > > > &&
> > > > > (gdb) 
> > > > > 377     }
> > > > > (gdb) 
> > > > > orte_state_base_activate_job_state (jdata=0x100125250, state=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c:33
> > > > > 33          opal_list_item_t *itm, *any=NULL, *error=NULL;
> > > > > (gdb) 
> > > > > 37          for (itm = opal_list_get_first(&orte_job_states);
> > > > > (gdb) 
> > > > > opal_list_get_first (list=0xffffffff7f14dd98 <orte_job_states>)
> > > > >     at ../../../../openmpi-dev-124-g91e9686/opal/class/opal_list.h:320
> > > > > 320         opal_list_item_t* item = 
> > > > > (opal_list_item_t*)list->opal_list_sentinel.opal_list_next;
> > > > > (gdb) 
> > > > > 324         assert(1 == item->opal_list_item_refcount);
> > > > > (gdb) 
> > > > > 325         assert( list == item->opal_list_item_belong_to );
> > > > > (gdb) 
> > > > > 328         return item;
> > > > > (gdb) 
> > > > > 329     }
> > > > > (gdb) 
> > > > > orte_state_base_activate_job_state (jdata=0x100125250, state=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c:38
> > > > > 38               itm != opal_list_get_end(&orte_job_states);
> > > > > (gdb) 
> > > > > opal_list_get_end (list=0xffffffff7f14dd98 <orte_job_states>)
> > > > >     at ../../../../openmpi-dev-124-g91e9686/opal/class/opal_list.h:399
> > > > > 399         return &(list->opal_list_sentinel);
> > > > > (gdb) 
> > > > > 400     }
> > > > > (gdb) 
> > > > > orte_state_base_activate_job_state (jdata=0x100125250, state=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c:37
> > > > > 37          for (itm = opal_list_get_first(&orte_job_states);
> > > > > (gdb) 
> > > > > 40              s = (orte_state_t*)itm;
> > > > > (gdb) 
> > > > > 41              if (s->job_state == ORTE_JOB_STATE_ANY) {
> > > > > (gdb) 
> > > > > 45              if (s->job_state == ORTE_JOB_STATE_ERROR) {
> > > > > (gdb) 
> > > > > 48              if (s->job_state == state) {
> > > > > (gdb) 
> > > > > 49                  OPAL_OUTPUT_VERBOSE((1, 
> > > > > orte_state_base_framework.framework_output,
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:122
> > > > > 122         if (NULL == name) {
> > > > > (gdb) 
> > > > > 142         job = orte_util_print_jobids(name->jobid);
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=2502885376) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:170
> > > > > 170         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffd880)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=2502885376) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:172
> > > > > 172         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 178         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 182         if (ORTE_JOBID_INVALID == job) {
> > > > > (gdb) 
> > > > > 184         } else if (ORTE_JOBID_WILDCARD == job) {
> > > > > (gdb) 
> > > > > 187             tmp1 = ORTE_JOB_FAMILY((unsigned long)job);
> > > > > (gdb) 
> > > > > 188             tmp2 = ORTE_LOCAL_JOBID((unsigned long)job);
> > > > > (gdb) 
> > > > > 189             snprintf(ptr->buffers[ptr->cntr++], 
> > > > > (gdb) 
> > > > > 193         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 194     }
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:143
> > > > > 143         vpid = orte_util_print_vpids(name->vpid);
> > > > > (gdb) 
> > > > > orte_util_print_vpids (vpid=0) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:260
> > > > > 260         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffd890)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_vpids (vpid=0) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:262
> > > > > 262         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 268         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 272         if (ORTE_VPID_INVALID == vpid) {
> > > > > (gdb) 
> > > > > 274         } else if (ORTE_VPID_WILDCARD == vpid) {
> > > > > (gdb) 
> > > > > 277             snprintf(ptr->buffers[ptr->cntr++], 
> > > > > (gdb) 
> > > > > 281         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 282     }
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:146
> > > > > 146         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffd950)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_name_args (name=0x100118380 <orte_process_info+104>)
> > > > >     at ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:148
> > > > > 148         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 154         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 158         snprintf(ptr->buffers[ptr->cntr++], 
> > > > > (gdb) 
> > > > > 162         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 163     }
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=4294967295) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:170
> > > > > 170         ptr = get_print_name_buffer();
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:92
> > > > > 92          if (!fns_init) {
> > > > > (gdb) 
> > > > > 101         ret = opal_tsd_getspecific(print_args_tsd_key, 
> (void**)&ptr);
> > > > > (gdb) 
> > > > > opal_tsd_getspecific (key=1, valuep=0xffffffff7fffd950)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/threads/tsd.h:163
> > > > > 163         *valuep = pthread_getspecific(key);
> > > > > (gdb) 
> > > > > 164         return OPAL_SUCCESS;
> > > > > (gdb) 
> > > > > 165     }
> > > > > (gdb) 
> > > > > get_print_name_buffer () at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:102
> > > > > 102         if (OPAL_SUCCESS != ret) return NULL;
> > > > > (gdb) 
> > > > > 104         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 113         return (orte_print_args_buffers_t*) ptr;
> > > > > (gdb) 
> > > > > 114     }
> > > > > (gdb) 
> > > > > orte_util_print_jobids (job=4294967295) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/name_fns.c:172
> > > > > 172         if (NULL == ptr) {
> > > > > (gdb) 
> > > > > 178         if (ORTE_PRINT_NAME_ARG_NUM_BUFS == ptr->cntr) {
> > > > > (gdb) 
> > > > > 182         if (ORTE_JOBID_INVALID == job) {
> > > > > (gdb) 
> > > > > 183             snprintf(ptr->buffers[ptr->cntr++], 
> > > > > ORTE_PRINT_NAME_ARGS_MAX_SIZE, "[INVALID]");
> > > > > (gdb) 
> > > > > 193         return ptr->buffers[ptr->cntr-1];
> > > > > (gdb) 
> > > > > 194     }
> > > > > (gdb) 
> > > > > orte_job_state_to_str (state=1) at 
> > > > > ../../openmpi-dev-124-g91e9686/orte/util/error_strings.c:217
> > > > > 217         switch(state) {
> > > > > (gdb) 
> > > > > 221             return "PENDING INIT";
> > > > > (gdb) 
> > > > > 317     }
> > > > > (gdb) 
> > > > > opal_output_verbose (level=1, output_id=-1, format=0x1 <Address 0x1 
> > > > > out 
> of 
> > > > > bounds>)
> > > > >     at ../../../openmpi-dev-124-g91e9686/opal/util/output.c:373
> > > > > 373             va_start(arglist, format);
> > > > > (gdb) 
> > > > > 369     {
> > > > > (gdb) 
> > > > > 370         if (output_id >= 0 && output_id < OPAL_OUTPUT_MAX_STREAMS 
> > > > > &&
> > > > > (gdb) 
> > > > > 377     }
> > > > > (gdb) 
> > > > > orte_state_base_activate_job_state (jdata=0x100125250, state=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c:54
> > > > > 54                  if (NULL == s->cbfunc) {
> > > > > (gdb) 
> > > > > 62                  caddy = OBJ_NEW(orte_state_caddy_t);
> > > > > (gdb) 
> > > > > opal_obj_new_debug (type=0xffffffff7f14c7d8 
> > > > > <orte_state_caddy_t_class>, 
> > > > >     file=0xffffffff7f034c08 
> > > > > 
> "../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c", 
> > > > > line=62) at 
> ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:249
> > > > > 249         opal_object_t* object = opal_obj_new(type);
> > > > > (gdb) 
> > > > > opal_obj_new (cls=0xffffffff7f14c7d8 <orte_state_caddy_t_class>)
> > > > >     at 
> > > > > ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:465
> > > > > 465         assert(cls->cls_sizeof >= sizeof(opal_object_t));
> > > > > (gdb) 
> > > > > 470         object = (opal_object_t *) malloc(cls->cls_sizeof);
> > > > > (gdb) 
> > > > > 472         if (0 == cls->cls_initialized) {
> > > > > (gdb) 
> > > > > 473             opal_class_initialize(cls);
> > > > > (gdb) 
> > > > > opal_class_initialize (cls=0xffffffff7f14c7d8 
> <orte_state_caddy_t_class>)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:79
> > > > > 79          assert(cls);
> > > > > (gdb) 
> > > > > 84          if (1 == cls->cls_initialized) {
> > > > > (gdb) 
> > > > > 87          opal_atomic_lock(&class_lock);
> > > > > (gdb) 
> > > > > opal_atomic_lock (lock=0xffffffff7ee89bf0 <class_lock>)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/atomic_impl.h:397
> > > > > 397        while( !opal_atomic_cmpset_acq_32( &(lock->u.lock),
> > > > > (gdb) 
> > > > > opal_atomic_cmpset_acq_32 (addr=0xffffffff7ee89bf0 <class_lock>, 
> oldval=0, 
> > > > > newval=1)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:107
> > > > > 107        rc = opal_atomic_cmpset_32(addr, oldval, newval);
> > > > > (gdb) 
> > > > > opal_atomic_cmpset_32 (addr=0xffffffff7ee89bf0 <class_lock>, 
> > > > > oldval=0, 
> newval=1)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:93
> > > > > 93         int32_t ret = newval;
> > > > > (gdb) 
> > > > > 95         __asm__ __volatile__("casa [%1] " ASI_P ", %2, %0"
> > > > > (gdb) 
> > > > > 98         return (ret == oldval);
> > > > > (gdb) 
> > > > > 99      }
> > > > > (gdb) 
> > > > > opal_atomic_cmpset_acq_32 (addr=0xffffffff7ee89bf0 <class_lock>, 
> oldval=0, 
> > > > > newval=1)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:108
> > > > > 108        opal_atomic_rmb();
> > > > > (gdb) 
> > > > > opal_atomic_rmb () at 
> > > > > ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:63
> > > > > 63          MEMBAR("#LoadLoad");
> > > > > (gdb) 
> > > > > 64      }
> > > > > (gdb) 
> > > > > opal_atomic_cmpset_acq_32 (addr=0xffffffff7ee89bf0 <class_lock>, 
> oldval=0, 
> > > > > newval=1)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:110
> > > > > 110        return rc;
> > > > > (gdb) 
> > > > > 111     }
> > > > > (gdb) 
> > > > > opal_atomic_lock (lock=0xffffffff7ee89bf0 <class_lock>)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/atomic_impl.h:403
> > > > > 403     }
> > > > > (gdb) 
> > > > > opal_class_initialize (cls=0xffffffff7f14c7d8 
> <orte_state_caddy_t_class>)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:93
> > > > > 93          if (1 == cls->cls_initialized) {
> > > > > (gdb) 
> > > > > 103         cls->cls_depth = 0;
> > > > > (gdb) 
> > > > > 104         cls_construct_array_count = 0;
> > > > > (gdb) 
> > > > > 105         cls_destruct_array_count  = 0;
> > > > > (gdb) 
> > > > > 106         for (c = cls; c; c = c->cls_parent) {
> > > > > (gdb) 
> > > > > 107             if( NULL != c->cls_construct ) {
> > > > > (gdb) 
> > > > > 108                 cls_construct_array_count++;
> > > > > (gdb) 
> > > > > 110             if( NULL != c->cls_destruct ) {
> > > > > (gdb) 
> > > > > 111                 cls_destruct_array_count++;
> > > > > (gdb) 
> > > > > 113             cls->cls_depth++;
> > > > > (gdb) 
> > > > > 106         for (c = cls; c; c = c->cls_parent) {
> > > > > (gdb) 
> > > > > 107             if( NULL != c->cls_construct ) {
> > > > > (gdb) 
> > > > > 110             if( NULL != c->cls_destruct ) {
> > > > > (gdb) 
> > > > > 113             cls->cls_depth++;
> > > > > (gdb) 
> > > > > 106         for (c = cls; c; c = c->cls_parent) {
> > > > > (gdb) 
> > > > > 122             (void 
> (**)(opal_object_t*))malloc((cls_construct_array_count +
> > > > > (gdb) 
> > > > > 123                                                
> cls_destruct_array_count + 2) 
> > > > > *
> > > > > (gdb) 
> > > > > 122             (void 
> (**)(opal_object_t*))malloc((cls_construct_array_count +
> > > > > (gdb) 
> > > > > 121         cls->cls_construct_array = 
> > > > > (gdb) 
> > > > > 125         if (NULL == cls->cls_construct_array) {
> > > > > (gdb) 
> > > > > 130             cls->cls_construct_array + cls_construct_array_count 
> > > > > + 
> 1;
> > > > > (gdb) 
> > > > > 129         cls->cls_destruct_array =
> > > > > (gdb) 
> > > > > 136         cls_construct_array = cls->cls_construct_array + 
> > > > > cls_construct_array_count;
> > > > > (gdb) 
> > > > > 137         cls_destruct_array  = cls->cls_destruct_array;
> > > > > (gdb) 
> > > > > 139         c = cls;
> > > > > (gdb) 
> > > > > 140         *cls_construct_array = NULL;  /* end marker for the 
> constructors */
> > > > > (gdb) 
> > > > > 141         for (i = 0; i < cls->cls_depth; i++) {
> > > > > (gdb) 
> > > > > 142             if( NULL != c->cls_construct ) {
> > > > > (gdb) 
> > > > > 143                 --cls_construct_array;
> > > > > (gdb) 
> > > > > 144                 *cls_construct_array = c->cls_construct;
> > > > > (gdb) 
> > > > > 146             if( NULL != c->cls_destruct ) {
> > > > > (gdb) 
> > > > > 147                 *cls_destruct_array = c->cls_destruct;
> > > > > (gdb) 
> > > > > 148                 cls_destruct_array++;
> > > > > (gdb) 
> > > > > 150             c = c->cls_parent;
> > > > > (gdb) 
> > > > > 141         for (i = 0; i < cls->cls_depth; i++) {
> > > > > (gdb) 
> > > > > 142             if( NULL != c->cls_construct ) {
> > > > > (gdb) 
> > > > > 146             if( NULL != c->cls_destruct ) {
> > > > > (gdb) 
> > > > > 150             c = c->cls_parent;
> > > > > (gdb) 
> > > > > 141         for (i = 0; i < cls->cls_depth; i++) {
> > > > > (gdb) 
> > > > > 152         *cls_destruct_array = NULL;  /* end marker for the 
> destructors */
> > > > > (gdb) 
> > > > > 154         cls->cls_initialized = 1;
> > > > > (gdb) 
> > > > > 155         save_class(cls);
> > > > > (gdb) 
> > > > > save_class (cls=0xffffffff7f14c7d8 <orte_state_caddy_t_class>)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:188
> > > > > 188         if (num_classes >= max_classes) {
> > > > > (gdb) 
> > > > > 189             expand_array();
> > > > > (gdb) 
> > > > > expand_array () at 
> ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:201
> > > > > 201         max_classes += increment;
> > > > > (gdb) 
> > > > > 202         classes = (void**)realloc(classes, sizeof(opal_class_t*) 
> > > > > * 
> > > > > max_classes);
> > > > > (gdb) 
> > > > > 203         if (NULL == classes) {
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 208             classes[i] = NULL;
> > > > > (gdb) 
> > > > > 207         for (i = num_classes; i < max_classes; ++i) {
> > > > > (gdb) 
> > > > > 210     }
> > > > > (gdb) 
> > > > > save_class (cls=0xffffffff7f14c7d8 <orte_state_caddy_t_class>)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:192
> > > > > 192         classes[num_classes] = cls->cls_construct_array;
> > > > > (gdb) 
> > > > > 193         ++num_classes;
> > > > > (gdb) 
> > > > > 194     }
> > > > > (gdb) 
> > > > > opal_class_initialize (cls=0xffffffff7f14c7d8 
> <orte_state_caddy_t_class>)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:159
> > > > > 159         opal_atomic_unlock(&class_lock);
> > > > > (gdb) 
> > > > > opal_atomic_unlock (lock=0xffffffff7ee89bf0 <class_lock>)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/atomic_impl.h:409
> > > > > 409        opal_atomic_wmb();
> > > > > (gdb) 
> > > > > opal_atomic_wmb () at 
> > > > > ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:69
> > > > > 69          MEMBAR("#StoreStore");
> > > > > (gdb) 
> > > > > 70      }
> > > > > (gdb) 
> > > > > opal_atomic_unlock (lock=0xffffffff7ee89bf0 <class_lock>)
> > > > >     at 
> ../../openmpi-dev-124-g91e9686/opal/include/opal/sys/atomic_impl.h:410
> > > > > 410        lock->u.lock=OPAL_ATOMIC_UNLOCKED;
> > > > > (gdb) 
> > > > > 411     }
> > > > > (gdb) 
> > > > > opal_class_initialize (cls=0xffffffff7f14c7d8 
> <orte_state_caddy_t_class>)
> > > > >     at ../../openmpi-dev-124-g91e9686/opal/class/opal_object.c:160
> > > > > 160     }
> > > > > (gdb) 
> > > > > opal_obj_new (cls=0xffffffff7f14c7d8 <orte_state_caddy_t_class>)
> > > > >     at 
> > > > > ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:475
> > > > > 475         if (NULL != object) {
> > > > > (gdb) 
> > > > > 476             object->obj_class = cls;
> > > > > (gdb) 
> > > > > 477             object->obj_reference_count = 1;
> > > > > (gdb) 
> > > > > 478             opal_obj_run_constructors(object);
> > > > > (gdb) 
> > > > > opal_obj_run_constructors (object=0x1001bfcf0)
> > > > >     at 
> > > > > ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:420
> > > > > 420         assert(NULL != object->obj_class);
> > > > > (gdb) 
> > > > > 422         cls_construct = object->obj_class->cls_construct_array;
> > > > > (gdb) 
> > > > > 423         while( NULL != *cls_construct ) {
> > > > > (gdb) 
> > > > > 424             (*cls_construct)(object);
> > > > > (gdb) 
> > > > > orte_state_caddy_construct (caddy=0x1001bfcf0)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_frame.c:84
> > > > > 84          memset(&caddy->ev, 0, sizeof(opal_event_t));
> > > > > (gdb) 
> > > > > 85          caddy->jdata = NULL;
> > > > > (gdb) 
> > > > > 86      }
> > > > > (gdb) 
> > > > > opal_obj_run_constructors (object=0x1001bfcf0)
> > > > >     at 
> > > > > ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:425
> > > > > 425             cls_construct++;
> > > > > (gdb) 
> > > > > 423         while( NULL != *cls_construct ) {
> > > > > (gdb) 
> > > > > 427     }
> > > > > (gdb) 
> > > > > opal_obj_new (cls=0xffffffff7f14c7d8 <orte_state_caddy_t_class>)
> > > > >     at 
> > > > > ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:480
> > > > > 480         return object;
> > > > > (gdb) 
> > > > > 481     }
> > > > > (gdb) 
> > > > > opal_obj_new_debug (type=0xffffffff7f14c7d8 
> > > > > <orte_state_caddy_t_class>, 
> > > > >     file=0xffffffff7f034c08 
> > > > > 
> "../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c", 
> > > > > line=62) at 
> ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:250
> > > > > 250         object->obj_magic_id = OPAL_OBJ_MAGIC_ID;
> > > > > (gdb) 
> > > > > 251         object->cls_init_file_name = file;
> > > > > (gdb) 
> > > > > 252         object->cls_init_lineno = line;
> > > > > (gdb) 
> > > > > 253         return object;
> > > > > (gdb) 
> > > > > 254     }
> > > > > (gdb) 
> > > > > orte_state_base_activate_job_state (jdata=0x100125250, state=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c:63
> > > > > 63                  if (NULL != jdata) {
> > > > > (gdb) 
> > > > > 64                      caddy->jdata = jdata;
> > > > > (gdb) 
> > > > > 65                      caddy->job_state = state;
> > > > > (gdb) 
> > > > > 66                      OBJ_RETAIN(jdata);
> > > > > (gdb) 
> > > > > opal_obj_update (inc=1, object=0x100125250)
> > > > >     at 
> > > > > ../../../../openmpi-dev-124-g91e9686/opal/class/opal_object.h:497
> > > > > 497         return opal_atomic_add_32(&(object->obj_reference_count), 
> inc);
> > > > > (gdb) 
> > > > > opal_atomic_add_32 (addr=0x100125260, delta=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/opal/include/opal/sys/atomic_impl.h:63
> > > > > 63            oldval = *addr;
> > > > > (gdb) 
> > > > > 64         } while (0 == opal_atomic_cmpset_32(addr, oldval, oldval + 
> delta));
> > > > > (gdb) 
> > > > > opal_atomic_cmpset_32 (addr=0x100125260, oldval=1, newval=2)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/opal/include/opal/sys/sparcv9/atomic.h:93
> > > > > 93         int32_t ret = newval;
> > > > > (gdb) 
> > > > > 95         __asm__ __volatile__("casa [%1] " ASI_P ", %2, %0"
> > > > > (gdb) 
> > > > > 98         return (ret == oldval);
> > > > > (gdb) 
> > > > > 99      }
> > > > > (gdb) 
> > > > > opal_atomic_add_32 (addr=0x100125260, delta=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/opal/include/opal/sys/atomic_impl.h:65
> > > > > 65         return (oldval + delta);
> > > > > (gdb) 
> > > > > 66      }
> > > > > (gdb) 
> > > > > orte_state_base_activate_job_state (jdata=0x100125250, state=1)
> > > > >     at 
> > > > > 
> ../../../../openmpi-dev-124-g91e9686/orte/mca/state/base/state_base_fns.c:66
> > > > > 66                      OBJ_RETAIN(jdata);
> > > > > (gdb) 
> > > > > 68                  opal_event_set(orte_event_base, &caddy->ev, -1, 
> > > > > OPAL_EV_WRITE, s->cbfunc, caddy);
> > > > > (gdb) 
> > > > > 69                  opal_event_set_priority(&caddy->ev, s->priority);
> > > > > (gdb) 
> > > > > 70                  opal_event_active(&caddy->ev, OPAL_EV_WRITE, 1);
> > > > > (gdb) 
> > > > > 71                  return;
> > > > > (gdb) 
> > > > > 105     }
> > > > > (gdb) 
> > > > > rsh_launch (jdata=0x100125250)
> > > > >     at 
> > > > > 
> ../../../../../openmpi-dev-124-g91e9686/orte/mca/plm/rsh/plm_rsh_module.c:883
> > > > > 883         return ORTE_SUCCESS;
> > > > > (gdb) 
> > > > > 884     }
> > > > > (gdb) 
> > > > > orterun (argc=5, argv=0xffffffff7fffe0d8)
> > > > >     at 
> ../../../../openmpi-dev-124-g91e9686/orte/tools/orterun/orterun.c:1084
> > > > > 1084        while (orte_event_base_active) {
> > > > > (gdb) 
> > > > > 1085            opal_event_loop(orte_event_base, OPAL_EVLOOP_ONCE);
> > > > > (gdb) 
> > > > > 1084        while (orte_event_base_active) {
> > > > > (gdb) 
> > > > > 1085            opal_event_loop(orte_event_base, OPAL_EVLOOP_ONCE);
> > > > > (gdb) 
> > > > > 1084        while (orte_event_base_active) {
> > > > > (gdb) 
> > > > > 1085            opal_event_loop(orte_event_base, OPAL_EVLOOP_ONCE);
> > > > > (gdb) 
> > > > > #
> > > > > # A fatal error has been detected by the Java Runtime Environment:
> > > > > #
> > > > > #  SIGSEGV (0xb) at pc=0xffffffff7ea3c7f0, pid=13080, tid=2
> > > > > #
> > > > > # JRE version: Java(TM) SE Runtime Environment (8.0-b132) (build 
> 1.8.0-b132)
> > > > > # Java VM: Java HotSpot(TM) 64-Bit Server VM (25.0-b70 mixed mode 
> solaris-sparc 
> > > > > compressed oops)
> > > > > # Problematic frame:
> > > > > # 1084      while (orte_event_base_active) {
> > > > > (gdb) 
> > > > > 1085            opal_event_loop(orte_event_base, OPAL_EVLOOP_ONCE);
> > > > > (gdb) 
> > > > > C  [libc.so.1+0x3c7f0]  strlen+0x50
> > > > > #
> > > > > # Failed to write core dump. Core dumps have been disabled. To enable 
> core 
> > > > > dumping, try "ulimit -c unlimited" before starting Java again
> > > > > #
> > > > > # An error report file with more information is saved as:
> > > > > # 
> /home/fd1026/work/skripte/master/parallel/prog/mpi/java/hs_err_pid13080.log
> > > > > #
> > > > > # If you would like to submit a bug report, please visit:
> > > > > #   http://bugreport.sun.com/bugreport/crash.jsp
> > > > > # The crash happened outside the Java Virtual Machine in native code.
> > > > > # See problematic frame for where to report the bug.
> > > > > #
> > > > > 
> --------------------------------------------------------------------------
> > > > > mpiexec noticed that process rank 0 with PID 0 on node tyr exited on 
> signal 6 
> > > > > (Abort).
> > > > > 
> --------------------------------------------------------------------------
> > > > > 1084        while (orte_event_base_active) {
> > > > > (gdb) 
> > > > > 1089        orte_odls.kill_local_procs(NULL);
> > > > > (gdb) 
> > > > > 
> > > > > 
> > > > > Thank you very much for any help in advance.
> > > > > 
> > > > > Kind regards
> > > > > 
> > > > > Siegmar

Reply via email to