hi - I have an application that consistently segfault when I do "mpirun --oversubscribe" and the following message came AFTER application runs. My running environment: MacOS with openmpi 3.1.2.
Is this a problme with my application? or my environment? any help? thanks Oliver -------------------------------------------------------------------------- A system call failed during shared memory initialization that should not have. It is likely that your MPI job will now either abort or experience performance degradation. Local host: pi.local System call: unlink(2) /var/folders/h2/ph7pgd4n3_z9v2pd0hk5nc6w0000gn/T//ompi.pi.501/pid.45364/1/vader_segment.pi.c1c00001.7 Error: No such file or directory (errno 2) -------------------------------------------------------------------------- mpirun(45364,0x70000e1c9000) malloc: *** mach_vm_map(size=1125899906846720) failed (error code=3) *** error: can't allocate region *** set a breakpoint in malloc_error_break to debug [pi:45364] *** Process received signal *** [pi:45364] Signal: Segmentation fault: 11 (11) [pi:45364] Signal code: Address not mapped (1) [pi:45364] Failing at address: 0x0 [pi:45364] [ 0] 0 libsystem_platform.dylib 0x00007fff7d999f5a _sigtramp + 26 [pi:45364] [ 1] 0 ??? 0x000000002d595060 0x0 + 760828000 [pi:45364] [ 2] 0 mca_rml_oob.so 0x0000000103aeadaf orte_rml_oob_send_buffer_nb + 956 [pi:45364] [ 3] 0 libopen-rte.40.dylib 0x000000010357d0fa pmix_server_log_fn + 449 [pi:45364] [ 4] 0 mca_pmix_pmix2x.so 0x000000010394f6d6 server_log + 857 [pi:45364] [ 5] 0 mca_pmix_pmix2x.so 0x0000000103982d42 pmix_server_log + 1257 [pi:45364] [ 6] 0 mca_pmix_pmix2x.so 0x00000001039731e0 server_message_handler + 5032 [pi:45364] [ 7] 0 mca_pmix_pmix2x.so 0x00000001039a9822 pmix_ptl_base_process_msg + 723 [pi:45364] [ 8] 0 libevent-2.1.6.dylib 0x00000001036b6719 event_process_active_single_queue + 376 [pi:45364] [ 9] 0 libevent-2.1.6.dylib 0x00000001036b3cb3 event_base_loop + 1074 [pi:45364] [10] 0 mca_pmix_pmix2x.so 0x0000000103988ce7 progress_engine + 26 [pi:45364] [11] 0 libsystem_pthread.dylib 0x00007fff7d9a3661 _pthread_body + 340 [pi:45364] [12] 0 libsystem_pthread.dylib 0x00007fff7d9a350d _pthread_body + 0 [pi:45364] [13] 0 libsystem_pthread.dylib 0x00007fff7d9a2bf9 thread_start + 13 [pi:45364] *** End of error message *** Segmentation fault: 11 -- Oliver
_______________________________________________ users mailing list users@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/users