Hi Jack, Debugging OpenMPI with traditional debuggers is a pain. >From your error message it sounds that you have some memory allocation problem. Do you use dynamic memory allocation (allocate and then free)?
I use display (printf()) command with MPIrank command. It tells me which thread is giving segmentation fault. Cheers, Asad On Thu, Jul 1, 2010 at 4:13 PM, Jack Bryan <dtustud...@hotmail.com> wrote: > thanks > > I am not familiar with OpenMPI. > > Would you please help me with how to ask openMPI to show where the fault > occurs ? > > GNU debuger ? > > Any help is appreciated. > > thanks!!! > > Jack > > June 30 2010 > > ------------------------------ > Date: Wed, 30 Jun 2010 16:13:09 -0400 > From: amja...@gmail.com > To: us...@open-mpi.org > Subject: Re: [OMPI users] Open MPI, Segmentation fault > > > Based on my experiences, I would FULLY endorse (100% agree with) David > Zhang. > It is usually a coding or typo mistake. > > At first, Ensure that array sizes and dimension are correct. > > I experience that if openmpi is compiled with gnu compilers (not with > Intel) then it also point outs the subroutine exactly in which the fault > occur. have a try. > > best, > AA > > > > On Wed, Jun 30, 2010 at 12:43 PM, David Zhang <solarbik...@gmail.com>wrote: > > When I got segmentation faults, it has always been my coding mistakes. > Perhaps your code is not robust against number of processes not divisible by > 2? > > On Wed, Jun 30, 2010 at 8:47 AM, Jack Bryan <dtustud...@hotmail.com>wrote: > > Dear All, > > I am using Open MPI, I got the error: > > n337:37664] *** Process received signal *** > [n337:37664] Signal: Segmentation fault (11) > [n337:37664] Signal code: Address not mapped (1) > [n337:37664] Failing at address: 0x7fffcfe90000 > [n337:37664] [ 0] /lib64/libpthread.so.0 [0x3c50e0e4c0] > [n337:37664] [ 1] > /lustre/home/rhascheduler/RhaScheduler-0.4.1.1/mytest/nmn2 [0x414ed7] > [n337:37664] [ 2] /lib64/libc.so.6(__libc_start_main+0xf4) [0x3c5021d974] > [n337:37664] [ 3] > /lustre/home/rhascheduler/RhaScheduler-0.4.1.1/mytest/nmn2(__gxx_personality_v0+0x1f1) > [0x412139] > [n337:37664] *** End of error message *** > > After searching answers, it seems that some functions fail. > > My program can run well for 1,2,10 processors, but fail when the number of > tasks cannot > be divided evenly by number of processes. > > Any help is appreciated. > > thanks > > Jack > > June 30 2010 > > > ------------------------------ > The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with > Hotmail. Get > busy.<http://www.windowslive.com/campaign/thenewbusy?tile=multicalendar&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_5> > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > > -- > David Zhang > University of California, San Diego > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > ------------------------------ > Hotmail has tools for the New Busy. Search, chat and e-mail from your > inbox. Learn > more.<http://www.windowslive.com/campaign/thenewbusy?ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_1> > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > -- "Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write." - H.G. Wells