Also see http://www.open-mpi.org/faq/?category=debugging.

On Jul 1, 2010, at 3:17 AM, Asad Ali wrote:

> Hi Jack,
> 
> Debugging OpenMPI with traditional debuggers is a pain.
> >From your error message it sounds that you have some memory allocation 
> >problem. Do you use dynamic memory allocation (allocate and then free)?
> 
> I use display (printf()) command with MPIrank command. It tells me which 
> thread is giving segmentation fault.
> 
> Cheers,
> 
> Asad
> 
> On Thu, Jul 1, 2010 at 4:13 PM, Jack Bryan <dtustud...@hotmail.com> wrote:
> thanks
> 
> I am not familiar with OpenMPI. 
> 
> Would you please help me with how to ask openMPI to show where the fault 
> occurs ?
> 
> GNU debuger ?
> 
> Any help is appreciated. 
> 
> thanks!!!
> 
> Jack 
> 
> June 30  2010
> 
> Date: Wed, 30 Jun 2010 16:13:09 -0400
> From: amja...@gmail.com
> To: us...@open-mpi.org
> Subject: Re: [OMPI users] Open MPI, Segmentation fault
> 
> 
> Based on my experiences, I would FULLY endorse (100% agree with) David Zhang.
> It is usually a coding or typo mistake.
> 
> At first, Ensure that array sizes and dimension are correct.
> 
> I experience that if openmpi is compiled with gnu compilers (not with Intel) 
> then it also point outs the subroutine exactly in which the fault occur. have 
> a try.
> 
> best,
> AA
> 
>   
> 
> On Wed, Jun 30, 2010 at 12:43 PM, David Zhang <solarbik...@gmail.com> wrote:
> When I got segmentation faults, it has always been my coding mistakes.  
> Perhaps your code is not robust against number of processes not divisible by 
> 2?
> 
> On Wed, Jun 30, 2010 at 8:47 AM, Jack Bryan <dtustud...@hotmail.com> wrote:
> Dear All,
> 
> I am using Open MPI, I got the error: 
> 
> n337:37664] *** Process received signal ***
> [n337:37664] Signal: Segmentation fault (11)
> [n337:37664] Signal code: Address not mapped (1)
> [n337:37664] Failing at address: 0x7fffcfe90000
> [n337:37664] [ 0] /lib64/libpthread.so.0 [0x3c50e0e4c0]
> [n337:37664] [ 1] /lustre/home/rhascheduler/RhaScheduler-0.4.1.1/mytest/nmn2 
> [0x414ed7]
> [n337:37664] [ 2] /lib64/libc.so.6(__libc_start_main+0xf4) [0x3c5021d974]
> [n337:37664] [ 3] 
> /lustre/home/rhascheduler/RhaScheduler-0.4.1.1/mytest/nmn2(__gxx_personality_v0+0x1f1)
>  [0x412139]
> [n337:37664] *** End of error message ***
> 
> After searching answers, it seems that some functions fail. 
>  
> My program can run well for 1,2,10 processors, but fail when the number of 
> tasks cannot
> be divided evenly by number of processes. 
> 
> Any help is appreciated. 
> 
> thanks
> 
> Jack
> 
> June 30  2010
> 
> 
> The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with 
> Hotmail. Get busy.
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
> 
> 
> 
> -- 
> David Zhang
> University of California, San Diego
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
> 
> 
> Hotmail has tools for the New Busy. Search, chat and e-mail from your inbox. 
> Learn more.
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
> 
> 
> 
> -- 
> "Statistical thinking will one day be as necessary for efficient citizenship 
> as the ability to read and write." - H.G. Wells
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/


Reply via email to