FWIW, the stack trace is telling you that it segv'ed in strlen(), called from 
a printf() in the main() function of your application.  If it dumped core, you 
can load the core file in a debugger and see exactly where it died.
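
A strlen() frame directly under the printf frames, together with a failing 
address as low as 0x44, is consistent with a bad pointer reaching a %s 
conversion (for example an integer or an uninitialized char* passed where a 
string was expected). That is a guess from the trace, not something I can 
confirm without your source. If core dumps are enabled (ulimit -c unlimited) 
and the program was built with -g, running gdb on /home/path/exec plus the 
core file and typing "bt" should show the exact printf call. Below is a 
minimal sketch in C of guarding such a call; printable() and format_result() 
are hypothetical names, and the 4096-byte threshold is an assumption about the 
normally unmapped first page on Linux.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper (not from the original program): returns s if it
 * looks usable, otherwise a placeholder string. Addresses in the first
 * page, such as 0x44, are assumed to be unmapped. This is only a
 * heuristic: it cannot detect dangling or uninitialized pointers that
 * happen to hold a larger value. */
static const char *printable(const char *s)
{
    return ((uintptr_t)s < 4096) ? "(invalid pointer)" : s;
}

/* Hypothetical reporting function: formats one line into buf the way a
 * printf("%s") call in main() might, but guards the string argument.
 * Returns the number of characters snprintf would have written. */
static int format_result(char *buf, size_t len, int rank, const char *name)
{
    assert(buf != NULL && len > 0);
    return snprintf(buf, len, "rank %d: %s", rank, printable(name));
}
```

Calling format_result(buf, sizeof buf, rank, name) and then printing buf keeps 
the process alive long enough to log which rank produced the bad pointer. The 
real fix is still to find where that pointer comes from.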


On Jul 25, 2010, at 10:08 PM, Jack Bryan wrote:

> Dear All,
> 
> I run 6 parallel processes with Open MPI. 
> 
> When the run-time of the program is short, it works well.
> 
> But if the run-time is long, I get errors: 
> 
> [n124:45521] *** Process received signal ***
> [n124:45521] Signal: Segmentation fault (11)
> [n124:45521] Signal code: Address not mapped (1)
> [n124:45521] Failing at address: 0x44
> [n124:45521] [ 0] /lib64/libpthread.so.0 [0x3c50e0e4c0]
> [n124:45521] [ 1] /lib64/libc.so.6(strlen+0x10) [0x3c50278d60]
> [n124:45521] [ 2] /lib64/libc.so.6(_IO_vfprintf+0x4479) [0x3c50246b19]
> [n124:45521] [ 3] /lib64/libc.so.6(_IO_printf+0x9a) [0x3c5024d3aa]
> [n124:45521] [ 4] /home/path/exec [0x40ec9a]
> [n124:45521] [ 5] /lib64/libc.so.6(__libc_start_main+0xf4) [0x3c5021d974]
> [n124:45521] [ 6] /home/path/exec [0x401139]
> [n124:45521] *** End of error message ***
> 
> It seems that there may be a problem with memory management, 
> 
> but I cannot find the cause. 
> 
> My program needs to write results to some files. 
> 
> If I open too many files without closing them, I may get the above 
> errors. 
> 
> But I have removed the file writing from my program. 
> 
> The problem appears again when the program runs for a longer time. 
> 
> Any help is appreciated. 
> 
> Jack
> 
> July 25  2010
> 
> 


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/

