What Jeff is suggesting is probably Valgrind. However, in my experience, which is much less than that of most Open MPI developers, a simple code inspection is often adequate. Here are the steps:
1. If you don't already have one, build a debug version of your code. If you are using gcc, add -g to CFLAGS in your makefile for C programs (adding -g3 and taking out any -O flags is better).
2. Have your shell generate a core dump when the crash happens (in bash, for example, run ulimit -c unlimited before launching).
3. Launch gdb with the debug image and the core file.

I have had near 100% luck in detecting the source of SEGV-type crashes using the steps above, but your mileage may vary. If you are not familiar with gdb, you may be able to enlist someone local who is.

We learn from history that we never learn from history.

On Thu, May 10, 2018 at 5:47 AM, Ankita m <ankitamait...@gmail.com> wrote:
> ok...Thank you so much sir
>
> On Wed, May 9, 2018 at 11:13 PM, Jeff Squyres (jsquyres)
> <jsquy...@cisco.com> wrote:
>>
>> It looks like you're getting a segv when calling MPI_Comm_rank().
>>
>> This is quite unusual -- MPI_Comm_rank() is just a local lookup / return
>> of an integer. If MPI_Comm_rank() is seg faulting, it usually indicates
>> that there's some other kind of memory error in the application, and this
>> seg fault you're seeing is just a symptom -- it's not the real problem. It
>> may have worked with Intel MPI by chance, or for some reason, Intel MPI has
>> a different memory pattern than Open MPI and it didn't happen to trigger
>> this exact problem.
>>
>> You might want to run your application through a memory-checking debugger.
>>
>> > On May 9, 2018, at 11:39 AM, Ankita m <ankitamait...@gmail.com> wrote:
>> >
>> > yes. Because previously i was using intel-mpi. That time the program was
>> > running perfectly. Now when i use openmpi this shows this error
>> > files...Though i am not quite sure. I just thought if the issue will be for
>> > Openmpi then i could get some help here.
>> >
>> > On Wed, May 9, 2018 at 6:47 PM, Gilles Gouaillardet
>> > <gilles.gouaillar...@gmail.com> wrote:
>> > Ankita,
>> >
>> > Do you have any reason to suspect the root cause of the crash is Open
>> > MPI ?
>> >
>> > Cheers,
>> >
>> > Gilles
>> >
>> >
>> > On Wednesday, May 9, 2018, Ankita m <ankitamait...@gmail.com> wrote:
>> > MPI "Hello World" program is also working
>> >
>> > please see this error file attached below. its of a different program
>> >
>> > On Wed, May 9, 2018 at 4:10 PM, John Hearns via users
>> > <users@lists.open-mpi.org> wrote:
>> > Ankita, looks like your program is not launching correctly.
>> > I would try the following:
>> > define two hosts in a machinefile. Use mpirun -np 2 machinefile date
>> > Ie can you use mpirun just to run the command 'date'
>> >
>> > Secondly compile up and try to run an MPI 'Hello World' program
>> >
>> >
>> > On 9 May 2018 at 12:28, Ankita m <ankitamait...@gmail.com> wrote:
>> > I am using ompi -3.1.0 version in my program and compiler is mpicc
>> >
>> > its a parallel program which uses multiple nodes with 16 cores in each
>> > node.
>> >
>> > but its not working and generates a error file . i Have attached the
>> > error file below.
>> >
>> > can anyone please tell what is the issue actually
>> >
>> > _______________________________________________
>> > users mailing list
>> > users@lists.open-mpi.org
>> > https://lists.open-mpi.org/mailman/listinfo/users
>>
>>
>> --
>> Jeff Squyres
>> jsquy...@cisco.com
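
P.S. On step 2 of the list at the top of this message: if raising the core limit from the shell is not practical (on multi-node runs, the limit set in your login shell may not be the one the remote ranks end up with), one alternative is to raise it from inside the program. What follows is only a sketch, assuming Linux or another POSIX system. The function name enable_core_dumps is made up for illustration and is not part of Open MPI; it simply lifts the soft core-size limit to whatever the hard limit allows, using the standard getrlimit() and setrlimit() calls.

```c
#include <stdio.h>
#include <sys/resource.h>

/*
 * Hypothetical helper (not an Open MPI API): raise this process's soft
 * core-file size limit to its hard limit, similar in spirit to running
 * "ulimit -c" in the shell before launching.
 * Returns 0 on success, -1 if the limit could not be read or changed.
 */
int enable_core_dumps(void)
{
    struct rlimit lim;

    if (getrlimit(RLIMIT_CORE, &lim) != 0) {
        perror("getrlimit(RLIMIT_CORE)");
        return -1;
    }

    /* An unprivileged process may raise its soft limit up to the hard limit. */
    lim.rlim_cur = lim.rlim_max;

    if (setrlimit(RLIMIT_CORE, &lim) != 0) {
        perror("setrlimit(RLIMIT_CORE)");
        return -1;
    }

    if (lim.rlim_max == 0) {
        /* The call succeeded, but the administrator has disabled core files. */
        fprintf(stderr, "warning: hard core limit is 0; no core file will be written\n");
    }
    return 0;
}
```

Calling enable_core_dumps() near the top of main(), around MPI_Init(), should be enough for the crashing rank to leave a core file behind. Whether and where that file actually gets written still depends on system settings such as the hard limit and, on Linux, the kernel's core pattern, so check those if nothing shows up.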