Re: [OMPI users] strange bug

2009-05-12 Thread Anton Starikov
I will try to prepare a test case. -- Anton Starikov. On May 12, 2009, at 6:57 PM, Edgar Gabriel wrote: hm, so I am out of ideas. I created multiple variants of test-programs which did what you basically described, and they all passed and did not generate problems. I compiled the MUMPS libr

Re: [OMPI users] strange bug

2009-05-12 Thread Edgar Gabriel
hm, so I am out of ideas. I created multiple variants of test-programs which did what you basically described, and they all passed and did not generate problems. I compiled the MUMPS library and ran the tests that they have in the examples directory, and they all worked. Additionally, I checke

Re: [OMPI users] strange bug

2009-05-12 Thread Edgar Gabriel
I would say the probability is large that it is due to the recent 'fix'. I will try to create a testcase similar to what you suggested. Could you give us maybe some hints on which functionality of MUMPS you are using, or even share the code or a code fragment? Thanks Edgar Jeff Squyres wrote:

Re: [OMPI users] strange bug

2009-05-12 Thread Jeff Squyres
Hey Edgar -- Could this have anything to do with your recent fixes? On May 12, 2009, at 8:30 AM, Anton Starikov wrote: hostfile from torque PBS_NODEFILE (OMPI is compiled with torque support) It happens with or without rankfile. Started with mpirun -np 16 ./somecode mca parameters: btl = s

Re: [OMPI users] strange bug

2009-05-12 Thread Anton Starikov
hostfile from torque PBS_NODEFILE (OMPI is compiled with torque support) It happens with or without rankfile. Started with mpirun -np 16 ./somecode mca parameters: btl = self,sm,openib mpi_maffinity_alone = 1 rmaps_base_no_oversubscribe = 1 (rmaps_base_no_oversubscribe = 0 doesn't change i

Re: [OMPI users] strange bug

2009-05-12 Thread Jeff Squyres
Can you send all the information listed here: http://www.open-mpi.org/community/help/ On May 11, 2009, at 10:03 PM, Anton Starikov wrote: By the way, this is Fortran code, which uses F77 bindings. -- Anton Starikov. On May 12, 2009, at 3:06 AM, Anton Starikov wrote: Due to rankfile

Re: [OMPI users] strange bug

2009-05-11 Thread Anton Starikov
By the way, this is Fortran code, which uses F77 bindings. -- Anton Starikov. On May 12, 2009, at 3:06 AM, Anton Starikov wrote: Due to rankfile fixes I switched to SVN r21208, now my code dies with error [node037:20519] *** An error occurred in MPI_Comm_dup [node037:20519] *** on communic

[OMPI users] strange bug

2009-05-11 Thread Anton Starikov
Due to rankfile fixes I switched to SVN r21208, now my code dies with error [node037:20519] *** An error occurred in MPI_Comm_dup [node037:20519] *** on communicator MPI COMMUNICATOR 32 SPLIT FROM 4 [node037:20519] *** MPI_ERR_INTERN: internal error [node037:20519] *** MPI_ERRORS_ARE_FATAL (you