Re: [OMPI users] Bogus memcpy or bogus valgrind record

2009-04-29 Thread Jeff Squyres
On Apr 29, 2009, at 3:40 PM, Jed Brown wrote: > But I'm able to replicate your error (but shouldn't the 2nd buffer be > the 1st + size (not 2)?) -- let me dig into it a bit... we definitely > shouldn't be getting invalid writes in the convertor, etc. As Eugene pointed out earlier, it is fin

Re: [OMPI users] Bogus memcpy or bogus valgrind record

2009-04-29 Thread Jed Brown
Jeff Squyres wrote: > But I'm able to replicate your error (but shouldn't the 2nd buffer be > the 1st + size (not 2)?) -- let me dig into it a bit... we definitely > shouldn't be getting invalid writes in the convertor, etc. As Eugene pointed out earlier, it is fine. dataloctab = malloc (2 * (

Re: [OMPI users] Purify found bugs inside open-mpi library

2009-04-29 Thread Jeff Squyres
Per your mail on the devel list, we'll follow up there. Many thanks! On Apr 29, 2009, at 1:09 PM, Brian Blank wrote: To Whom This May Concern: I've been trying to dig a little deeper into this problem and found some additional information. First, the stack traces for the ABR and ABW were d

Re: [OMPI users] Bogus memcpy or bogus valgrind record

2009-04-29 Thread Jeff Squyres
On Apr 22, 2009, at 7:35 PM, François PELLEGRINI wrote: I have had no answers regarding the trouble (OpenMPI bug ?) I evidenced when combining OpenMPI and valgrind. Sorry for the delay in getting back to you; there are so many mails and only so many hours in the day... :-( I tried it wi

Re: [OMPI users] Purify found bugs inside open-mpi library

2009-04-29 Thread Brian Blank
To Whom This May Concern: I've been trying to dig a little deeper into this problem and found some additional information. First, the stack traces for the ABR and ABW were different. The ABR problem occurred in datatype_pack.h while the ABW problem occurred in datatype_unpack.h. The problem appea

Re: [OMPI users] Problem with running openMPI program

2009-04-29 Thread Gus Correa
Hi Ankush You can run the MITgcm ocean model test cases and the CAM3 atmospheric model test with two processors only, but the codes scale well to any number of processors. They are "real life" applications, but not too hard to get to work. It will take some reading of their README and INSTALL fil

Re: [OMPI users] running problem on Dell blade server, confirm 2d21ce3ce8be64d8104b3ad71b8c59e2514a72eb

2009-04-29 Thread Jeff Squyres
On Apr 25, 2009, at 11:59 AM, Anton Starikov wrote: I can confirm that I have exactly the same problem, also on Dell system, even with latest openmpi. Our system is: Dell M905 OpenSUSE 11.1 kernel: 2.6.27.21-0.1-default ofed-1.4-21.12 from SUSE repositories. OpenMPI-1.3.2 But what I can als

Re: [OMPI users] Problem with running openMPI program

2009-04-29 Thread Ankush Kaul
@Gus, the applications in the links you sent are really high level, and I believe really expensive too, as I would need a physical apparatus for various measurements along with the cluster. Am I right?

Re: [OMPI users] Problem with running openMPI program

2009-04-29 Thread Ankush Kaul
Are there any applications that I can implement on a small scale, in a lab or something? Also, what should I do for clustering web servers? On Wed, Apr 29, 2009 at 2:46 AM, Gus Correa wrote: > Hi Ankush > > Glad to hear that your MPI and cluster project were successful. > > I don't know if you would ca

Re: [OMPI users] ORTE_ERROR_LOG: Timeout in file

2009-04-29 Thread Hugh Dickinson
The remote node starts the following process when mpirun is executed on the local node:
25734 ? Ss 0:00 /usr/lib/openmpi/1.2.5-gcc/bin/orted -- bootproxy 1 --
I checked and it was not running before mpirun was executed. I'll look into installing a more recent version of Open MPI.