Re: [OMPI users] users Digest, Vol 1212, Issue 3, Message: 2

2009-04-27 Thread jan
Thank you, Jeff Squyres. I checked the web page http://www.open-mpi.org/community/lists/announce/2009/03/0029.php, then the page https://svn.open-mpi.org/trac/ompi/ticket/1853, but the svn.open-mpi.org site seems to have crashed. Then I tried Open MPI v1.3.2 for many different configuratio

Re: [OMPI users] Upgrade from Open MPI 1.2 to 1.3

2009-04-27 Thread Ralph Castain
Remember also that the RTE APIs changed between 1.2 and 1.3 - so I'm not sure what will happen in that case. It could be that the ones touching the MPI layer remained stable (don't honestly recall), though I believe there are RTE calls in 1.3 that don't exist in 1.2. I would think you would have a

Re: [OMPI users] Checkpointing hangs with OpenMPI-1.3.1

2009-04-27 Thread Josh Hursey
I still have not been able to reproduce the hang, but I'm still looking into it. I did commit a fix for the datatype copy error that I mentioned (r21080 in the Open MPI trunk, and it is in the pipeline for v1.3). Can you put in a print statement before MPI_Finalize, then try the program a

Re: [OMPI users] Upgrade from Open MPI 1.2 to 1.3

2009-04-27 Thread Jeff Squyres
I'd actually be surprised if it works. The back-end sizes of Open MPI structures definitely changed between 1.2 and 1.3. We used to think that this didn't matter, but then we found out that we were wrong. :-) Hence, I'd think that the same exact issues you have with taking a 1.2-compiled

Re: [OMPI users] Upgrade from Open MPI 1.2 to 1.3

2009-04-27 Thread Brian W. Barrett
I think Serge is talking about compiling the application against one version of Open MPI, linking dynamically, then running against another version of Open MPI. Since it's dynamically linked, the ORTE/OMPI interactions are covered (the version of mpirun, libopen-rte, and libmpi all match). Th

Re: [OMPI users] Upgrade from Open MPI 1.2 to 1.3

2009-04-27 Thread Ralph Castain
It's hard for me to believe that would work as there are fundamental differences in the MPI-to-RTE interactions between those releases. If it does, it could be a fluke - I personally would not trust it. Ralph On Mon, Apr 27, 2009 at 12:04 PM, Serge wrote: > Hi Jeff, > > > That being said, we ha

Re: [OMPI users] Upgrade from Open MPI 1.2 to 1.3

2009-04-27 Thread Serge
Hi Jeff, > That being said, we have fixed this issue and expect to support binary > compatibility between Open MPI releases starting with v1.3.2 (v1.3.1 As far as I can tell from reading the release notes for v1.3.2, the binary compatibility has not been announced yet. It was rather a bug fix

Re: [OMPI users] Question about restart

2009-04-27 Thread Josh Hursey
Thanks for the patch. I applied a version of it to the Open MPI trunk (r21079) and started the process of moving it to the v1.3 release series: https://svn.open-mpi.org/trac/ompi/ticket/1898 Thanks, Josh On Apr 27, 2009, at 9:53 AM, Josh Hursey wrote: Thanks for the bug report. I am hav

Re: [OMPI users] Question about restart

2009-04-27 Thread Josh Hursey
Thanks for the bug report. I am having a difficult time reproducing the error. Are you running on a single machine using shared memory or across multiple machines using a high-speed network? Based on your bug report, my suspicion is that an event is not being properly de-registered from th

Re: [OMPI users] Checkpointing hangs with OpenMPI-1.3.1

2009-04-27 Thread Josh Hursey
Sorry for the long delay in responding. It is a bit odd that the hang does not occur when running on only one host. I suspect that is more due to timing than anything else. I am not able to reproduce the hang at the moment, but I do get an occasional datatype copy error which could be symptoma