Re: [OMPI users] users Digest, Vol 536, Issue 2

2007-03-21 Thread George Bosilca
If the only thing you really need is what you just described, then FT- MPI is your best pick. At least until we finish moving the fault tolerance features from FT-MPI in Open MPI. Giving you a time frame it will be difficult, the only thing I can state it's that this will not happens before

Re: [OMPI users] users Digest, Vol 536, Issue 2

2007-03-21 Thread Mohammad Huwaidi
The kind of recovery I am seeking after is easy, and the following simple example illustrates the point: I want to send a message to a different node. If it does not respond to me, I do not want my application to crash. I want to continue using other node resources. I hate it when a node cra

Re: [OMPI users] threading

2007-03-21 Thread David Burns
I have used POSIX threading and Open MPI without problems on our Opteron 2216 Cluster (4 cores per node). Moving to core-level parallelization with multi threading resulted in significant performance gains. Sam Adams wrote: I have been looking, but I haven't really found a good answer about sy

Re: [OMPI users] MPI processes swapping out

2007-03-21 Thread Heywood, Todd
P.s. I should have said this this is a pretty course-grained application, and netstat doesn't show much communication going on (except in stages). On 3/21/07 4:21 PM, "Heywood, Todd" wrote: > I noticed that my OpenMPI processes are using larger amounts of system time > than user time (via vmsta

[OMPI users] deadlock on barrier

2007-03-21 Thread tim gunter
i am experiencing some issues w/ openmpi 1.2 running on a rocks 4.2.1cluster(the issues also appear to occur w/ openmpi 1.1.5 and 1.1.4). when i run my program with the frontend in the list of nodes, they deadlock. when i run my program without the frontend in the list of nodes, they run to comp

[OMPI users] MPI processes swapping out

2007-03-21 Thread Heywood, Todd
I noticed that my OpenMPI processes are using larger amounts of system time than user time (via vmstat, top). I'm running on dual-core, dual-CPU Opterons, with 4 slots per node, where the program has the nodes to themselves. A closer look showed that they are constantly switching between run and sl

Re: [OMPI users] Fault Tolerance

2007-03-21 Thread George Bosilca
What you're looking for is called PVM. Moreover, your requirements are a mixed bags of FT features that comes from completely different worlds. 1) Recover any software/hardware crashes ? What kind of recovery you're looking for ? What is your definition of recovering ? If what you want is

Re: [OMPI users] Fault Tolerance

2007-03-21 Thread Thomas Spraggins
To migrate processes, you need to be able to checkpoint them. I believe that LAM-MPI is the only MPI implementation that allows this, although I have never used LAM-MPI. Good luck. Tom Spraggins t...@virginia.edu On Mar 21, 2007, at 1:09 PM, Mohammad Huwaidi wrote: Hello folks, I am try

[OMPI users] Fault Tolerance

2007-03-21 Thread Mohammad Huwaidi
Hello folks, I am trying to write some fault-tolerance systems with the following criteria: 1) Recover any software/hardware crashes 2) Dynamically Shrink and grow. 3) Migrate processes among machines. Does anyone has examples of code? What MPI platform is recommended to accomplish such requi

Re: [OMPI users] users Digest, Vol 536, Issue 1

2007-03-21 Thread Mohammad Huwaidi
I tried to use multi-threading using OpenMPI but it did not workout for me; however, it worked perfectly with MPICH2. users-requ...@open-mpi.org wrote: Send users mailing list submissions to us...@open-mpi.org To subscribe or unsubscribe via the World Wide Web, visit http://www

[OMPI users] threading

2007-03-21 Thread Sam Adams
I have been looking, but I haven't really found a good answer about system level threading. We are about to get a new cluster of dual-processor quad-core nodes or 8 cores per node. Traditionally I would just tell MPI to launch two processes per dual processor single core node, but with eight cor

[OMPI users] segfault with netpipe & ompi 1.2 + MX (32bit only)

2007-03-21 Thread Nicolas Niclausse
hello, I'm trying to use netpipe with openmpi on my system (rhel 3, dual opteron, myrinet 2G with MX drivers). Everything is fine when i use a 64bit binary, but it segfaults when i use a 32 bit binary : nniclausse# mpirun -machinefile ./machines ./NPmpi [helios38:15657] *** Process received s

[OMPI users] HotI 2007 Call for Papers -- 4th call. Deadline March 31st is approaching

2007-03-21 Thread Weikuan Yu
Deadline March 31st is approaching. Apologies if you received multiple copies of this posting. Please feel free to distribute it to those who might be interested. -