If the only thing you really need is what you just described, then
FT-MPI is your best pick, at least until we finish moving the fault
tolerance features from FT-MPI into Open MPI. Giving you a time frame
is difficult; the only thing I can state is that this will
not happen before
The kind of recovery I am after is simple, and the following
example illustrates the point:
I want to send a message to a different node. If it does not respond to
me, I do not want my application to crash. I want to continue using
other node resources.
I hate it when a node cra
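A minimal sketch of the behaviour asked for above, under loud assumptions: this is not MPI code. It uses plain TCP sockets from the Python standard library as a stand-in, and the function name `send_to_nodes` and its `(host, port)` node list are hypothetical. It only shows the pattern of treating an unresponsive node as a recoverable condition and carrying on with the remaining nodes.

```python
import socket


def send_to_nodes(nodes, payload, timeout=1.0):
    """Try to deliver payload to each (host, port) pair.

    Returns (reached, failed). An unresponsive node is recorded and skipped
    instead of aborting the whole run.
    """
    reached, failed = [], []
    for host, port in nodes:
        try:
            # create_connection raises OSError (refused, timed out,
            # unreachable, name lookup failure) if the node does not answer.
            with socket.create_connection((host, port), timeout=timeout) as conn:
                conn.sendall(payload)
            reached.append((host, port))
        except OSError:
            # Assumption: any socket-level failure means "skip this node".
            failed.append((host, port))
    return reached, failed
```

The caller can then schedule further work only on the nodes in `reached`. Whether a given MPI implementation can offer the same behaviour depends on its fault tolerance support, which is what this thread is about.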
I have used POSIX threading and Open MPI without problems on our Opteron
2216 Cluster (4 cores per node). Moving to core-level parallelization
with multithreading resulted in significant performance gains.
Sam Adams wrote:
I have been looking, but I haven't really found a good answer about
sy
P.S. I should have said that this is a pretty coarse-grained application,
and netstat doesn't show much communication going on (except in stages).
On 3/21/07 4:21 PM, "Heywood, Todd" wrote:
> I noticed that my OpenMPI processes are using larger amounts of system time
> than user time (via vmsta
i am experiencing some issues w/ openmpi 1.2 running on a rocks
4.2.1 cluster (the issues also appear to occur w/ openmpi
1.1.5 and 1.1.4).
when i run my program with the frontend in the list of nodes, it deadlocks.
when i run my program without the frontend in the list of nodes, it runs to
comp
I noticed that my OpenMPI processes are using larger amounts of system time
than user time (via vmstat, top). I'm running on dual-core, dual-CPU
Opterons, with 4 slots per node, where the program has the nodes to
themselves. A closer look showed that they are constantly switching between
run and sl
What you're looking for is called PVM. Moreover, your requirements
are a mixed bag of FT features that come from completely different
worlds.
1) Recover any software/hardware crashes? What kind of recovery
are you looking for? What is your definition of recovering? If what
you want is
To migrate processes, you need to be able to checkpoint them. I
believe that LAM-MPI is the only MPI implementation that allows this,
although I have never used LAM-MPI.
Good luck.
Tom Spraggins
t...@virginia.edu
On Mar 21, 2007, at 1:09 PM, Mohammad Huwaidi wrote:
Hello folks,
I am try
Hello folks,
I am trying to write some fault-tolerant systems with the following
criteria:
1) Recover any software/hardware crashes
2) Dynamically shrink and grow.
3) Migrate processes among machines.
Does anyone have examples of code? What MPI platform is recommended to
accomplish such requi
I tried to use multi-threading with OpenMPI but it did not work out for
me; however, it worked perfectly with MPICH2.
users-requ...@open-mpi.org wrote:
Send users mailing list submissions to
us...@open-mpi.org
To subscribe or unsubscribe via the World Wide Web, visit
http://www
I have been looking, but I haven't really found a good answer about
system level threading. We are about to get a new cluster of
dual-processor quad-core nodes or 8 cores per node. Traditionally I
would just tell MPI to launch two processes per dual processor single
core node, but with eight cor
hello,
I'm trying to use netpipe with openmpi on my system (rhel 3, dual opteron,
myrinet 2G with MX drivers).
Everything is fine when i use a 64-bit binary, but it segfaults when i use a
32-bit binary:
nniclausse# mpirun -machinefile ./machines ./NPmpi
[helios38:15657] *** Process received s
Deadline March 31st is approaching.
Apologies if you received multiple copies of this posting.
Please feel free to distribute it to those who might be interested.
-