On Jun 19, 2012, at 19:23 , Ralph Castain wrote:
> What we have said is that we don't believe the FT "run thru failure" position
> pushed by UTK is particularly required at this time. Partly a question of
> impact vs benefit, mostly due to competing approaches offering equivalent
> fault recov
Just finished doing that.
Still getting the same error. How do I make sure there are no old
builds/files left?
I uninstalled everything to do with MPI, Cygwin, cleared environment
variables, did the whole Windows build again and then did the
supercomputing tutorial.
--
Vimal
From: users-
That's a little bit strong - OMPI still supports checkpoint/restart as a fault
tolerance mechanism. There really isn't anything the sys admin has to do,
though - what is required is that users periodically order their programs to
checkpoint so they can be restarted after a failure.
Checkpointin
There's something else wrong, if that's the Supercomputing Blog tutorial
1 you're running. It works happily without a hostfile. I think you
have some borked paths there.
I don't know why a Windows version is looking for an etc directory for a
hostfile, unless there's some of your previous Cy
It has been clearly stated that the official position pushed forward by a
majority of the Open MPI developer community is that fault tolerance is not
needed so we (read this as the official version of Open MPI) do not support it.
However, a group of researchers have been working toward a version
Is hostname the name of the system I'm running it on?
Just tried that. Got the same error message
--
Vimal
From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On
Behalf Of Ralph Castain
Sent: Tuesday, June 19, 2012 1:03 PM
To: Open MPI Users
Subject: Re: [OMPI users] U
You're getting that error because you failed to specify any hosts on your cmd
line - so OMPI has no idea where to launch the procs. It looked for a default
hostfile, but didn't find that either.
Just add a -host option to your command line and tell it where you
want the procs to run.
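
As a minimal sketch of that advice, the snippet below assembles an mpirun command line with an explicit -host list and prints it for inspection rather than launching anything. The host names (node1, node2), the process count and the ./hello executable are hypothetical placeholders, not values from this thread. Only -np and -host are real mpirun options.

```shell
# Hypothetical values, not from the thread: substitute your own machines and program.
HOSTS="node1,node2"     # comma-separated hosts to launch on
NPROCS=2                # number of processes to start
PROG="./hello"          # your compiled MPI executable

# Assemble the launch command; -np and -host are standard mpirun options.
CMD="mpirun -np $NPROCS -host $HOSTS $PROG"

# Print the command for inspection; run it yourself once the values are right.
echo "$CMD"
```

For a single-machine test, setting HOSTS to that machine's own name (or localhost) should be enough to get past the missing-hostfile error.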
On Jun
Damien, Shiqing, Jeff?
--
Vimal
From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf
Of vimalmat...@eaton.com
Sent: Monday, June 18, 2012 3:32 PM
To: us...@open-mpi.org
Subject: [OMPI users] Using OpenMPI on a network
So I configured and compiled a simple MPI
Dear Rolf,
I compiled openmpi-trunk with $ ../configure --prefix=/opt/openmpi-trunk
--disable-mpi-interface-warning --with-cuda=/opt/cuda
And that error is now gone!
Thanks a lot for your assistance,
- D.
2012/6/19 Rolf vandeVaart
> Dmitry:
>
> It turns out that by default in Ope
Hi all, can anyone explain the fault-tolerance features in Open MPI to me? I've read
the FAQs and some papers about this topic listed on open-mpi.org, but I still
can't figure out, when one node of my supercomputer system fails during
computation, what would happen with the fault-tolerance mechanism in
Dear,
Please help.
I configured Open MPI and it can checkpoint HPL.
However, whenever I try to checkpoint the NAS Parallel Benchmarks, it kills the
application without an informative message.
How do I configure Open MPI 1.6 to checkpoint NPB? I really need
help; I have been on this issu