he section.
http://www.open-mpi.org/faq/?category=running#mpirun-scheduling
Happy computing,
Mark Kosmowski
> Message: 1
> Date: Wed, 16 Apr 2008 14:32:58 +0200
> From: " Jozef K??er "
> Subject: Re: [OMPI users] open mpi on smp
>
: 8
> Date: Wed, 09 Apr 2008 22:17:59 +0200
> From: Danesh Daroui
> Subject: Re: [OMPI users] submitted job stops
> To: Open MPI Users
> Message-ID: <47fd2477.1010...@bredband.net>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> Mark Kosmowski skrev:
&
Danesh:
Have you tried "mpirun -np 4 --hostfile hosts hostname" to verify that
ompi is working?
Can you remote access from each node to each other node?
If any node has more than 1 network device, are you using the ompi
options to specify which device to use?
Good luck,
Mark
> Message: 5
> Da
I have a successful ompi installation and my software runs across my
humble cluster of three dual-Opteron (single core) nodes on OpenSUSE
10.2. I'm planning to upgrade some RAM soon and have been thinking of
playing with affinity, since each cpu will have it's own DIMMs after
the upgrade. I have
Giovani:
Which compiler are you using?
Also, you didn't mention this, but does "mpirun hostname" give the
expected response? I (also new) had a hang similar to what you are
describing due to ompi getting confused as to which of two network
interfaces to use - "mpirun hostname" would hang when st
Are you pointing to the 64-bit build of HYPRE? For that matter, like Jeff
asked, are you sure that each library path that you are defining goes to a
64-bit library path?
Good luck,
Mark E. Kosmowski
peiying@saturn:~/elmer/elmer-5.4.0/fem-5.4.0> export
> LD_LIBRARY_PATH=/usr/local/openmpi/lib:/
On Jan 15, 2008 7:54 PM, Mark Kosmowski wrote:
> Dear Open-MPI Community:
>
> I have a 3 node cluster, each a dual opteron workstation running
> OpenSUSE 10.1 64-bit. The node names are LT, SGT and PFC. When I
> start an mpirun job from either SGT or PFC, things work as they ar
Dear Open-MPI Community:
I have a 3 node cluster, each a dual opteron workstation running
OpenSUSE 10.1 64-bit. The node names are LT, SGT and PFC. When I
start an mpirun job from either SGT or PFC, things work as they are
supposed to. However, if I start the same job from LT, the jobs hangs
at
guration so
that OMPI only uses the gigabit ports (and not the internet
connections that some of the machines have).
I am already planning on doing some benchmark comparisons to determine
the effect of compiler / math library on speed.
Thank you,
Mark Kosmowski
FWIW, what did MPICH say for the error?
I followed the install.pdf that comes with mpich. They have you start
up the daemon ring then run mpdtrace. This command tells each daemon
instance to report the hostname. I don't remember the exact error
message, but it was very clear that NODENAME was
sanity checking, there was a more verbose error message
regarding the failed node, so I reinstalled the OS, reconfigured my
environment variables for OpenMPI and everything is now working.
Thanks for the help and support so far,
Mark Kosmowski
On 2/7/07, Mark Kosmowski wrote:
Dear Open-MPI list
works.
Is there a better way of accomplishing this, or is this a matter of
there being more than one way to skin the proverbial cat?
Thanks,
Mark Kosmowski
On 2/8/07, Mark Kosmowski wrote:
I think I fixed the problem. I at least have mpirun ... hostname
working over the cluster.
The fir
mpi interface and that the two
-mca switches should be used? This could perhaps be most useful to a
beginner in either the 'Running MPI Jobs' or 'Troubleshooting'
sections of the FAQ.
Thanks,
Mark Kosmowski
Please find attached a tarball containing the stderror of mpirun ...
hostname across my cluster as well as the output from ompi_info.
Apologies for not including these earlier.
Thank you for any and all assistance,
Mark Kosmowski
ompi-output.tar.gz
Description: GNU Zip compressed data
er
node will always be having an internet connection in addition to the
gigabit cluster network).
I hope this is helpful to try to help me troubleshoot my system.
Thanks!
Mark Kosmowski
at should I be trying to do next to remedy this issue?
Any help would be appreciated.
Thanks,
Mark Kosmowski
16 matches
Mail list logo