[OMPI users] Open MPI exited on signal 11 (Segmentation fault). Trying to run a script that uses Open MPI

2013-07-04 Thread Rick White
Hello, I have this error: "mpiexec noticed that process rank 1 with PID 16087 on node server exited on signal 11 (Segmentation fault)." How can I fix it? Cheers and many thanks, Rick -- Richard Allen White III M.S. PhD Candidate - Suttle Lab Department of Microbiology & Immunology The Unive

[OMPI users] checkpoint-restart of version 1.6.5

2013-07-04 Thread basma a.azeem
Does Open MPI 1.6.5 support checkpoint/restart (self or BLCR)? I did not find ompi-checkpoint or ompi-restart in the version 1.6 documentation list on the site. Or does it use other executable names? Thank you.

Re: [OMPI users] example program "ring" hangs when running across multiple hardware nodes

2013-07-04 Thread Ralph Castain
You also might want to check that you don't have any firewalls between those nodes. This is a typical cause of what you describe. On Jul 4, 2013, at 4:25 PM, Gustavo Correa wrote: > Hi Jed > > You could try to select only the Ethernet interface that matches your nodes' IP > addresses, > which see
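
One rough way to act on the firewall suggestion above is to probe TCP reachability from one node to another. The following is a minimal sketch, not part of the original thread: the helper name `can_connect`, the host name `node2`, and port 22 in the usage note are hypothetical placeholders. It only shows whether a TCP connection to a port with an active listener succeeds. It does not distinguish a firewall drop from a refused connection, and Open MPI typically picks its TCP ports dynamically, so a single open port does not prove that MPI traffic will pass.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout.

    Any socket-level failure (refused, timed out, unreachable, name
    resolution error) is reported as False.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `can_connect("node2", 22)` run from the first node would report whether the SSH port on a (hypothetical) second node is reachable.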

Re: [OMPI users] example program "ring" hangs when running across multiple hardware nodes

2013-07-04 Thread Gustavo Correa
Hi Jed, You could try to select only the Ethernet interface that matches your nodes' IP addresses, which seems to be en2. The en1 interface seems to have an external IP. I am not sure about en3, but it is odd that it has a different IP than en2 while being in the same subnet. I wonder if this may be the reaso

[OMPI users] example program "ring" hangs when running across multiple hardware nodes

2013-07-04 Thread Jed O. Kaplan
Dear Open MPI gurus, I am running Open MPI 1.7.2 on a homogeneous cluster of Apple Xserves running OS X 10.6.8. My hardware nodes are connected through four gigabit Ethernet connections; I have no InfiniBand or other high-speed interconnect. The problem I describe below is the same if I use Open MPI 1

[OMPI users] checkpoint - restart of version 1.6.5

2013-07-04 Thread basma a.azeem
Does Open MPI 1.6.5 support checkpoint/restart (self or BLCR)? I did not find ompi-checkpoint or ompi-restart in the version 1.6 documentation list on the site. Or does it use other executable names? Thank you.