Hi all, I'm trying to add EC2 instances into my local cluster with openMPI. So far openMPI works well on the local cluster, and I have set up passwordless SSH between the local cluster and the Amazon EC2 instance.
However, when I add the public DNS name to a hostfile (defaulthostfiletest):

    comp1 slots=2 max-slots=8
    comp2 slots=2 max-slots=8
    comp3 slots=2 max-slots=4
    ec2-174-129-183-64.compute-1.amazonaws.com slots=2 max-slots=2

and then run:

    [/home/ntlp/cashmoney/mainFrame]$mpirun -np 6 --hostfile defaulthostfiletest hostname
    foretell
    foretell
    augur
    augur
    predict
    predict

it works, but when I try to use the Amazon instance the job hangs, so I kill it:

    [/home/ntlp/cashmoney/mainFrame]$mpirun -np 8 --hostfile defaulthostfiletest hostname
    ^C^Cmpirun: killing job...
    --------------------------------------------------------------------------
    mpirun noticed that the job aborted, but has no info as to the process
    that caused that situation.
    --------------------------------------------------------------------------
    --------------------------------------------------------------------------
    mpirun was unable to cleanly terminate the daemons on the nodes shown
    below. Additional manual cleanup may be required - please refer to
    the "orte-clean" tool for assistance.
    --------------------------------------------------------------------------
    ec2-174-129-183-64.compute-1.amazonaws.com - daemon did not report back when launched

Any advice? Are there any settings in /etc/ssh/sshd_config that I might need to change?

Theo

--
Theodore Van Rooy
http://greentheo.scroggles.com
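
For anyone reproducing this, here is a rough Python sketch of the slot arithmetic behind the two runs above. It is only an illustration and not part of the actual setup. It assumes ranks fill each host's slots in hostfile order, which is consistent with the -np 6 output shown. The helper names parse_hostfile and hosts_needed are hypothetical, and the default of one slot for a host without a slots= entry is an assumption.

```python
# Hostfile contents copied from the post above.
HOSTFILE = """\
comp1 slots=2 max-slots=8
comp2 slots=2 max-slots=8
comp3 slots=2 max-slots=4
ec2-174-129-183-64.compute-1.amazonaws.com slots=2 max-slots=2
"""


def parse_hostfile(text):
    """Return a list of (hostname, slots) pairs in file order."""
    hosts = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        slots = 1  # assumption: one slot when no slots= entry is given
        for field in fields[1:]:
            key, _, value = field.partition("=")
            if key == "slots":
                slots = int(value)
        hosts.append((fields[0], slots))
    return hosts


def hosts_needed(hosts, nprocs):
    """Hosts touched when nprocs ranks fill slots in file order (simplified model)."""
    used = []
    remaining = nprocs
    for name, slots in hosts:
        if remaining <= 0:
            break
        used.append(name)
        remaining -= slots
    return used


hosts = parse_hostfile(HOSTFILE)
for nprocs in (6, 8):
    print(nprocs, hosts_needed(hosts, nprocs))
```

Under this simplified model, the three local nodes provide 2 + 2 + 2 = 6 slots, so -np 6 never touches the EC2 entry, while -np 8 is the first run that has to launch a daemon on it.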