On 14 December 2010 17:32, Lydia Heck <lydia.h...@durham.ac.uk> wrote: > > I have experimented a bit more and found that if I set > > OMPI_MCA_plm_rsh_num_concurrent=1024 > > a job with more than 2,500 processes will start and run. > > However when I searched the open-mpi web site for the the variable I could > not find any indication.
Lydia, a quick search find this page: http://docs.sun.com/source/820-3176-10/appb-mca.html It may be out of data, but does describe the parameters. What is your setting for plm_rsh_agent (ie are you using ssh or rsh) and also have you tried setting plm_rsh_debug