About 9 months ago we had a new installation with a system of 1800 cores and at
the time we found that jobs with more than 1028 cores would not start. At the
time a colleague found that setting
OMPI_MCA_plm_rsh_num_concurrent=256
help with the problem.
We have now increased our processor count to more than 2700 cores and a job with
2,500 jobs does not start.
Is there any advice?
Best wishes,
Lydia Heck
------------------------------------------
Dr E L Heck
Senior Computer Manager
University of Durham
Institute for Computational Cosmology
Ogden Centre
Department of Physics
South Road
DURHAM, DH1 3LE
United Kingdom
e-mail: lydia.h...@durham.ac.uk
Tel.: + 44 191 - 334 3628
Fax.: + 44 191 - 334 3645
___________________________________________