> Hi:
> I managed to run a 256 process job on a single node. I ran a simple test
> in which all processes send a message to all others.
> This was using Sun's Binary Distribution of Open MPI on Solaris which is
> based on r16572 of the 1.2 branch. The machine had 8 cores.
>
> burl-ct-v40z-0 49 =>/opt/SUNWhpc/HPC7.1/bin/mpirun --mca
> mpool_sm_max_size 2147483647 -np 256 connectivity_c
> Connectivity test on 256 processes PASSED.
> burl-ct-v40z-0 50 =>
> burl-ct-v40z-0 50 =>/opt/SUNWhpc/HPC7.1/bin/mpirun --mca
> mpool_sm_max_size 2147483647 -np 300 connectivity_c -v
> Connectivity test on 300 processes PASSED.
>
> burl-ct-v40z-0 54 =>limit
> cputime unlimited
> filesize unlimited
> datasize unlimited
> stacksize 10240 kbytes
> coredumpsize 0 kbytes
> vmemoryuse unlimited
> descriptors 65536
> burl-ct-v40z-0 55 =>
Thank you for the Solaris results. I compared your environment with the
user limits on our cluster. Our limit on open files seemed too small even
for running 256 processes. After we increased that limit, I was able to
run 256 processes per node.

SUSUKITA, Ryutaro
Peta-scale System Interconnect Project
Fukuoka Industry, Science & Technology Foundation
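
For anyone who wants to check the same limit on their own nodes, here is a minimal sketch in Python (Unix only) that reads the open-file descriptor limit and tries to raise the soft limit up to the hard limit. The helper names `nofile_limits` and `raise_soft_nofile` are hypothetical, and the target of 65536 is simply the `descriptors` value from the Solaris `limit` output quoted above, not a figure Open MPI is known to require. The change applies only to the calling process and its children, so in practice the limit has to be raised in the shell or launcher that starts mpirun.

```python
import resource


def nofile_limits():
    """Return the (soft, hard) open-file descriptor limits of this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def raise_soft_nofile(target):
    """Try to raise the soft limit to `target`, capped by the hard limit.

    Returns the soft limit in effect afterwards. The limit is never lowered.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    if soft == resource.RLIM_INFINITY:
        return soft  # already unlimited, nothing to do
    # An unlimited hard limit places no cap on the requested value.
    wanted = target if hard == resource.RLIM_INFINITY else min(target, hard)
    if wanted > soft:
        try:
            resource.setrlimit(resource.RLIMIT_NOFILE, (wanted, hard))
        except (ValueError, OSError):
            pass  # the OS refused the request; keep the current limit
    return resource.getrlimit(resource.RLIMIT_NOFILE)[0]


if __name__ == "__main__":
    print("before:", nofile_limits())
    # 65536 is an assumed target taken from the quoted Solaris environment.
    print("soft limit now:", raise_soft_nofile(65536))
```

If the hard limit itself is too low, an unprivileged process cannot raise it; that has to be changed by an administrator in the system configuration.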