Hi, Am 05.09.2012 um 06:42 schrieb San B:
> OpenMPI-1.6.1 is installed on Rocks-5.5 Linux cluster with intel > compilers and OFED-1.5.3. A sample Helloworld MPI program gives following > warning message: > > > /mpi/openmpi/1.6.1/intel/bin/mpirun -np 4 ./mpi > -------------------------------------------------------------------------- > WARNING: It appears that your OpenFabrics subsystem is configured to only > allow registering part of your physical memory. This can cause MPI jobs to > run with erratic performance, hang, and/or crash. > > This may be caused by your OpenFabrics vendor limiting the amount of > physical memory that can be registered. You should investigate the > relevant Linux kernel module parameters that control how much physical > memory can be registered, and increase them to allow registering all > physical memory on your machine. > > See this Open MPI FAQ item for more information on these Linux kernel module > parameters: > > http://www.open-mpi.org/faq/?category=openfabrics#ib-locked-pages > > Local host: masternode > Registerable memory: 4096 MiB > Total memory: 32151 MiB > -------------------------------------------------------------------------- > Greetings: 1 of 4 from the node masternode > Greetings: 2 of 4 from the node masternode > Greetings: 3 of 4 from the node masternode > Greetings: 0 of 4 from the node masternode > [masternode:29820] 3 more processes have sent help message > help-mpi-btl-openib.txt / reg mem limit low > [masternode:29820] Set MCA parameter "orte_base_help_aggregate" to 0 to see > all help / error messages > > The ulimit parameters also set to unlimited: > > ]# ulimit -a > core file size (blocks, -c) 0 > data seg size (kbytes, -d) unlimited > scheduling priority (-e) 0 > file size (blocks, -f) unlimited > pending signals (-i) 278528 > max locked memory (kbytes, -l) unlimited > max memory size (kbytes, -m) unlimited > open files (-n) 1024 > pipe size (512 bytes, -p) 8 > POSIX message queues (bytes, -q) 819200 > real-time priority (-r) 0 > stack size (kbytes, -s) unlimited > cpu time (seconds, -t) unlimited > max user processes (-u) 278528 > virtual memory (kbytes, -v) unlimited > file locks (-x) unlimited > > > The file /etc/securoty/limits.conf contains following lines: > > * soft memlock unlimited > * hard memlock unlimited > > But why still OpenMPI is throwing warning message wrt registered memory. These are not honored when a job is started by SGE, instead definitions inside SGE are used: `man sge_config` paragraph H_MEMORYLOCKED. execd_params H_MEMORYLOCKED=unlimited -- Reuti http://arc.liv.ac.uk/pipermail/gridengine-users/2008-July/019722.html > Thanks in advance > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users