[OMPI users] CentOS 6.3 & OpenMPI 1.6.3

2012-11-27 Thread Joseph Farran
Howdy. I a have a stock CentOS 6.3 OS and a Mellanox MT26428 card. I installed the Mellanox OFED MLNX_OFED_LINUX-1.5.3-3.1.0-rhel6.3-x86_64 which installed just fine. Rebooted the system and when I try building OpenMPI 1.6.3, it aborts with: mtl_mxm_send.c: In function 'ompi_mtl_mxm_isend':

Re: [OMPI users] Maximum number of MPI processes on a node + discovering faulty nodes

2012-11-27 Thread George Markomanolis
Dear Ralph, For the file descriptors the declared limit is over 65536 files but if OMPI needs several of them, then this can be interesting. Is there any source to read about it or I just should do trials? About the child processes again, can I do something? I have root access, so I can change

Re: [OMPI users] Maximum number of MPI processes on a node + discovering faulty nodes

2012-11-27 Thread Ralph Castain
Just glancing at the code, I don't see anything tied to 2**12 that pops out at me. I suspect the issue is that you are hitting a system limit on the number of child processes a process can spawn - this is different from the total number of processes allowed on the node - or the number of file de

Re: [OMPI users] Maximum number of MPI processes on a node + discovering faulty nodes

2012-11-27 Thread George Markomanolis
Dear Jeff, Of course I was thinking to execute memtester on each node on the same time and gather the outputs. However executing memtester on a node with 48GB memory it takes a lot of time (more than 1-2 hours, I don't remember exactly, maybe even more because I cancelled its execution) and I

Re: [OMPI users] Maximum number of MPI processes on a node + discovering faulty nodes

2012-11-27 Thread George Markomanolis
Dear Ralph, Thanks for the answer, I am using OMPI v1.4.1. Best regards, George Markomanolis On 11/26/2012 05:07 PM, Ralph Castain wrote: What version of OMPI are you using? On Nov 26, 2012, at 1:02 AM, George Markomanolis wrote: Dear all, Initially I would like an advice of how to ident

[OMPI users] How to reduce the use of the network by Open MPI job?

2012-11-27 Thread just me
Hi all, I have a question related to Open MPI btl_tcp module. How to slow down OMPI in terms of network utilization ? My OpenMPI based application (lets call it APP1) is sharing hardware components with other software (lets call it APP2) which is very sensitive on network saturation. The APP2 suffe