Hello,

We are seeing random hangs in some applications (NAMD, Molpro, ...) when using the openib BTL.
We are using Open MPI 1.4.3 and 1.3.4, both compiled with the Intel compiler (icc).

Linux kernel: 2.6.18-128 (RH); each node has 8 cores.
OFED version: 3.2
ibv_devinfo looks OK on all nodes.

Note that we don't have any problems when running over TCP.

When I run strace -p <pid> on a hung process, I get this endless output:

    poll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=6, events=POLLIN}, {fd=7, events=POLLIN} ..
    ..

Any idea?

Thank you for your help.

nixter