hi guys im having a strange problem when starting some jobs that i dont uderstand.
its just 1 node that has an issue and i find it odd. The OpenFabrics (openib) BTL failed to initialize while trying to allocate some locked memory. This typically can indicate that the memlock limits are set too low. For most HPC installations, the memlock limits should be set to "unlimited". The failure occured here: OMPI source: btl_openib_component.c:1066 Function: ompi_free_list_init_ex_new() Device: mlx4_0 Memlock limit: 65536 i have rechecked and verified everywhere that the limit is set to "unlimited". i can run the same job on other nodes and they have no problems, just so strnage that this one node crashes. running open_mpi 1.4.5 not sure where to look next. this node was working 2 days ago, but after a reboot it has changed to this. Denne e-posten kan innehalde informasjon som er konfidensiell og/eller underlagt lovbestemt teieplikt. Kun den tiltenkte adressat har adgang til å lese eller vidareformidle denne e-posten eller tilhøyrande vedlegg. Dersom De ikkje er den tiltenkte mottakar, vennligst kontakt avsendar pr e-post, slett denne e-posten med vedlegg og makuler samtlige utskrifter og kopiar av den. This e-mail may contain confidential information, or otherwise be protected against unauthorised use. Any disclosure, distribution or other use of the information by anyone but the intended recipient is strictly prohibited. If you have received this e-mail in error, please advise the sender by immediate reply and destroy the received documents and any copies hereof. PBefore printing, think about the environment