hi guys

im having a strange problem when starting some jobs that i dont uderstand.

its just 1 node that has an issue and i find it odd.

The OpenFabrics (openib) BTL failed to initialize while trying to
allocate some locked memory.  This typically can indicate that the
memlock limits are set too low.  For most HPC installations, the
memlock limits should be set to "unlimited".  The failure occured
here:

  OMPI source:   btl_openib_component.c:1066
  Function:      ompi_free_list_init_ex_new()
  Device:        mlx4_0
  Memlock limit: 65536


i have rechecked and verified everywhere that the limit is set to 
"unlimited".

i can run the same job on other nodes and they have no problems, just so 
strnage that this one node crashes.

running open_mpi 1.4.5 


not sure where to look next.

this node was working 2 days ago, but after a reboot it has changed to 
this.









Denne e-posten kan innehalde informasjon som er konfidensiell 
og/eller underlagt lovbestemt teieplikt. Kun den tiltenkte adressat har 
adgang 
til å lese eller vidareformidle denne e-posten eller tilhøyrande vedlegg. 
Dersom De ikkje er den tiltenkte mottakar, vennligst kontakt avsendar pr 
e-post, slett denne e-posten med vedlegg og makuler samtlige utskrifter og 
kopiar av den.

This e-mail may contain confidential information, or otherwise 
be protected against unauthorised use. Any disclosure, distribution or 
other use of the information by anyone but the intended recipient is 
strictly prohibited. 
If you have received this e-mail in error, please advise the sender by 
immediate reply and destroy the received documents and any copies hereof.


PBefore 
printing, think about the environment


Reply via email to