Hi. We have started to scale up one of our codes and sometimes we get messages like this:
[c9-13.local:31125] Memory 0x2aaab7b64000:217088 cannot be freed from the registration cache. Possible memory corruption. It seems like the application runs normally and it does not crash becaus of this. Should we be worried? We have tested the code with up to 1700 cores and the message becomes more frequent as we scale up. System details: Rocks 5.2 (aka CentOS 5.3) x86_64 INTEL Compiler 11.1 OFED 1.4.1 OpenMPI 1.3.3 Best regards and Merry Christmas to all, r. -- The Computer Center, University of Tromsø, N-9037 TROMSØ Norway. phone:+47 77 64 41 07, fax:+47 77 64 41 00 Roy Dragseth, Team Leader, High Performance Computing Direct call: +47 77 64 62 56. email: roy.drags...@uit.no