Re: [OMPI users] RoCE device performance with large message size

2017-10-10 Thread Jeff Squyres (jsquyres)
You probably want to make sure that lossless Ethernet is enabled everywhere (that's a common problem I've seen); otherwise, you end up with timeouts and retransmissions. Check with your vendor on how to do layer-0 diagnostics, etc. Also, if this is a new vendor, they should probably try runn

[OMPI users] RoCE device performance with large message size

2017-10-10 Thread Brendan Myers
Hello All,

I have a RoCE interoperability event starting next week, and I was wondering if anyone had any ideas to help me with a new vendor I am trying to help get ready. I am using:

* Open MPI 2.1
* Intel MPI Benchmarks 2018
* OFED 3.18 (requirement from vendor)
*