[OMPI users] OpenMPI with PSM on True Scale with OmniPath drivers

2018-01-22 Thread William Hay
We have a couple of clusters with Qlogic Infinipath/Intel TrueScale networking. While testing a kernel upgrade we find that the Truescale drivers will no longer build against recent RHEL kernels. Intel tells us that the Omnipath drivers will work for True Scale adapters so we install those. Basi

Re: [OMPI users] Startup limited to 128 remote hosts in some situations?

2017-01-18 Thread William Hay
On Tue, Jan 17, 2017 at 09:56:54AM -0800, r...@open-mpi.org wrote: > As I recall, the problem was that qrsh isn???t available on the backend > compute nodes, and so we can???t use a tree for launch. If that isn???t true, > then we can certainly adjust it. > qrsh should be available on all nodes o

Re: [OMPI users] BLCR + Qlogic infiniband

2012-12-04 Thread William Hay
On 28 November 2012 11:14, William Hay wrote: > I'm trying to build openmpi with support for BLCR plus qlogic infiniband > (plus grid engine). Everything seems to compile OK and checkpoints are > taken but whenever I try to restore a checkpoint I get the following erro

[OMPI users] BLCR + Qlogic infiniband

2012-11-28 Thread William Hay
I'm trying to build openmpi with support for BLCR plus qlogic infiniband (plus grid engine). Everything seems to compile OK and checkpoints are taken but whenever I try to restore a checkpoint I get the following error: - do_mmap(, 2aaab18c7000, 1000, ...) failed: ffea