Re: [OMPI users] poor performance using the openib btl

2014-06-24 Thread Maxime Boissonneault
What are your threading options for OpenMPI (when it was built) ? I have seen OpenIB BTL completely lock when some level of threading is enabled before. Maxime Boissonneault Le 2014-06-24 18:18, Fischer, Greg A. a écrit : Hello openmpi-users, A few weeks ago, I posted to the list about di

[OMPI users] poor performance using the openib btl

2014-06-24 Thread Fischer, Greg A.
Hello openmpi-users, A few weeks ago, I posted to the list about difficulties I was having getting openib to work with Torque (see "openib segfaults with Torque", June 6, 2014). The issues were related to Torque imposing restrictive limits on locked memory, and have since been resolved. Howeve

Re: [OMPI users] affinity issues under cpuset torque 1.8.1

2014-06-24 Thread Jeff Squyres (jsquyres)
Brock -- Can you run with "ompi_info --all"? With "--param all all", ompi_info in v1.8.x is defaulting to only showing level 1 MCA params. It's showing you all possible components and variables, but only level 1. Or you could also use "--level 9" to show all 9 levels. Here's the relevant se

[OMPI users] mpi prorg fails (big data)

2014-06-24 Thread Dr.Peer-Joachim Koch
Hi, one of our cluster users reported a problem with openmpi. He created a short sample (just a few lines) which will start and crash after a short time. We only see "Fatal error in PMPI_Gather: Other MPI error" - no further details. He is using an intel fortran compiler with a self compiled ope

Re: [OMPI users] affinity issues under cpuset torque 1.8.1

2014-06-24 Thread Ralph Castain
That's odd - it shouldn't truncate the output. I'll take a look later today - we're all gathered for a developer's conference this week, so I'll be able to poke at this with Nathan. On Mon, Jun 23, 2014 at 3:15 PM, Brock Palen wrote: > Perfection, flexible, extensible, so nice. > > BTW this do

Re: [OMPI users] affinity issues under cpuset torque 1.8.1

2014-06-24 Thread Ralph Castain
Let's say that the downside is an unknown at this time. The only real impact of setting that param is that each daemon now reports its topology at startup. Without the param, only the daemon on the first node does so. The concern expressed when we first added that report was that the volume of data

[OMPI users] Problem mpi

2014-06-24 Thread Diego Saúl Carrió Carrió
Dear all, I have problems for a long time related with mpirun. When I executed mpirun (with my program) I obtained the next error after a while: . . . . . mlx4: local QP operation err (QPN c00054, WQE index a, vendor syndrome 6f, opcode = 5e) [[64826,1],0][btl_openib_component.c:3497: