Re: [OMPI users] Double free in collectives

2019-03-14 Thread Gilles Gouaillardet
Jeff, The first location is indeed in ompi_coll_libnbc_iallreduce() Lee Ann, thanks for the bug report, for the time being, can you please give the attached patch a try ? Cheers, Gilles FWIW NBC_Schedule_request() sets handle->tmpbuf = tmpbuf and call NBC_Start(handle, schedule)

Re: [OMPI users] OpenMPI behavior with Ialltoall and GPUs

2019-03-14 Thread Adam Sylvester
FYI for others that have run into the same problem, see https://github.com/openucx/ucx/issues/3359. In short: 1. Use UCX 1.5 rather than 1.4 (I recommend updating https://www.open-mpi.org/faq/?category=buildcuda) 2. Dynamically link in the cudart library (by default nvcc will statically link it).

Re: [OMPI users] Double free in collectives

2019-03-14 Thread Jeff Squyres (jsquyres) via users
Lee Ann -- Thanks for your bug report. I'm not able to find a call to NBC_Schedule_request() in ompi_coll_libnbc_iallreduce(). I see 2 calls to NBC_Schedule_request() in ompi/mca/coll/libnbc/nbc_iallreduce.c, but they are in different functions. Can you clarify exactly which one(s) you're ref

[OMPI users] Double free in collectives

2019-03-14 Thread Riesen, Lee Ann
I'm trying to build OpenMPI 3.1.2 as part of Mellanox HPC-X and I'm having some problems with the underlying libraries. The true problem was masked for awhile by an bug in error handling in OpenMPI. In mca/coll/libnbc/nbc_iallreduce.c in function ompi_coll_libnbc_iallreduce() we have some erro

Re: [OMPI users] Best way to send on mpi c, architecture dependent data type

2019-03-14 Thread Nathan Hjelm via users
Why not just use C99 stdint? That gives you fixes-size types. -Nathan > On Mar 14, 2019, at 9:38 AM, George Reeke wrote: > > On Wed, 2019-03-13 at 22:10 +, Sergio None wrote: >> Hello. >> >> >> I'm using OpenMPI 3.1.3 on x64 CPU and two ARMv8( Raspberry pi 3). >> >> >> But i'm having s

Re: [OMPI users] Best way to send on mpi c, architecture dependent data type

2019-03-14 Thread George Reeke
On Wed, 2019-03-13 at 22:10 +, Sergio None wrote: > Hello. > > > I'm using OpenMPI 3.1.3 on x64 CPU and two ARMv8( Raspberry pi 3). > > > But i'm having some issues with data types that are architecture > dependent, like 'long'. > -trimmed- > > So my question is: there

Re: [OMPI users] [SciPy-Dev] Fwd: Announcement and thanks to Season of Docs survey respondents: Season of Docs has launched

2019-03-14 Thread Jeff Squyres (jsquyres) via users
Hmm -- yes, this could be quite interesting. We always have a need for documentation to be updated! I'm afraid that I already have direct responsibility for an intern this summer and some other "manage people" kinds of duties that mean that I will not have time to be a mentor in this program, t