[OMPI users] Fault in not recycling bsend buffer ?

2020-03-17 Thread Martyn Foster via users
Hi all, I'm new here, so please be gentle :-) Versions: OpenMPI 4.0.3rc1, UCX 1.7 I have a hang in an application (OK for small data sets, but fails with a larger one). The error is "bsend: failed to allocate buffer" This comes from pml_ucx.c:693 mca_pml_ucx_bsend( ... ) ... packed_data = mca

Re: [OMPI users] Fault in not recycling bsend buffer ?

2020-03-17 Thread George Bosilca via users
Martyn, I don't know exactly what your code is doing, but based on your inquiry I assume you are using MPI_BSEND multiple times and you run out of local buffers. The MPI standard does not mandate a wait until buffer space becomes available, because that can lead to deadlocks (communication patter

Re: [OMPI users] Limits of communicator size and number of parallel broadcast transmissions

2020-03-17 Thread George Bosilca via users
On Mon, Mar 16, 2020 at 6:15 PM Konstantinos Konstantinidis via users < users@lists.open-mpi.org> wrote: > Hi, I have some questions regarding technical details of MPI collective > communication methods and broadcast: > >- I want to understand when the number of receivers in a MPI_Bcast can >