Jeff,
The first location is indeed in ompi_coll_libnbc_iallreduce()
Lee Ann,
thanks for the bug report,
for the time being, can you please give the attached patch a try ?
Cheers,
Gilles
FWIW
NBC_Schedule_request() sets handle->tmpbuf = tmpbuf and call
NBC_Start(handle, schedule)
FYI for others that have run into the same problem, see
https://github.com/openucx/ucx/issues/3359. In short:
1. Use UCX 1.5 rather than 1.4 (I recommend updating
https://www.open-mpi.org/faq/?category=buildcuda)
2. Dynamically link in the cudart library (by default nvcc will statically
link it).
Lee Ann --
Thanks for your bug report.
I'm not able to find a call to NBC_Schedule_request() in
ompi_coll_libnbc_iallreduce().
I see 2 calls to NBC_Schedule_request() in
ompi/mca/coll/libnbc/nbc_iallreduce.c, but they are in different functions.
Can you clarify exactly which one(s) you're ref
I'm trying to build OpenMPI 3.1.2 as part of Mellanox HPC-X and I'm having some
problems with the underlying libraries. The true problem was masked for awhile
by an bug in error handling in OpenMPI. In mca/coll/libnbc/nbc_iallreduce.c in
function ompi_coll_libnbc_iallreduce() we have some erro
Why not just use C99 stdint? That gives you fixes-size types.
-Nathan
> On Mar 14, 2019, at 9:38 AM, George Reeke wrote:
>
> On Wed, 2019-03-13 at 22:10 +, Sergio None wrote:
>> Hello.
>>
>>
>> I'm using OpenMPI 3.1.3 on x64 CPU and two ARMv8( Raspberry pi 3).
>>
>>
>> But i'm having s
On Wed, 2019-03-13 at 22:10 +, Sergio None wrote:
> Hello.
>
>
> I'm using OpenMPI 3.1.3 on x64 CPU and two ARMv8( Raspberry pi 3).
>
>
> But i'm having some issues with data types that are architecture
> dependent, like 'long'.
>
-trimmed-
>
> So my question is: there
Hmm -- yes, this could be quite interesting.
We always have a need for documentation to be updated!
I'm afraid that I already have direct responsibility for an intern this summer
and some other "manage people" kinds of duties that mean that I will not have
time to be a mentor in this program, t