I'm trying to build OpenMPI 3.1.2 as part of Mellanox HPC-X and I'm having some problems with the underlying libraries. The true problem was masked for awhile by an bug in error handling in OpenMPI. In mca/coll/libnbc/nbc_iallreduce.c in function ompi_coll_libnbc_iallreduce() we have some error handling at the end that looks like:
res = NBC_Schedule_request (schedule, comm, libnbc_module, request, tmpbuf); if (OPAL_UNLIKELY(OMPI_SUCCESS != res)) { OBJ_RELEASE(schedule); free(tmpbuf); return res; } The Schedule_request call failed, and in that call the "schedule" and "tmpbuf" were freed. Then we return and again, the "schedule" and "tmpbuf" are freed. It looks like this occurs elsewhere in the source file too. Lee Ann ----- Lee Ann Riesen, Enterprise and Government Group, Intel Corporation, Hillsboro, OR Phone 503-613-1952
_______________________________________________ users mailing list users@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/users