Trying to examine CUDA support in OpenMPI, using OpenMPI current feature series (v1.7). There was a question on this mailing list back in October 2011 (http://www.open-mpi.org/community/lists/users/2011/10/17539.php), about OpenMPI being able to use P2P transfers in case when two MPI processed involved in the transfer operation happens to execute on the same machine, and the answer was that this feature is being implemented. So my question is - what is the current status here, is this feature supported now?
Thanks.