Hi All,
when the nodes belong to different subnets the following error messages
pop up:
[powerbook.2-net:20826] *** An error occurred in MPI_Allreduce
[powerbook.2-net:20826] *** on communicator MPI_COMM_WORLD
[powerbook.2-net:20826] *** MPI_ERR_INTERN: internal error
[powerbook.2-net:20826] **
A few clarifying questions:
What is your netmask on these hosts?
Where is the MPI_ALLREDUCE in your app -- right away, or somewhere deep
within the application? Can you replicate this with a simple MPI
application that essentially calls MPI_INIT, MPI_ALLREDUCE, and
MPI_FINALIZE?
Can you replica