I have already implemented test/sleep code, but the main problem is with the
broadcasts that send out the SIMD instructions: these are blocking, and
when the system is idle, it's these guys who consume the CPU while waiting for
work.
Implementing
echo "1" > /proc/sys/kernel/sched_compat_y
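
For reference, here is a rough sketch of the kind of test/sleep loop mentioned above. It is not the actual code from this setup. The names `poll_fn`, `sleep_us`, `wait_with_backoff` and `countdown_done` are made up for illustration, and `countdown_done` is only a stand-in for a real completion check. Assuming an MPI-3 library is available, the callback could instead wrap `MPI_Test` on the request returned by a non-blocking `MPI_Ibcast`, so that idle ranks sleep rather than spin inside a blocking broadcast.

```c
#define _POSIX_C_SOURCE 199309L
#include <stdbool.h>
#include <time.h>

/* Hypothetical callback type: returns true once the pending operation is done. */
typedef bool (*poll_fn)(void *ctx);

/* Sleep for the given number of microseconds using POSIX nanosleep. */
static void sleep_us(long us)
{
    struct timespec ts;
    ts.tv_sec = us / 1000000L;
    ts.tv_nsec = (us % 1000000L) * 1000L;
    nanosleep(&ts, NULL);
}

/*
 * Poll `done` until it reports completion, sleeping between attempts.
 * The delay starts at min_us and doubles after each miss, capped at max_us,
 * so a long idle period costs very little CPU.
 * Returns the number of times `done` was called.
 */
static long wait_with_backoff(poll_fn done, void *ctx, long min_us, long max_us)
{
    long delay = min_us;
    long polls = 1;

    while (!done(ctx)) {
        sleep_us(delay);
        if (delay < max_us) {
            delay *= 2;
            if (delay > max_us)
                delay = max_us;
        }
        polls++;
    }
    return polls;
}

/* Stand-in completion test: reports "not done" until the counter reaches 0. */
static bool countdown_done(void *ctx)
{
    int *remaining = ctx;

    if (*remaining > 0) {
        (*remaining)--;
        return false;
    }
    return true;
}
```

The trade-off is latency: a rank that has backed off to `max_us` may take up to that long to notice new work, so the cap should be picked with the acceptable wake-up delay in mind.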
Hi Folks,
I have a run on 256 PEs onto a Lustre file system with the following code:
[snip]
integer :: mype,npe,pe_min,pe_max,pe_prev,pe_next,mpi_my_real, &
comm=mpi_comm_world,status(mpi_status_size),error, &
mpi_realsize, thefile
integer (kind=MPI_OFFSET_KIND) disp