When I checkpoint my openmpi application using ompi_checkpoint, I see
that top command suddenly shows some really huge numbers in "CPU %"
field such as 150% 200% etc. After sometime, these numbers do come back
to the normal numbers under 100%. This happens exactly around the time
checkpoint is comp
I am observing a very strange performance issue with my openmpi program.
I have compute intensive openmpi based application that keeps the data
in memory, process the data and then dumps it to GPFS parallel file
system. GPFS parallel file system server is connected to a QDR
infiniband switch fro