[OMPI users] Interaction between Intel and OpenMPI floating point exceptions

Steve Lowder Mon, 6 Apr 2009 19:23:00 -0400

Recently I've been running an MPI code that uses the LAPACK slamchroutine to determine machine precision parameters. This software iscompiled using the latest Intel Fortran compiler and setting the -fpe0argument to watch for certain floating point errors. The slamchroutines crashed and printed an OpenMPI stacktrace to report anunderflow error, however the Intel -fpe0 setting doesn't abort onunderflow. When this software is not compiled and linked with OpenMPI,it ignores the underflow and doesn't abort when compiled with -fpe0.

When I run the MPI version and set --mca opal_signal 6,7,11 the codedoesn't abort on underflow. I'd like to know if I'm interpreting thisbehavior correctly, it appears that the mpi versus no mpi cases handleunderflow differently. I'm assuming OpenMPI has a handler that processesthe interrupts ahead of the Fortran RTL, stopping execution. Otherwisethe Fortran RTL handler would just ignore the underflow. Do I sort ofunderstand what is going on here? Is there another solution short ofthe --mca opal_signal switch?


thanks
Steve

[OMPI users] Interaction between Intel and OpenMPI floating point exceptions

Reply via email to