Re: [OMPI users] Interaction between Intel and OpenMPI floating point exceptions

Iain Bason Tue, 7 Apr 2009 10:58:33 -0400


On Apr 6, 2009, at 7:22 PM, Steve Lowder wrote:

Recently I've been running an MPI code that uses the LAPACK slamch routine to determine machine precision parameters. This software is compiled using the latest Intel Fortran compiler and setting the -fpe0 argument to watch for certain floating point errors. The slamch routines crashed and printed an OpenMPI stacktrace to report an underflow error, however the Intel -fpe0 setting doesn't abort on underflow. When this software is not compiled and linked with OpenMPI, it ignores the underflow and doesn't abort when compiled with -fpe0.
When I run the MPI version and set --mca opal_signal 6,7,11 the code doesn't abort on underflow. I'd like to know if I'm interpreting this behavior correctly, it appears that the mpi versus no mpi cases handle underflow differently. I'm assuming OpenMPI has a handler that processes the interrupts ahead of the Fortran RTL, stopping execution. Otherwise the Fortran RTL handler would just ignore the underflow. Do I sort of understand what is going on here? Is there another solution short of the --mca opal_signal switch?

Your analysis sounds about right to me. There are Fortran intrinsic routines that can get those machine precision parameters instead of slamch. Would it be feasible to modify the code to use them?
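
As a rough sketch of what that could look like, the standard Fortran 90 numeric inquiry intrinsics report the properties of a real kind directly. The program name and the variable x below are only illustrative, and the correspondence to individual slamch arguments is an assumption to verify against your LAPACK version (for example, slamch('E') may differ from epsilon(x) by a factor of two, depending on the rounding convention it assumes).

```fortran
program machine_params
  implicit none
  real :: x   ! default (single precision) real, the kind slamch describes

  ! Inquiry intrinsics look only at the type and kind of the argument,
  ! so x does not need to be assigned a value first.
  print *, 'epsilon     = ', epsilon(x)      ! spacing of reals near 1.0
  print *, 'tiny        = ', tiny(x)         ! smallest positive normalized value
  print *, 'huge        = ', huge(x)         ! largest finite value
  print *, 'radix       = ', radix(x)        ! base of the number model
  print *, 'digits      = ', digits(x)       ! number of base-radix mantissa digits
  print *, 'minexponent = ', minexponent(x)  ! minimum model exponent
  print *, 'maxexponent = ', maxexponent(x)  ! maximum model exponent
end program machine_params
```

Because these are inquiry functions rather than run-time probes of the arithmetic, they should not need to provoke an underflow to find their answers, so the signal handling question would not come up for this part of the code.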


Iain
