On Wed, Feb 27, 2008 at 10:01:06AM -0600, Brian W. Barrett wrote: > The only solution to this problem is to suck it up and audit all the code > to eliminate calls to opal_progress() in situations where infinite > recursion can result. It's going to be long and painful, but there's no > quick fix (IMHO). > The trick is to call progress only from functions that are called directly by a user process. Never call progress from a callback functions. The main offenders of this rule are calls to OMPI_FREE_LIST_WAIT(). They should be changed to OMPI_FREE_LIST_GET() and dial with NULL return value.
-- Gleb.