> Thus, I still don't see what have happened at Imseih's hand, but I can > cause PANIC with a bit tricky steps, which I don't think valid. This > is what I wanted to know the exact steps to cause the PANIC.
> The attached 1 is the PoC of the TAP test (it uses system()..), and > the second is a tentative fix for that. (I don't like the fix, too, > though...) It is been difficult to get a generic repro, but the way we reproduce Is through our test suite. To give more details, we are running tests In which we constantly failover and promote standbys. The issue surfaces after we have gone through a few promotions which occur every few hours or so ( not really important but to give context ). I am adding some additional debugging to see if I can draw a better picture of what is happening. Will also give aborted_contrec_reset_3.patch a go, although I suspect it will not handle the specific case we are deaing with. Regards, Sami imseih Amazon Web Services