Re: Difficult to debug reason for checkpoint decline

2019-10-07 Thread Chesnay Schepler
There does indeed appear to be a code path in the StreamTask where an exception might not be logged on the TaskExecutor (StreamTask#handleExecutionException). In FLINK-10753 the CheckpointCoordinator was adjusted to log the full stacktrace; that change is part of 1.5.6. On 07/10/2019 09:51, Daniel Ha

Difficult to debug reason for checkpoint decline

2019-10-07 Thread Daniel Harper
We had an issue recently where no checkpoints were able to complete, with the following message in the job manager logs: 2019-09-25 12:27:57,159 INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Decline checkpoint 7041 by task 1f789ac3c5df655fe5482932b2255fd3 of job 214ccf9a