Stephan Ewen created FLINK-16443:
------------------------------------

             Summary: Fix wrong fix for user-code CheckpointExceptions
                 Key: FLINK-16443
                 URL: https://issues.apache.org/jira/browse/FLINK-16443
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
            Reporter: Stephan Ewen
             Fix For: 1.11.0

The problem of exceptions whose classes exist only in the user-code classloader was fixed by proactively serializing them inside the {{CheckpointException}}. As a result, every consumer of {{CheckpointException}} now needs to be aware of this and unwrap the serialized exception.

I believe the right fix would have been to use a SerializedException in the {{DeclineCheckpoint}} message instead, which would have confined the change to where the problem actually occurs: RPC transport.

I suggest reverting https://github.com/apache/flink/pull/9742 and applying the change described above instead.
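
A rough sketch of the suggested alternative may help make the idea concrete. It uses plain Java serialization only and is not Flink code: {{SerializedFailure}}, {{UserCodeException}}, the simplified {{DeclineCheckpoint}} and the {{roundTrip}} helper are all hypothetical stand-ins invented for illustration. The point it tries to show is that the wrapping happens once, where the message is built for transport, so the exception type itself stays untouched.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.ObjectStreamClass;
import java.io.Serializable;

final class DeclineCheckpointSketch {

    /** Hypothetical user-code exception; on a real cluster its class would live only in the user-code classloader. */
    static class UserCodeException extends RuntimeException {
        private static final long serialVersionUID = 1L;
        UserCodeException(String message) { super(message); }
    }

    /** Hypothetical stand-in for a serialized-exception wrapper: holds the original as bytes plus a printable summary. */
    static final class SerializedFailure implements Serializable {
        private static final long serialVersionUID = 1L;
        private final String originalClassName;
        private final String originalMessage;
        private final byte[] serializedCause;

        SerializedFailure(Throwable cause) throws IOException {
            this.originalClassName = cause.getClass().getName();
            this.originalMessage = cause.getMessage();
            this.serializedCause = toBytes(cause);
        }

        String describe() { return originalClassName + ": " + originalMessage; }

        /** Returns the original throwable if every class resolves in the given loader, otherwise a plain fallback. */
        Throwable deserializeError(final ClassLoader loader) {
            try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(serializedCause)) {
                @Override
                protected Class<?> resolveClass(ObjectStreamClass desc) throws ClassNotFoundException {
                    return Class.forName(desc.getName(), false, loader);
                }
            }) {
                return (Throwable) in.readObject();
            } catch (IOException | ClassNotFoundException e) {
                return new Exception(describe());
            }
        }
    }

    /** Simplified message: it carries only the serialized form, never the raw throwable. */
    static final class DeclineCheckpoint implements Serializable {
        private static final long serialVersionUID = 1L;
        final long checkpointId;
        final SerializedFailure reason;

        DeclineCheckpoint(long checkpointId, Throwable reason) throws IOException {
            this.checkpointId = checkpointId;
            this.reason = new SerializedFailure(reason); // wrapping is confined to the transport message
        }
    }

    static byte[] toBytes(Object o) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bos)) {
            out.writeObject(o);
        }
        return bos.toByteArray();
    }

    /** Simulates RPC transport. The message fields are a long, strings and a byte[], so no user class is resolved here. */
    static DeclineCheckpoint roundTrip(DeclineCheckpoint msg) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(toBytes(msg)))) {
            return (DeclineCheckpoint) in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        DeclineCheckpoint received =
                roundTrip(new DeclineCheckpoint(42L, new UserCodeException("state backend failed")));

        // A loader with no parent sees only bootstrap classes, mimicking a receiver without user code.
        ClassLoader withoutUserCode = new ClassLoader((ClassLoader) null) {};
        Throwable fallback = received.reason.deserializeError(withoutUserCode);
        System.out.println(fallback.getClass().getName() + " / " + fallback.getMessage());

        // With the user-code loader available, the original exception type is recovered.
        Throwable original = received.reason.deserializeError(DeclineCheckpointSketch.class.getClassLoader());
        System.out.println(original.getClass().getName());
    }
}
```

With this shape, code that handles the exception locally never sees a wrapper; only the receiver of the {{DeclineCheckpoint}} message has to decide which classloader to unwrap with.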