Re: Exceeded Checkpoint tolerable failure threshold Exception

2021-10-07 Thread Caizhi Weng
Hi! You need to find the root cause of the checkpoint failures. Check the "Checkpoint" tab to see whether checkpoints are timing out, and the "Exception" tab for exception messages other than this one. You can also look through the logs for suspicious entries. If checkpoint failures are rar
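
For reference, here is a rough sketch of the checkpoint settings that relate to this exception, as they might appear in flink-conf.yaml. The values are illustrative assumptions, not taken from this thread or recommended for any particular job, and the option names should be checked against the documentation for the Flink version in use. Raising the tolerable failure count only hides occasional failures; it does not fix a checkpoint that keeps timing out.

```yaml
# flink-conf.yaml (illustrative values only, assumed for this sketch)

# How often a checkpoint is triggered.
execution.checkpointing.interval: 1min

# How long a checkpoint may run before it is declared failed.
execution.checkpointing.timeout: 20min

# How many consecutive checkpoint failures are tolerated before the
# job fails with "Exceeded checkpoint tolerable failure threshold".
execution.checkpointing.tolerable-failed-checkpoints: 3
```

The same limits can also be set per job through the CheckpointConfig object of the execution environment, if changing the cluster-wide configuration is not an option.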

Exceeded Checkpoint tolerable failure threshold Exception

2021-10-07 Thread Robert Cullen
I have Flink set up with two TaskManagers and one JobManager. I've allocated 25 GB of JVM heap and 15 GB of Flink managed memory. I have two jobs running. After 3 hours, this exception was thrown. How can I configure Flink to prevent this from happening? 2021-10-07 12:38:50 org.apache.flink.util.Fl