Flink version: 1.10.0
2023-07-19 12:33:52
org.apache.flink.util.FlinkRuntimeException: Exceeded checkpoint tolerable
failure threshold.
    at org.apache.flink.runtime.checkpoint.CheckpointFailureManager
.handleTaskLevelCheckpointException(CheckpointFailureManager.java:87)
    at org.apache.flink.runtime.checkpoint.CheckpointCoordinator
.failPendingCheckpointDueToTaskFailure(CheckpointCoordinator.java:1467)
    at org.apache.flink.runtime.checkpoint.CheckpointCoordinator
.discardCheckpoint(CheckpointCoordinator.java:1377)
    at org.apache.flink.runtime.checkpoint.CheckpointCoordinator
.receiveDeclineMessage(CheckpointCoordinator.java:719)
    at org.apache.flink.runtime.scheduler.SchedulerBase
.lambda$declineCheckpoint$5(SchedulerBase.java:807)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:
511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask
.access$201(ScheduledThreadPoolExecutor.java:180)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask
.run(ScheduledThreadPoolExecutor.java:293)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor
.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor
.java:624)
    at java.lang.Thread.run(Thread.java:748)

Please help me, how to fix the issue
Job is recovering. but i dont want restart my job. because inprogress file
are not marked as done.
Regards,
Nagireddy Y.

On Wed, Jul 19, 2023 at 5:55 PM Y SREEKARA BHARGAVA REDDY <
ynagiredd...@gmail.com> wrote:

> Flink is restarting daily once.
> Flink version: 1.10.0
> 2023-07-19 12:33:52
> org.apache.flink.util.FlinkRuntimeException: Exceeded checkpoint
> tolerable failure threshold.
>     at org.apache.flink.runtime.checkpoint.CheckpointFailureManager
> .handleTaskLevelCheckpointException(CheckpointFailureManager.java:87)
>     at org.apache.flink.runtime.checkpoint.CheckpointCoordinator
> .failPendingCheckpointDueToTaskFailure(CheckpointCoordinator.java:1467)
>     at org.apache.flink.runtime.checkpoint.CheckpointCoordinator
> .discardCheckpoint(CheckpointCoordinator.java:1377)
>     at org.apache.flink.runtime.checkpoint.CheckpointCoordinator
> .receiveDeclineMessage(CheckpointCoordinator.java:719)
>     at org.apache.flink.runtime.scheduler.SchedulerBase
> .lambda$declineCheckpoint$5(SchedulerBase.java:807)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:
> 511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.
> ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(
> ScheduledThreadPoolExecutor.java:180)
>     at java.util.concurrent.
> ScheduledThreadPoolExecutor$ScheduledFutureTask.run(
> ScheduledThreadPoolExecutor.java:293)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(
> ThreadPoolExecutor.java:1149)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(
> ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748)
>
> Please help me, how to fix the issue
> Job is recovering. but i dont want restart my job. because inprogress file
> are not marked as done.
> Regards,
> Nagireddy Y.
>
>

Reply via email to