RE: Flink restarts on Checkpoint failure

2021-09-01 Thread Schwalbe Matthias
look into when I run into similar situations Feel free to get back to the mailing list for further clarifications … Thias From: Caizhi Weng Sent: Donnerstag, 2. September 2021 04:24 To: Daniel Vol Cc: user Subject: Re: Flink restarts on Checkpoint failure Hi! There are a ton of possible

Re: Flink restarts on Checkpoint failure

2021-09-01 Thread Caizhi Weng
Hi! There are a ton of possible reasons for a checkpoint failure. The most possible reasons might be * The JVM is busy with garbage collecting when performing the checkpoints. This can be checked by looking into the GC logs of a task manager. * The state suddenly becomes quite large due to some sp

Flink restarts on Checkpoint failure

2021-09-01 Thread Daniel Vol
Hello, I see the following error in my jobmanager log (Flink on EMR): Checking cluster logs I see : 2021-08-21 17:17:30,489 [Checkpoint Timer] INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Triggering checkpoint 1 (type=CHECKPOINT) @ 1629566250303 for job c513e9ebbea4ab72d80b133