[ https://issues.apache.org/jira/browse/FLINK-15132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Stephan Ewen closed FLINK-15132. -------------------------------- > Checkpoint Coordinator does Checkpoint I/O in JobMaster Main Thread > ------------------------------------------------------------------- > > Key: FLINK-15132 > URL: https://issues.apache.org/jira/browse/FLINK-15132 > Project: Flink > Issue Type: Bug > Components: Runtime / Checkpointing > Reporter: Stephan Ewen > Priority: Blocker > > The {{PendingCheckpoint.completePendingCheckpoint()}} method is called > synchronously from within the Scheduler / JobMaster Main Thread. > The method writes out the checkpoint metadata, which is a potentially > blocking I/O method. > Because the target may block arbitrarily long (for example S3 when load > throttling), this can bring down the entire cluster (blocking actor threads, > heartbeat timeouts). -- This message was sent by Atlassian Jira (v8.3.4#803005)