[ 
https://issues.apache.org/jira/browse/FLINK-30623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17677312#comment-17677312
 ] 

Rui Fan commented on FLINK-30623:
---------------------------------

Thanks [~Yanfei Lei]   reports this issue, and [~martijnvisser] [~pnowojski] 
discuss here.


After analysis, I think the root cause is FLINK-30533 doesn't check 
`recordWriter.isAvailable()` in the loop of SourceOperator.

When the recordWriter is unavailable, task shouldn't send data to downstream 
task, the task thread may get stuck in requestMemoryBuffer due to backpressure. 
The checkpoint barrier cannot be handler when task is stucking in 
requestMemoryBuffer, that's why performance regression for unaligned 
checkpoint. 

cc [~lindong] 

 

> Performance regression in checkpointSingleInput.UNALIGNED on 04.01.2023
> -----------------------------------------------------------------------
>
>                 Key: FLINK-30623
>                 URL: https://issues.apache.org/jira/browse/FLINK-30623
>             Project: Flink
>          Issue Type: Bug
>          Components: Benchmarks, Runtime / Checkpointing
>            Reporter: Martijn Visser
>            Priority: Blocker
>
> Performance regression
> checkpointSingleInput.UNALIGNED median=338.1445195 recent_median=67.6453005
> checkpointSingleInput.UNALIGNED_1 median=213.230041 recent_median=39.830277
> deployAllTasks.STREAMING median=168.533106 recent_median=159.8534395
> stateBackends.MEMORY median=3229.0248875 recent_median=2985.782919
> tupleKeyBy median=4155.684199 recent_median=3987.5812305
> http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=checkpointSingleInput.UNALIGNED&extr=on&quarts=on&equid=off&env=2&revs=200
> http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=checkpointSingleInput.UNALIGNED_1&extr=on&quarts=on&equid=off&env=2&revs=200
> http://codespeed.dak8s.net:8000/timeline/#/?exe=8&ben=deployAllTasks.STREAMING&extr=on&quarts=on&equid=off&env=2&revs=200
> http://codespeed.dak8s.net:8000/timeline/#/?exe=6&ben=stateBackends.MEMORY&extr=on&quarts=on&equid=off&env=2&revs=200
> http://codespeed.dak8s.net:8000/timeline/#/?exe=6&ben=tupleKeyBy&extr=on&quarts=on&equid=off&env=2&revs=200



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to