Yangyang ZHANG created FLINK-33127: -------------------------------------- Summary: HeapKeyedStateBackend: use buffered I/O to speed up local recovery Key: FLINK-33127 URL: https://issues.apache.org/jira/browse/FLINK-33127 Project: Flink Issue Type: Improvement Components: Runtime / State Backends Reporter: Yangyang ZHANG Attachments: thread_dump.png
Recently, I observed a slow restore case in local recovery using hashmap statebackend. It took 147 seconds to restore from a 467MB snapshot, 9 times slower than that (16s) when restore from remote fs. The thread dump show that It read local snapshot file directly by unbuffered FileInputStream / fs.local.LocalDataInputStream. !thread_dump.png! Maybe we can wrap with BufferInputStream to speed up local recovery. -- This message was sent by Atlassian Jira (v8.20.10#820010)