[ https://issues.apache.org/jira/browse/FLINK-12699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yu Li reassigned FLINK-12699: ----------------------------- Assignee: PengFei Li (was: Yu Li) > Reduce CPU consumption when snapshot/restore the spilled key-group > ------------------------------------------------------------------ > > Key: FLINK-12699 > URL: https://issues.apache.org/jira/browse/FLINK-12699 > Project: Flink > Issue Type: Sub-task > Components: Runtime / State Backends > Reporter: Yu Li > Assignee: PengFei Li > Priority: Major > > We need to prevent the unnecessary de/serialization when > snapshotting/restoring the spilled state key-group. To achieve this, we need > to: > 1. Add meta information for {{HeapKeyedStatebackend}} checkpoint on DFS, > separating the on-heap and on-disk part > 2. Write the off-heap bytes directly to DFS when checkpointing and mark it as > on-disk > 3. Directly write the bytes onto disk when restoring the data back from DFS, > if it's marked as on-disk > Notice that we cannot directly use file copy since we use mmap meanwhile > support copy-on-write. -- This message was sent by Atlassian JIRA (v7.6.3#76005)