[ https://issues.apache.org/jira/browse/FLINK-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14646634#comment-14646634 ]
ASF GitHub Bot commented on FLINK-2324: --------------------------------------- Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/937#issuecomment-126067262 That would be good ^^ then it's :+1: from me, at least for now. It's generally good performance-wise to have less serialised states. This means that we will have a constant number of issued writes to external storage (== #subtasks). On the other hand this also makes our life harder a bit when it comes to repartitioning, as you already mentioned we need to revisit this. > Rework partitioned state storage > -------------------------------- > > Key: FLINK-2324 > URL: https://issues.apache.org/jira/browse/FLINK-2324 > Project: Flink > Issue Type: Improvement > Reporter: Gyula Fora > Assignee: Gyula Fora > > Partitioned states are currently stored per-key in statehandles. This is > alright for in-memory storage but is very inefficient for HDFS. > The logic behind the current mechanism is that this approach provides a way > to repartition a state without fetching the data from the external storage > and only manipulating handles. > We should come up with a solution that can achieve both. -- This message was sent by Atlassian JIRA (v6.3.4#6332)