[ https://issues.apache.org/jira/browse/FLINK-15670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17019824#comment-17019824 ]
Jiangjie Qin commented on FLINK-15670: -------------------------------------- I am wondering how do we expose this feature to the users? One possibility is to integrate this with interactive queries (i.e. {{#cache()}} method) with an intermediate result storage set to Kafka. For SQL, a temporary table DDL could be introduced to mark a materialized temporary table store in Kafka by using these connectors. The {{#cache()}} method can also take some configurations which allows user to set retention time. Something similar can be done for the temp table DDL also. > Provide a Kafka Source/Sink pair that aligns Kafka's Partitions and Flink's > KeyGroups > ------------------------------------------------------------------------------------- > > Key: FLINK-15670 > URL: https://issues.apache.org/jira/browse/FLINK-15670 > Project: Flink > Issue Type: New Feature > Components: API / DataStream, Connectors / Kafka > Reporter: Stephan Ewen > Priority: Major > Labels: usability > Fix For: 1.11.0 > > > This Source/Sink pair would serve two purposes: > 1. You can read topics that are already partitioned by key and process them > without partitioning them again (avoid shuffles) > 2. You can use this to shuffle through Kafka, thereby decomposing the job > into smaller jobs and independent pipelined regions that fail over > independently. -- This message was sent by Atlassian Jira (v8.3.4#803005)