Matthias J. Sax created KAFKA-4718: -------------------------------------- Summary: Revisit DSL partitioning assumption for KStream source topics Key: KAFKA-4718 URL: https://issues.apache.org/jira/browse/KAFKA-4718 Project: Kafka Issue Type: Improvement Components: streams Reporter: Matthias J. Sax Priority: Minor
Currently, when reading one or multiple topics via a single call to {{KStreamBuilder#stream()}}, it is assumed that the data is correctly partitions by key. For "single topic" {{KStream}}, this is a fair assumption, however, for multi-topic {{KStream}}, the assumption is most likely not true if input topics have a different number of partitions, because producer use hash partitioning per default. Thus, to get correct partitions, all producer for those input topics need to use (the same or at least a compatible) custom partitioner. Making this the default assumption seem rather risky, and we might want to revisit this. Or at least update some docs with corresponding hints. -- This message was sent by Atlassian JIRA (v6.3.15#6346)