[ https://issues.apache.org/jira/browse/KAFKA-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15870914#comment-15870914 ]
ASF GitHub Bot commented on KAFKA-4494:
---------------------------------------

GitHub user dguy opened a pull request:

    https://github.com/apache/kafka/pull/2560

    KAFKA-4494: Reduce startup and rebalance time

    Replace one-by-one initialization of state stores with bulk initialization.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dguy/kafka kafka-4494

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/kafka/pull/2560.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2560

----
commit 47a8cecb3494e4496d5e243e633bbedf81f2a967
Author: Damian Guy <damian....@gmail.com>
Date:   2017-02-14T00:48:48Z

    Bulk initialization of state stores

----

> Significant startup delays in KStreams app
> ------------------------------------------
>
>                 Key: KAFKA-4494
>                 URL: https://issues.apache.org/jira/browse/KAFKA-4494
>             Project: Kafka
>          Issue Type: Bug
>          Components: streams
>    Affects Versions: 0.10.1.0
>         Environment: AWS Linux ami, mac os
>            Reporter: j yeargers
>            Assignee: Damian Guy
>              Labels: performance
>
> Often starting a KStreams based app results in significant (5-10 minutes)
> delay before processing of stream begins.
> Sample debug output:
> https://gist.github.com/jyeargers/e8398fb353d67397f99148bc970479ee
> Topology in question: stream -> map -> groupbykey.aggregate -> print
> Stream is JSON.

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)