Re: Struggling with Kafka Streams rebalances under load / in production

2016-12-12 Thread Guozhang Wang
Robert, To validate whether a rebalance has happened, you can check the server-side logs for entries starting with "Preparing to restabilize group %s with old generation..". If the rebalance was triggered by a detected consumer failure, there will be some entries like "Member XX in group YY has failed" before the "preparing"
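
The log check described above could be scripted roughly as follows. This is a minimal sketch, assuming the broker log has already been read into a list of lines. The class name `RebalanceLogScan` and the sample lines are hypothetical; only the two marker phrases come from the message, and the exact log wording may differ between broker versions.

```java
import java.util.Arrays;
import java.util.List;

public class RebalanceLogScan {
    // Phrases quoted in the message above; actual broker wording may vary by version.
    static final String REBALANCE_MARKER = "Preparing to restabilize group";
    static final String FAILURE_MARKER = "has failed";

    /** Returns {total rebalances, rebalances preceded by a member-failure entry}. */
    static int[] countRebalances(List<String> lines) {
        int total = 0;
        int afterFailure = 0;
        boolean failureSeen = false; // a failure entry was seen since the previous rebalance
        for (String line : lines) {
            if (line.contains(FAILURE_MARKER)) {
                failureSeen = true;
            } else if (line.contains(REBALANCE_MARKER)) {
                total++;
                if (failureSeen) {
                    afterFailure++;
                }
                failureSeen = false;
            }
        }
        return new int[] {total, afterFailure};
    }

    public static void main(String[] args) {
        // Hypothetical sample lines built from the placeholders used in the message.
        List<String> sample = Arrays.asList(
            "Member XX in group YY has failed",
            "Preparing to restabilize group YY with old generation 1",
            "Preparing to restabilize group YY with old generation 2");
        int[] counts = countRebalances(sample);
        System.out.println("rebalances=" + counts[0] + ", after member failure=" + counts[1]);
    }
}
```

A high share of rebalances preceded by a failure entry would point toward consumers being declared dead rather than members joining or leaving deliberately.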

Re: Struggling with Kafka Streams rebalances under load / in production

2016-12-12 Thread Jay Kreps
I think the most common cause of rebalancing is still GC that exceeds the consumer liveness timeout you've configured. Might be worth enabling GC logging in Java and then checking the pause times. If they exceed the timeout you have for liveness, then you will detect that as a process failure and re
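
The comparison suggested above can be sketched in a few lines. On a Java 8 JVM, flags such as `-XX:+PrintGCDetails` and `-Xloggc:<file>` turn on GC logging; how the pause durations are extracted from that log is left out here because the format depends on the JVM and collector. The class name `GcPauseCheck`, the sample pause values, and the 10000 ms timeout are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class GcPauseCheck {
    /** Returns the pauses (in ms) that are longer than the liveness timeout (in ms). */
    static List<Long> pausesExceeding(List<Long> pausesMs, long timeoutMs) {
        List<Long> tooLong = new ArrayList<>();
        for (long pause : pausesMs) {
            if (pause > timeoutMs) {
                tooLong.add(pause);
            }
        }
        return tooLong;
    }

    public static void main(String[] args) {
        // Hypothetical pause durations, as if already extracted from a GC log.
        List<Long> pausesMs = Arrays.asList(120L, 800L, 12500L, 31000L);
        long livenessTimeoutMs = 10000L; // hypothetical; use the timeout actually configured
        System.out.println("Pauses exceeding timeout: "
            + pausesExceeding(pausesMs, livenessTimeoutMs));
    }
}
```

Any pause returned by this check is long enough for the consumer to miss its liveness deadline, which would be consistent with a GC-induced rebalance.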

Re: Struggling with Kafka Streams rebalances under load / in production

2016-12-11 Thread Damian Guy
Hi Rob,

Do you have any further information you can provide? Logs etc? Have you configured max.poll.interval.ms?

Thanks,
Damian

On Sun, 11 Dec 2016 at 20:30 Robert Conrad wrote:
> Hi All,
>
> I have a relatively complex streaming application that seems to struggle
> terribly with rebalance iss
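
For reference, the setting Damian asks about is an ordinary consumer property that can be supplied alongside the Streams configuration. The sketch below is an assumption-laden illustration, not the configuration from this thread: the class name `StreamsTimeoutConfig`, the application id, the broker address, and every numeric value are hypothetical. It uses plain string keys so that it needs only the JDK; in a real application the resulting `Properties` would be handed to the Kafka Streams client.

```java
import java.util.Properties;

public class StreamsTimeoutConfig {
    /** Builds a Properties object with rebalance-related timeouts (all values hypothetical). */
    static Properties buildConfig() {
        Properties props = new Properties();
        props.put("application.id", "example-streams-app"); // hypothetical name
        props.put("bootstrap.servers", "localhost:9092");   // hypothetical broker address
        // Maximum time allowed between poll() calls before the consumer is
        // considered failed and a rebalance is triggered.
        props.put("max.poll.interval.ms", "300000");        // hypothetical value
        // Time without heartbeats after which the consumer is considered dead.
        props.put("session.timeout.ms", "30000");           // hypothetical value
        return props;
    }

    public static void main(String[] args) {
        Properties props = buildConfig();
        System.out.println("max.poll.interval.ms = " + props.getProperty("max.poll.interval.ms"));
        System.out.println("session.timeout.ms = " + props.getProperty("session.timeout.ms"));
    }
}
```

Suitable values depend on how long a single batch takes to process and on the GC pauses observed, so these numbers should be treated only as placeholders.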

Struggling with Kafka Streams rebalances under load / in production

2016-12-11 Thread Robert Conrad
Hi All, I have a relatively complex streaming application that seems to struggle terribly with rebalance issues while under load. Does anyone have any tips for investigating what is triggering these frequent rebalances or particular settings I could experiment with to try to eliminate them? Origi