Abuli Palagashvili created KAFKA-10480:
------------------------------------------

             Summary: Kafka streams application is being stuck
                 Key: KAFKA-10480
                 URL: https://issues.apache.org/jira/browse/KAFKA-10480
             Project: Kafka
          Issue Type: Bug
          Components: consumer, streams
    Affects Versions: 2.2.0, 0.10.2.1
            Reporter: Abuli Palagashvili


*prerequisites:*
 * Kafka cluster running on version 0.10.2.1
 * Topic with 24 partitions, load up to 20k RPS, stored bare Strings with null 
keys
 * Kafka-streams application that reads records from source topic and writes to 
another, target partition defined from key extracted from record.Uses library 
version 2.2.0

*Problem:*

After application start all goes ok, but sometimes I get this message:

2020-09-10 20:09:41 WARN AbstractCoordinator:1119 - [Consumer 
clientId=sharder-application-1-8545e058-3494-4951-93d3-94bb4833be44-StreamThread-5-consumer,
 groupId=sharder-application-1] This member will leave the group because 
consumer poll timeout has expired. This means the time between subsequent calls 
to poll() was longer than the configured max.poll.interval.ms, which typically 
implies that the poll loop is spending too much time processing messages. You 
can address this either by increasing max.poll.interval.ms or by reducing the 
maximum size of batches returned in poll() with max.poll.records.

I tried playing with these config properties, but still face that problem.
My application processes records pretty fast and I think that it just loses 
connections to kafka cluster.Another problem is that I can't handle group 
member leaving, because application doesn't throw any exception and doesn't 
change it's state so I can't catch that situation as one does it here.
https://dzone.com/articles/whats-the-proper-kubernetes-health-check-for-a-kaf

Maybe somebody also faced this?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to