Re: KTable.suppress(Suppressed.untilWindowCloses) does not suppress some non-final results when the kafka streams process is restarted

2019-01-23 Thread Peter Levart
Hi John, Sorry I haven't had time to prepare the minimal reproducer yet. I still have plans to do it though... On 1/22/19 8:02 PM, John Roesler wrote: Hi Peter, Just to follow up on the actual bug, can you confirm whether: * when you say "restart", do you mean orderly shutdown and restart, o

Re: Open files clogging and KafkaStreams

2019-01-23 Thread Niklas Lönn
Hi Guozhang, I think I went a bit ahead of myself in describing the situation, I had an attachment with the context in detail, maybe it was filtered out. Lets try again =) We have a topology looking something like this: input-topic[20 partitions, compacted] | use-case-repartition[20 partitio

Re: Kafka Consumer Not Assigned Partitions

2019-01-23 Thread chinchu chinchu
Hello, I have subscribed to a kafka topic as below . I need to run some logic only after the consumer has been assigned a partition .How ever consumer.assignment() comes back as an empty set no matter how long I wait . If I do not have the while loop and then do a consumer.poll() I do get the r

Kafka's cluster for thousands of clients (up to 30k)

2019-01-23 Thread Романенко Денис Владимирович
Hi all! We're trying to use Kafka to deliver messages to thousands of clients (different data for each client). Actually we want to get behavior pretty much like message broker, because we have to work with continious flow of small data pieces. Suppose, we should create a partition for each cl

Re: Open files clogging and KafkaStreams

2019-01-23 Thread Guozhang Wang
I see. What you described is a known issue in the older version of Kafka, that some high traffic topics in the bootstrap mode may effectively "starve" other topics in the fetch response, since brokers used to naively fill in the bytes that meets the max.bytes configuration and returns. This is fix

Error deserializing Avro message when using SpecificSerde

2019-01-23 Thread chinchu chinchu
Hey folks, I am getting the below error when reading data from a kafka topic . I have used confluent serializers to serialize this data but when trying to consume using confuent deserializer running into the below error.Any idea on what the issue could be here ?.Also how do I skip this recor

Re: Kafka Consumer Not Assigned Partitions

2019-01-23 Thread Matthias J. Sax
Calling `consumer.subscribe()` is a local call. Only when you call `consumer.poll()` the consumer will connect to the broker to get its assignment. Thus, it's save to call `poll()` directly. `assignment()` will return the assignment only after the first `poll()` call. -Matthias On 1/23/19 9:00

Re: Error deserializing Avro message when using SpecificSerde

2019-01-23 Thread chinchu chinchu
My application did not have the com.test.model.avro.Log class during run time . The log messages indicated this,it was just my oversight. On Wed, Jan 23, 2019 at 10:28 AM chinchu chinchu wrote: > Hey folks, > I am getting the below error when reading data from a kafka topic . I > have used

Re: Kafka Consumer Not Assigned Partitions

2019-01-23 Thread chinchu chinchu
Thanks Matthias. On Wed, Jan 23, 2019 at 10:55 AM Matthias J. Sax wrote: > Calling `consumer.subscribe()` is a local call. Only when you call > `consumer.poll()` the consumer will connect to the broker to get its > assignment. Thus, it's save to call `poll()` directly. > > `assignment()` will re

Re: Open files clogging and KafkaStreams

2019-01-23 Thread Niklas Lönn
I have to double check what version of broker we run in production but when testing and verifying the issue locally I did reproduce it with both broker and client version 2.1.0 Kind regards Niklas On Wed 23. Jan 2019 at 18:24, Guozhang Wang wrote: > I see. > > What you described is a known issu

Kafka Hdfs Connect Flush Size

2019-01-23 Thread chinchu chinchu
Hey folks, I have been going through the hdfs connector code . I have a one question. Is the flush size in connector config the number of records read from a kafka partition or the number of records written to an hdfs path?. Looks like the recordCounter in TopicPartitionWriter is incremented

Re: Open files clogging and KafkaStreams

2019-01-23 Thread Guozhang Wang
I see (btw attachments are usually not allowed in AK mailing list, but if you have it somewhere like gitcode and can share the url that works). Could you let me know how many physical cores do you have in total hosting your app and how many threads did you configure? From your current description

Single consumer subscribing to multiple topics

2019-01-23 Thread Manu Jacob
Hi Experts, I have a requirement to consume records from multiple topics and process those messages with a similar pattern. The schema of the messages in each topic is different and I am consuming it as Generic records. Both these topics have very high rate of traffic (10-15K messages/second) a