Hi there!
I’m newbie on Kafka.
I have an interest in streaming service, especially Kafka streaming. But I have
no Idea what’s the difference between Kafka streaming and samza.
Both has similiar architecture and functionality, I think.
What’s the main difference? What’s the pros and cons? It’s
We have problem with one of partition on _consumer_offset topic. It reached 77G
and growing.
77G__consumer_offsets-49
0__consumer_offsets-5
Our retention policy set for 100mb but apparently deleting of old data does not
happens.
log.retention.bytes=104857600
kafka_log_cleanup_inter
I narrowed this problem down to this part of the topology (and yes, it's
100% repro - for me):
KStream transactionKStream =
kStreamBuilder.stream(stringSerde,transactionSerde,TOPIC);
KTable, SumRecordCollector> ktAgg =
transactionKStream.groupByKey().aggregate(
SumRecordCollector::new,