Kafka streams parallelism - why not separate stream task per partition per input topic

2020-09-02 Thread Pushkar Deole
Hi, I came across articles that explain how parallelism is handled in Kafka Streams. This is what I collected: when the streams application reads from multiple topics, the topic with the maximum number of partitions is considered for instantiating stream tasks, so 1 task is instantiated p

Re: UPGRADING ZOOKEEPER FROM 3.4.13 TO 3.5.7

2020-09-02 Thread Enrico Olivelli
The official way to fix it is here: https://issues.apache.org/jira/browse/ZOOKEEPER-3056 Basically, we have a flag to allow the boot even in that case. I suggest you upgrade to the latest 3.5.8 rather than 3.5.7. Enrico On Thu, 3 Sep 2020, 03:51 Rijo Roy wrote: > Hi Manoj, > I just faced it ye

Re: UPGRADING ZOOKEEPER FROM 3.4.13 TO 3.5.7

2020-09-02 Thread Rijo Roy
Hi Manoj, I just faced this yesterday and resolved it. I hope you are getting this error on one of the follower nodes; if yes, please create a backup folder in your zookeeper data directory and move version-2, which holds the zookeeper data, into the newly created backup folder. Starting the zookeeper process wi

Re: [External] key and subkey sequential processing across competing consumer groups?

2020-09-02 Thread Andre Mermegas
kinda: A:X:FOO B:X:FOO B:Y:BAR B:Z:BAZ A:X:FOO and B:X:FOO can be processed in parallel, but B:Z:BAZ must be processed after B:Y:BAR and after B:X:FOO where A and B go to some partition for sequencing, but then "X", "Y" and "Z" go to a different sub topic and partition for processing. sequencing

Re: [External] key and subkey sequential processing across competing consumer groups?

2020-09-02 Thread Tauzell, Dave
So if the stream is: A:1:FOO A:3:BAR A:3:BAZ Then A:3* must be processed after A:1 but A:3:BAR and A:3:BAZ can be processed in any order? I don’t think there is a way to do that with topics. -Dave From: Andre Mermegas Reply-To: "users@kafka.apache.org" Date: Wednesday, September 2, 2020 at

UPGRADING ZOOKEEPER FROM 3.4.13 TO 3.5.7

2020-09-02 Thread Manoj.Agrawal2
Hi all, I’m planning to upgrade Kafka 2.2.1 to Kafka 2.5.0. I’m getting the error below while upgrading the zookeeper version. Any idea? java.io.IOException: No snapshot found, but there are log entries. Something is broken! at org.apache.zookeeper.server.persistence.FileTx

key and subkey sequential processing across competing consumer groups?

2020-09-02 Thread Andre Mermegas
Hi All, New to Kafka and would love some feedback on how to think about a solution for this kind of flow: sequencing must be maintained for events of type X:Y:PAYLOAD, where X is a grouping mechanism and all X must be processed sequentially, e.g. account number (OK, I use key here to keep thes

Re: Streams constantly rebalancing,

2020-09-02 Thread Sophie Blee-Goldman
Hey Steve, I guess the first question I have is: is the consumer group actually rebalancing, or does KafkaStreams just appear to be "stuck" in the REBALANCING state? If this application has been run before and there is a large amount of data in the changelog topics, it might take a while to rebuild

Re: Streams constantly rebalancing,

2020-09-02 Thread Steve Jones
Just an addition to this in case someone can help: I'm seeing zero thread activity during the rebalancing, but I'm also not seeing anything logged by Kafka to indicate it's doing anything about rebalancing (logging level set to ALL). Producer and consumer code works fine, but streams just appears to ha

Advice for Kafka project in Africa...

2020-09-02 Thread cedric sende lubuele
Let me introduce myself: my name is Cedric and I am a network engineer passionate about new technologies. As part of my new activity, I am interested in Big Data. Currently, I live in Africa (Congo) and, as everyone knows, Africa lags far behind in terms of IT infrastructure (the Cloud is diffic

A question related to quota in Kafka

2020-09-02 Thread Tallanel Siriel
Hello everyone. I am writing because I have a question related to quotas in Kafka. I read the following articles: - https://cwiki.apache.org/confluence/display/KAFKA/KIP-13+-+Quotas - https://kafka.apache.org/documentation/#design_quotas My Kafka cluster is running version 0.1

MirrorMaker 2 - is starting mirroring from latest possible?

2020-09-02 Thread Iftach Ben-Yosef
Hello, Whenever we add a new topic to the mirroring whitelist, it starts to mirror the entire content of the source topic. If the topic is large, this can create a long lag until the entire topic is mirrored, and it can also create some smaller delays on other mirrored topics (I'm assuming this is b