State store and input topics with different number of partition

Hugues . Deslandes Fri, 01 Sep 2017 05:43:06 -0700

Hi,

I'd like to have your comments on the problem I met while testing my app 
with kafka streams  (0.10.2.1) 
Roughly, my stream app has 2 input topics :
. the first one has 4 partitions (main data)
. the second one has only one partition and receives messages from time to 
time


At first, I supposed I had 2 sub topologies 
A : With the first topic, I build a state store using process() and I also 
have punctuate activated . 
B : The second topic is used to trigger an analysis using the state store 
data with process()
(both processes use the same kafka topic as sink) 

During tests I realised the content of the state store viewed by this 
process B is only based on data received on partition 0 fo the first topic
I finally understood the link between those 2 sub-topologies  forced the 
system to see it as one unique topology and have only one task by 
partition reading the first and second topic; am I right ?

I imagined 2 options to solve this issue 
option 1 : replace topology B by a consumer on second topic that will 
trigger a query on statestore 
option 2 : have 4 partitions for topic 2 and write the same message in the 
4 partitions 

I tested both but not sure which one is better ...
Do you have any other suggestions or comments
Thanks in advance.

Hugues

State store and input topics with different number of partition

Reply via email to