Hi all i have the following scenario: a. a pipeline that reads messages from kafka and a session window with 1 minute duration. b. groupbykey in order to aggregate the data c. for each 'group' i do some calculation and build a new event to send to kafka.
the output of this cycle is key1 - value1 key2 - value2 If a new message arrives with the same key i would like to have a logic that checks if the current message is : key1-value1 don't send (because it was already sent). Currently we implemented this using DB (postgres). the performance of this implementation is not very good. Is there any way to implement this without any external state? thanks a lot Sigalit