Hi,

I have a stream application that performs rollup from 15mins to Hourly,
then Hourly to Daily.  The process needs to be continuously run 24 hours
and each 15 mins approx 12 million records gets posted (a JSON record per
message) into the input topic. There are 3 separate processors
corresponding to the above, where Hourly and Daily maintains the state.
So, in hourly 10million needs to be retained ever hour in the state and in
daily overall 10 million.  42GB of memory is allocated for the whole
application and throughput is fine for until first 10hrs, after that it
degrades significantly.  Any suggestions on this to identify the delay and
to increase the throughput would be of great help?  We are on Kafka 1.0.0

Thanks
Siva

Reply via email to