Re: Perf on history reprocessing

2020-10-25 Thread Si Tang
Hi Mathieu, I recently helped my team diagnose a performance issue with Kafka Streams state store restoration (10X time after moving from 20 partitions to 60 partitions). Understanding the bottleneck should be the first thing to do. The problem in our case was memory. Our application was spending

Re: Perf on history reprocessing [kafka-streams]

2020-10-25 Thread Mathieu D
To clarify my question: here i'm focusing on the kafka-streams part. Le ven. 23 oct. 2020 à 20:07, Mathieu D a écrit : > Hello there > > Sometimes we need to reprocess a large amount of history data. > I find the performance in that case quite disappointing. More precisely > throughput is quite

Re: Perf on history reprocessing

2020-10-25 Thread Fabio Pardi
hi Mathieu, the best approach in my opinion is to try to understand where your bottleneck is, analyzing the graphs produced during history reprocessing. my best bet are the disks, but indeed it might be anywhere. regards, fabio pardi On 23/10/2020 20:07, Mathieu D wrote: > Hello there > >

Perf on history reprocessing

2020-10-23 Thread Mathieu D
Hello there Sometimes we need to reprocess a large amount of history data. I find the performance in that case quite disappointing. More precisely throughput is quite low (which is not surprising for a system optimized for low latency). Is there any knob to turn to get a much higher throughput in