Hi all,

we just had a case with Kafka 0.9 where an index rebuild for ~200M segments took on average 45 seconds. All indexes of a partition were corrupt. There are 13 segments and the rebuild took 10 minutes.

After the rebuild, these are representative sizes:

% ll -h /data/xyz-0
-rw-r--r-- 1 solr solr  45K Aug 16 10:44 00000000000000096346.index
-rw-r--r-- 1 solr solr 191M Aug 16 10:44 00000000000000096346.log

I now wonder whether this is an expected. Here is a log-excerpt showing the long run time for this segment, this one more than one minute.


[2016-08-16 10:44:20,831] WARN Found a corrupted index file, /data/xyz-0/00000000000000096346.index, deleting and rebuilding index... (kafka.log.Log) [2016-08-16 10:45:46,305] WARN Found a corrupted index file, /data/xyz-0/00000000000000011722.index, deleting and rebuilding index... (kafka.log.Log)

These runtime seem excessive on normal desktop hardware. Or am I underestimating the necessary effort to rebuild an index file?

Harald.

Reply via email to