Race condition causing corrupted sstable stats files

Hannu Kröger Tue, 22 Aug 2017 07:20:31 -0700

Hello everyone,

We ran into some nasty bug while upgrading to Cassandra 3.11:
https://issues.apache.org/jira/browse/CASSANDRA-13752


Basically there is possible race condition in tombstone histogram
serialization. If someone modifies tombstone histogram after size is
serialized but before entries are serialized then you will end up in the
situation where stats files tells that you have X entries in the histogram
but after that you actually have X+N (N=1 seen in wild) entries written in
the file. Cassandra cannot deserialize that when it next time reads the
file (after a restart at least).

I have a fix for it:
https://github.com/hkroger/cassandra/tree/cassandra-3.11-13752

Writing a test for it is a bit hard. Also testing manually that it actually
fixes it is hard so…

How can we push this forward? It’s sort of critical for us and I would like
some more experienced C* dev to give fatherly or motherly advice :)

BR,
Hannu Kröger

Race condition causing corrupted sstable stats files

Reply via email to