On Thu, Feb 5, 2015 at 4:15 PM, Andreas Finke <andreas.fi...@solvians.com>
wrote:

>  we are currently writing the same column within a row multiple times (up
> to 10 times a second). I am familiar with the concept of tombstones in
> SSTables. My question is: I assume that in our case in most cases when a
> column gets overwritten it still resides in the memtable. So I assume for
> that particular case no tombstone is set but the column is replaced in
> memory and then the 'newest' version is flushed to disk.
>

Memtables are periodically flushed; some percentage of these "bursts" will
always cross a flush boundary, leading to at least two writes to disk for
the same exact value. This is without considering any concerns of the class
that Johnathan Haddad mentions.

I personally would be conceptually uncomfortable with such a design as
having a strong smell of Doing It Wrong, but you should be able to design a
test which illustrates how badly (or not?) it bloats your SSTables with
garbage in actual operation?

Or Is writing the same column an an anti-pattern?
>

Writing the same column 10 times in a second is likely to be an
anti-pattern for a log structured data-store with immutable data files.
Consider a memory oriented database?

=Rob

Reply via email to