Re: Cassandra reads under write-only load, read degradation after massive writes

Jeremiah Jordan Wed, 09 Nov 2011 13:08:09 -0800

Indexed columns cause a read before write, so that the index can be updated if the column already exists.
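
To illustrate the point, here is a minimal toy sketch in Python of why a write to an indexed column implies a read. It is not Cassandra's actual code: the class `IndexedColumnFamily`, its methods and the `read_count` counter are all hypothetical names invented for this illustration. The idea is only that the old value has to be looked up before the stale index entry can be removed, and that this lookup happens even when the new value is identical to the old one.

```python
class IndexedColumnFamily:
    """Toy in-memory model of a column family with secondary indexes.

    Hypothetical illustration only; this is not how Cassandra is implemented.
    """

    def __init__(self, indexed_columns):
        self.indexed_columns = set(indexed_columns)
        self.rows = {}        # row_key -> {column_name: value}
        self.index = {}       # (column_name, value) -> set of row keys
        self.read_count = 0   # reads triggered by writes

    def _read(self, row_key, column):
        # Every lookup of an existing value counts as a read.
        self.read_count += 1
        return self.rows.get(row_key, {}).get(column)

    def write(self, row_key, column, value):
        if column in self.indexed_columns:
            # Read before write: the old value is needed to find
            # (and remove) the index entry that is about to go stale.
            old = self._read(row_key, column)
            if old is not None and old != value:
                keys = self.index[(column, old)]
                keys.discard(row_key)
                if not keys:
                    del self.index[(column, old)]
            self.index.setdefault((column, value), set()).add(row_key)
        # Non-indexed columns are written blindly, with no read.
        self.rows.setdefault(row_key, {})[column] = value


cf = IndexedColumnFamily(indexed_columns=["status"])
for _ in range(3):
    cf.write("row1", "status", "active")  # indexed: one read per write
    cf.write("row1", "payload", "x")      # not indexed: no read
print(cf.read_count)
```

In this model, rewriting the same value into an indexed column still costs one read per write, which is the pattern a write-only test against indexed columns would show in the read metrics.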


On 11/09/2011 02:46 PM, Oleg Tsernetsov wrote:

While monitoring the JMX metrics of Cassandra 0.8.7 under a write-only test, I observe significant read activity on the column family I write to. This seems strange to me, as I expected no read activity under a write-only load. The read activity is caused by the writes: when I stop the write test, the read activity disappears. The test performs parallel column writes to a single row, writing the values of a fixed column set over and over again.

The second problem is that parallel massive reads of such a row degrade over time (even without a parallel write load): Cassandra starts burning 100% of CPU, and read latency degrades by a factor of 20 compared with exactly the same row created from scratch.

The test setup is 3 Cassandra nodes with read/write consistency = Quorum. The row has 10 or more columns (tested with 10, 100, 1000 and 10000 columns); the higher the number of columns, the worse the observed degradation. The column family has 2 indexed columns that are written with exactly the same values on each and every write. Row key, column name and column value are all Utf8Type. Compacting the column family on all the nodes does not help, and the row remains "degraded". "Read" here means one of: reading all the columns with a slice query, with or without bounds; or executing a column count query for a row, with or without bounds. I use Hector as the Cassandra client.

I would be thankful if anyone could explain the read activity under write load and give any hints on the row read degradation after a massive write load on that row.

Regards,
Oleg
