Re: performance tuning - where does the slowness come from?

B. Todd Burruss Thu, 06 May 2010 14:43:23 -0700

i think you will see a slowdown because of large values in your columns. make sure you take a look at MemtableThroughputInMB in your config. if you are writing 1MB of data per row, then you'll probably want to increase this quite a bit so you are not constantly creating sstables. can't recall, did you see compaction mgr reporting a lot of pending compactions? maybe try to "chunk" your data into multiple columns or multiple rows.
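One way to picture the "chunking" idea is the sketch below. It only shows how a large serialized value could be split into fixed-size pieces and put back together. The 64 KB chunk size, the `chunk_NNNNNN` column naming, and the function names are all assumptions made for illustration; nothing here is a Cassandra client API, and the actual write/read calls are left out.

```python
# Assumed chunk size for illustration only; tune it against your own measurements.
CHUNK_SIZE = 64 * 1024


def split_into_columns(value: bytes, chunk_size: int = CHUNK_SIZE) -> dict:
    """Split one large value into {column_name: chunk} pairs.

    Column names are zero-padded (hypothetical naming scheme) so that
    sorting them lexically restores the original chunk order.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    columns = {}
    for index, offset in enumerate(range(0, len(value), chunk_size)):
        columns["chunk_%06d" % index] = value[offset:offset + chunk_size]
    return columns


def join_columns(columns: dict) -> bytes:
    """Reassemble the original value from the chunk columns."""
    return b"".join(columns[name] for name in sorted(columns))
```

Each entry of the returned dict would be written as its own column (or its own row), and a read would fetch all of them and call `join_columns`. Whether this actually helps depends on the workload, so it is worth measuring before and after.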

i too see slowness that exhibits in the same manner as you guys have described. i'm still trying to track it down as well.


On 05/06/2010 10:56 AM, Ran Tavory wrote:

Jonathan, I think it's the case of large values in the columns. The problematic CF is a key-value store, so it has only one column per row, however the value of that column can be large. It's a Java serialized object (uncompressed) which may be 100s of bytes, maybe even a few megs. This CF also suffers from zero cache hits since each read is for a unique key.
I ran stress.py and I see much better results (reads are < 1ms) so I assume my cluster is healthy, so I need to fix the app. Would a 1 MB object explain a 30ms (sometimes even more) read latency? The boxes aren't fancy, not sure exactly what hardware we have there but it's "commodity"...
Thanks!
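
A rough way to reason about the 1 MB question is a back-of-envelope estimate like the sketch below. The function name and every number fed into it (seek time, disk throughput, network speed) are illustrative assumptions, not measurements of the hardware in this thread, and the model ignores deserialization, GC, and reads that touch several sstables.

```python
def estimated_read_ms(value_bytes, seek_ms, disk_mb_per_s, network_mbit_per_s):
    """Very rough uncached-read estimate: one seek + disk transfer + network transfer."""
    disk_transfer_ms = value_bytes / (disk_mb_per_s * 1e6) * 1000
    network_ms = value_bytes * 8 / (network_mbit_per_s * 1e6) * 1000
    return seek_ms + disk_transfer_ms + network_ms


# Assumed figures: 10 ms seek, 100 MB/s sequential disk read, 1 Gbit/s network.
small = estimated_read_ms(500, 10, 100, 1000)        # a few hundred bytes
large = estimated_read_ms(1_000_000, 10, 100, 1000)  # about 1 MB
print("small value: %.2f ms, large value: %.2f ms" % (small, large))
```

Under these assumed figures the transfer time of a large value is no longer negligible next to the seek, so values in the megabyte range could plausibly account for latencies in the tens of milliseconds on uncached reads. Real numbers would have to come from measuring the actual cluster.
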
On Thu, May 6, 2010 at 5:22 PM, Jonathan Ellis <jbel...@gmail.com<mailto:jbel...@gmail.com>> wrote:
    columns, not CFs.

    put another way, how wide are the rows in the slow CF?

    On Wed, May 5, 2010 at 11:30 PM, Ran Tavory <ran...@gmail.com <mailto:ran...@gmail.com>> wrote:
    > I have a few CFs but the one I'm seeing slowness in, which is the one with
    > plenty of cache misses, has only one column per key.
    > Latency varies b/w 10ms and 60ms but I'd say average is 30ms.
    >
    > On Thu, May 6, 2010 at 4:25 AM, Jonathan Ellis <jbel...@gmail.com <mailto:jbel...@gmail.com>> wrote:
    >>
    >> How many columns are in the rows you are reading from?
    >>
    >> 30ms is quite high, so I suspect you have relatively large rows, in
    >> which case decreasing the column index threshold may help.

    --
    Jonathan Ellis
    Project Chair, Apache Cassandra
    co-founder of Riptano, the source for professional Cassandra support
    http://riptano.com
