Row slice / cache performance

2010-05-02 Thread James Golick
1. I wrote the list a while back about less-than-great performance when reading thousands of columns even on cache hits. Last night, I decided to try to get to the bottom of why. I tested this by setting the row cache capacity on a TimeUUIDType-sorted CF to 10, filling up a single row wi

Re: Row slice / cache performance

2010-05-02 Thread James Golick
Not sure why the first paragraph turned in to a numbered bullet... On Sun, May 2, 2010 at 11:00 AM, James Golick wrote: > >1. I wrote the list a while back about less-than-great performance when >reading thousands of columns even on cache hits. Last night, I decided to >

Re: Row slice / cache performance

2010-05-02 Thread James Golick
close to zero. But, the performance doesn't seem to have noticeably improved. So, I'm not sure what I'm missing here. Either my test methodology is broken or I completely misread the profile. On Sun, May 2, 2010 at 11:01 AM, James Golick wrote: > Not sure why the first paragr

Re: Row slice / cache performance

2010-05-02 Thread James Golick
Got a ~50% improvement by making UUID comparison less heavy-weight. https://issues.apache.org/jira/browse/CASSANDRA-1043 On Sun, May 2, 2010 at 7:49 PM, James Golick wrote: > Just an update on this. I wrote a patch which attempts to solve this > problem by keeping an index of columns th

Cache capacities set by nodetool

2010-05-12 Thread James Golick
When I first brought this cluster online, the storage-conf.xml file had a few cache capacities set. Since then, we've completely changed how we use cassandra's caching, and no longer use any of the caches I setup in the original configuration. I'm finding that cassandra doesn't want to keep my new

Re: Cache capacities set by nodetool

2010-05-12 Thread James Golick
-ryan > > On Wed, May 12, 2010 at 8:16 AM, James Golick > wrote: > > When I first brought this cluster online, the storage-conf.xml file had a > > few cache capacities set. Since then, we've completely changed how we use > > cassandra's caching, and no longer

Re: Cache capacities set by nodetool

2010-05-12 Thread James Golick
Picked up out of config, I mean. On Wed, May 12, 2010 at 11:10 AM, James Golick wrote: > Hmmm that's definitely what we're seeing. Although, we aren't seeing > cache settings being picked up properly on a restart either. > > > On Wed, May 12, 2010 at 8:42 AM,

Continuously increasing RAM usage

2010-05-26 Thread James Golick
We're seeing RAM usage continually climb until eventually, cassandra becomes unresponsive. The JVM isn't OOM'ing. It has only committed 14/24GB of memory. So, I am assuming that the memory usage is related to mmap'd IO. Fair assumption? I tried setting the IO mode to standard, but it seemed to be

Re: Continuously increasing RAM usage

2010-05-27 Thread James Golick
JDK 64-Bit Server VM (build 1.6.0-b09, mixed mode) Here is the tpstats from a node that was restarted last night: http://gist.github.com/415987 On Thu, May 27, 2010 at 8:51 AM, Robert Coli wrote: > On 5/26/10 11:32 PM, James Golick wrote: > >> We're seeing RAM usage continu

Re: Continuously increasing RAM usage

2010-05-27 Thread James Golick
d Cassandra 0.6.1 fixed a lot of memory problems I ran into. > > *From:* James Golick [mailto:jamesgol...@gmail.com] > *Sent:* Thursday, May 27, 2010 9:05 AM > *To:* user@cassandra.apache.org > *Subject:* Re: Continuously increasing RAM usage > > When I say unresponsive, I mean

Compaction bringing a node to its knees

2010-05-29 Thread James Golick
I just experienced a compaction that brought a node to 100% of its IO capacity and made its responses incredibly slow. It wasn't enough to make the node actually appear as down, though, so it slowed down the operation of the cluster considerably. The CF being compacted contains a lot of relativel

Re: Continuously increasing RAM usage

2010-05-29 Thread James Golick
Well, it's been a few days on 0.6.2 and the new jvm and the behaviour looks to be about the same: http://skitch.com/jamesgolick/df46f/munin-fetlife.com-cassandra0.fetlife.com-cassandra-memory There's only one cache turned on, and it's a row cache, but the sizes of the rows are identical and it's

Inconsistency even after nodetool repair?

2010-05-30 Thread James Golick
I was getting a ton of these in my logs: INFO [pool-1-thread-77] 2010-05-30 03:39:18,263 StorageProxy.java (line 499) DigestMismatchException: Mismatch for key 142667/everything (5546784193bbe4a4e066dc7fc142f589 vs 03edb87cd131a614d8256ddfdc50dd17) The nodes were all getting really overloaded, p

Re: Inconsistency even after nodetool repair?

2010-05-31 Thread James Golick
repair` call is not blocking before 0.6.3 (unreleased): see > https://issues.apache.org/jira/browse/CASSANDRA-1090 > > -Original Message- > From: "James Golick" > Sent: Sunday, May 30, 2010 3:43am > To: cassandra-u...@incubator.apache.org > Subject: Inconsistency

Failover and slow nodes

2010-06-18 Thread James Golick
Our cassandra client fails over if a node times out. Aside from actual failure, repair and major compactions can make a node so slow that it affects application performance. One problem we've run in to is that a node in the midst of repair will still have requests routed to it internally, even if

Re: Failover and slow nodes

2010-06-18 Thread James Golick
e- > > From: "Benjamin Black" > > Sent: Friday, June 18, 2010 12:32pm > > To: user@cassandra.apache.org > > Subject: Re: Failover and slow nodes > > > > Would be interesting to have a snitch that manipulated responses for > > read nodes based on histo

Uneven distribution using RP

2010-06-20 Thread James Golick
I just increased my cluster from 2 to 4 nodes, and RF=2 to RF=3, using RP. The tokens seem pretty even on the ring, but two of the nodes are far more heavily loaded than the others. I understand that there are a variety of possible reasons for this, but I'm wondering whether anybody has suggestion

Re: Uneven distribution using RP

2010-06-20 Thread James Golick
On Sun, Jun 20, 2010 at 4:16 PM, James Golick wrote: > >> I just increased my cluster from 2 to 4 nodes, and RF=2 to RF=3, using RP. >> >> The tokens seem pretty even on the ring, but two of the nodes are far more >> heavily loaded than the others. I understand that ther

Re: Uneven distribution using RP

2010-06-20 Thread James Golick
ding to the formula i * > (2**127 / 4) for i=1..4 > > > On Sun, Jun 20, 2010 at 4:31 PM, James Golick wrote: > >> I ran cleanup on all of them and the distribution looked roughly even >> after that, but a couple of days later, it's looking pretty uneven. >&g

Instability and memory problems

2010-06-20 Thread James Golick
As I alluded to in another post, we just moved from 2-4 nodes. Since then, the cluster has been incredibly The memory problems I've posted about before have gotten much worse and our nodes are becoming incredibly slow/unusable every 24 hours or so. Basically, the JVM reports that only 14GB is comm

Re: Instability and memory problems

2010-06-20 Thread James Golick
Thanks for your thoughts. Answers below: On Sun, Jun 20, 2010 at 2:21 PM, Peter Schuller wrote: > > The memory problems I've posted about before have gotten much worse and > our > > nodes are becoming incredibly slow/unusable every 24 hours or so. > Basically, > > the JVM reports that only 14GB

Re: Instability and memory problems

2010-06-20 Thread James Golick
n out of memory, and there's no way at all to control it. Can we consider changing the default? On Sun, Jun 20, 2010 at 3:37 PM, James Golick wrote: > Thanks for your thoughts. Answers below: > > On Sun, Jun 20, 2010 at 2:21 PM, Peter Schuller < > peter.schul...@infidyne.com

Re: Instability and memory problems

2010-06-20 Thread James Golick
I opened #1214 about this. I hope people will take a look and provide their feedback. https://issues.apache.org/jira/browse/CASSANDRA-1214 Thanks. On Sun, Jun 20, 2010 at 3:58 PM, James Golick wrote: > uh. wow. I just read up on all this again, and read the code, and I'm a > littl

Re: Instability and memory problems

2010-06-21 Thread James Golick
Just an update here. We're now entirely on standard IO mode, and everything is stable and happy. There hasn't been much of a performance hit, if at all. - James On Mon, Jun 21, 2010 at 3:30 AM, Peter Schuller wrote: > > How much of your physical RAM is dedicatd to the JVM? > > > > I forgot to s

Re: Instability and memory problems

2010-06-21 Thread James Golick
On Mon, Jun 21, 2010 at 12:24 PM, Peter Schuller < peter.schul...@infidyne.com> wrote: > > Just an update here. We're now entirely on standard IO mode, and > everything > > is stable and happy. There hasn't been much of a performance hit, if at > all. > > Cool. Just be aware that if my speculation

Re: Uneven distribution using RP

2010-06-22 Thread James Golick
are much bigger? The keys will be distributed without > considering the size of the data. > > The RP is random though, i do not think it tries to evenly distribute the > keys. So some variance with a small number of nodes should be expected IMHO. > > Aaron > > On 21 Jun 2010, at 02:

Re: Uneven distribution using RP

2010-06-22 Thread James Golick
RackUnaware, currently On Tue, Jun 22, 2010 at 1:26 PM, Robert Coli wrote: > On 6/22/10 10:07 AM, James Golick wrote: > >> This node's load is now growing at a ridiculous rate. It is at 105GB, with >> the next most loaded node at 70.63GB. >> >> Given that RF=

Re: Uneven distribution using RP

2010-06-22 Thread James Golick
d is growing as quickly as that one. - James On Tue, Jun 22, 2010 at 1:31 PM, James Golick wrote: > RackUnaware, currently > > > On Tue, Jun 22, 2010 at 1:26 PM, Robert Coli wrote: > >> On 6/22/10 10:07 AM, James Golick wrote: >> >>> This node's load is no

Re: Uneven distribution using RP

2010-06-22 Thread James Golick
Jun 22, 2010 at 5:27 PM, Jeremy Dunck wrote: > On Tue, Jun 22, 2010 at 4:08 PM, James Golick > wrote: > > Turns out that this is due to a larger proportion of the wide rows in the > > system being located on that node. I moved its token over a little to > > compensate for

Never ending compaction

2010-06-22 Thread James Golick
We had to take a node down for an upgrade last night. When we brought it back online in the morning, it got slammed by HH data all day so badly that it was compacting near constantly, and the pending compactions pool was piling up. I shut most of the writes down to let things catch up, which they m

UnavailableException with 1 node down and RF=2?

2010-06-30 Thread James Golick
4 nodes, RF=2, 1 node down. How can I get an UnavailableException in that scenario? - J.

Re: UnavailableException with 1 node down and RF=2?

2010-06-30 Thread James Golick
Oops. I meant to say that I'm reading with CL.ONE. J. Sent from my iPhone. On 2010-07-01, at 1:39 AM, Benjamin Black wrote: > .QUORUM or .ALL (they are the same with RF=2). > > On Wed, Jun 30, 2010 at 10:22 PM, James Golick wrote: >> 4 nodes, RF=2, 1 node down.

Re: UnavailableException with 1 node down and RF=2?

2010-07-01 Thread James Golick
It's happening consistently when I take any node out of rotation. On Thu, Jul 1, 2010 at 2:24 AM, Jonathan Ellis wrote: > Presumably the failure detector generated a false positive for a > second node temporarily > > On Wed, Jun 30, 2010 at 10:55 PM, James Golick > wrote:

Nodes Timing Out

2010-03-27 Thread James Golick
Hey, I put our first cluster in to production (writing but not reading) a couple of days ago. Right now, it's got two pretty sizeable nodes taking about 200 writes per second each and virtually no reads. Eventually, though, (and this has happened twice), both nodes seem to start timing out. If I

Re: Nodes Timing Out

2010-03-27 Thread James Golick
, Mar 27, 2010 at 12:29 PM, James Golick > wrote: > > Hey, > > I put our first cluster in to production (writing but not reading) a > couple > > of days ago. Right now, it's got two pretty sizeable nodes taking about > 200 > > writes per second each and vi

Re: Nodes Timing Out

2010-03-28 Thread James Golick
unlimited On Sat, Mar 27, 2010 at 12:09 PM, Chris Goffinet wrote: > what's the ulimit set to? > > -Chris > > On Mar 27, 2010, at 10:29 AM, James Golick wrote: > > Hey, > > I put our first cluster in to production (writing but not reading) a couple > of days

Re: Nodes Timing Out

2010-03-28 Thread James Golick
Oops, I was doing ulimit. ulimit -n returns 1024. On Sun, Mar 28, 2010 at 3:25 AM, Benoit Perroud wrote: > ulimit -n returns you unlimited ? > > > 2010/3/28 James Golick : > > unlimited > > > > On Sat, Mar 27, 2010 at 12:09 PM, Chris Goffinet > wrote: &

Read Performance

2010-03-30 Thread James Golick
We are starting to use cassandra to power our activity feed. The way we organize our data is simple. "Event"s live in a CF called Events and are keyed by a UUID. The timelines themselves live in a CF called Timelines, which is keyed by user id (i.e. "1229") and contains a event uuids as column name

Re: Read Performance

2010-03-30 Thread James Golick
"no change observed" or "faster but not > fast enough?" > > On Tue, Mar 30, 2010 at 10:47 PM, James Golick > wrote: > > We are starting to use cassandra to power our activity feed. The way we > > organize our data is simple. "Event"s live in a

Re: Read Performance

2010-03-30 Thread James Golick
Okay, so now my row cache hit rate jumps between 1.0, 99.5, 95.6, and NaN. Seems like that stat is a little broken. Still seeing around 35ms to multiget 20 rows. - James On Tue, Mar 30, 2010 at 9:22 PM, Ryan King wrote: > On Tue, Mar 30, 2010 at 9:11 PM, James Golick > wrote: > &g

Re: Read Performance

2010-03-31 Thread James Golick
I'm testing on the live cluster, but most of the production reads are being served by the cache. It's definitely the right CF. On Wed, Mar 31, 2010 at 8:30 AM, Jonathan Ellis wrote: > On Wed, Mar 31, 2010 at 12:01 AM, James Golick > wrote: > > Okay, so now my row cache

Re: Read Performance

2010-03-31 Thread James Golick
f the reads to replay asynchronously against Cassandra. > > > > On Wed, 2010-03-31 at 11:04 -0500, Jonathan Ellis wrote: > >> Can you redirect some of the reads from memcache to cassandra? Sounds > >> like the cache isn't getting warmed up. > >> >

Re: Read Performance

2010-03-31 Thread James Golick
Standard CF. 10 columns per row. Between about 800 bytes and 2k total per row. On Wed, Mar 31, 2010 at 3:06 PM, Chris Goffinet wrote: > How many columns in each row? > > -Chris > > On Mar 31, 2010, at 2:54 PM, James Golick wrote: > > I just tried running the same multi_

Re: Read Performance

2010-03-31 Thread James Golick
Row cache hit rate: 0.007716049382716049 On Wed, Mar 31, 2010 at 4:15 PM, Jonathan Ellis wrote: > What does the CFS mbean think read latencies are? Possibly something > else is introducing latency after the read. > > On Wed, Mar 31, 2010 at 5:37 PM, James Golick > wrote: >

Re: Read Performance

2010-04-01 Thread James Golick
> times (read/write in ms) in your previous mails. Can you explain a little? > Iike to work on it also. > > CD > > > On Thu, Apr 1, 2010 at 4:15 AM, Jonathan Ellis wrote: > >> On Wed, Mar 31, 2010 at 6:21 PM, James Golick >> wrote: >> > Keyspa

Re: Read Performance

2010-04-01 Thread James Golick
Damnit! On Thu, Apr 1, 2010 at 2:05 PM, Jeremy Dunck wrote: > Or rackspace. ;) > > On Thu, Apr 1, 2010 at 2:49 PM, Joseph Stump wrote: > > Taking our flamewar offline. :-D > > > > On Thu, Apr 1, 2010 at 1:36 PM, James Golick > wrote: > >> I don&#

Re: Read Performance

2010-04-01 Thread James Golick
ot of time was being spent in thrift. Turns out, that's where pretty much all the time was going. I just ran the same test using java (scala) and the load is taking around 2-4ms. On Thu, Apr 1, 2010 at 4:37 PM, Peter Chang wrote: > pwned. > > > On Thu, Apr 1, 2010 at 2:09 PM, Jame

Re: Read Performance

2010-04-01 Thread James Golick
Yes. J. Sent from my iPhone. On 2010-04-01, at 9:21 PM, Brandon Williams wrote: On Thu, Apr 1, 2010 at 9:37 PM, James Golick wrote: Well, folks, I'm feeling a little stupid right now (adding to the injury inflicted by one Mr. Stump :-P). So, here's the story. The cache hit

Re: Read Performance

2010-04-02 Thread James Golick
Yes. On Fri, Apr 2, 2010 at 10:35 AM, Ryan King wrote: > On Thu, Apr 1, 2010 at 8:37 PM, James Golick > wrote: > > Well, folks, I'm feeling a little stupid right now (adding to the injury > > inflicted by one Mr. Stump :-P). > > So, here's the story. The c

Reading thousands of columns

2010-04-13 Thread James Golick
Hi All, I'm seeing about 35-50ms to read 1000 columns from a CF using get_range_slices. The columns are TimeUUIDType with empty values. The row cache is enabled and I'm running the query 500 times in a row, so I can only assume the row is cached. Is that about what's expected or am I doing somet

Re: Reading thousands of columns

2010-04-14 Thread James Golick
eason that > oracle doesn't cache tuples from sequential scans -- blowing away > 1000s of rows worth of recently used rows queried by key, for a swath > of rows from the scan, is the wrong call more often than it is the > right one. > > On Tue, Apr 13, 2010 at 1:00 PM, James G

Re: Reading thousands of columns

2010-04-14 Thread James Golick
That helped a little. But, it's still quite slow. Now, it's around 20-35ms on average, sometimes as high as 70ms. On Wed, Apr 14, 2010 at 8:50 AM, James Golick wrote: > Right - that make sense. I'm only fetching one row. I'll give it a try with > get_slice(). > >

Re: Reading thousands of columns

2010-04-14 Thread James Golick
Just for the record, I am able to repeat this locally. I'm seeing around 150ms to read 1000 columns from a row that has 3000 in it. If I enable the rowcache, that goes down to about 90ms. According to my profile, 90% of the time is being spent waiting for cassandra to respond, so it's not thrift.

Re: Reading thousands of columns

2010-04-14 Thread James Golick
The values are empty. It's 3000 UUIDs. On Wed, Apr 14, 2010 at 12:40 PM, Avinash Lakshman < avinash.laksh...@gmail.com> wrote: > How large are the values? How much data on disk? > > On Wednesday, April 14, 2010, James Golick wrote: > > Just for the record, I am a