row cache

2013-08-14 Thread Faraaz Sareshwala
At the Cassandra 2013 conference, Axel Liljencrantz from Spotify discussed various cassandra gotchas in his talk on "How Not to Use Cassandra." One of the sections of his talk was on the row cache. If you weren't at the talk, or don't remember it, the video is up on youtube [1]. The discussion o

Re: Custom commands in cassandra

2013-08-14 Thread Jon Haddad
Aside from the problems mentioned below, it's a rare case that tightly coupling your application code directly into your database makes it easier to maintain your codebase, especially as you scale. If you roll out your custom Cassandra application, then decide you need search, will you also emb

Re: cassandra 1.2.5- virtual nodes (num_token) pros/cons?

2013-08-14 Thread Aaron Morton
> Repair and bootstrap will be limited by the node doing repair or bootstrap, > since it has to do the same amount of work whatever num_tokens is. That's what I was thinking. I normally assume repair has very little data to stream, and most of the time is taken creating the merkle trees. In th

Re: Custom commands in cassandra

2013-08-14 Thread Aaron Morton
> They also stuck themselves on Cassandra 0.7 forever. To reinforce that point, look at the data stax site or the last conference for some of the performance metrics comparing 1.2 to 1.0 and before. While you are worrying about the transport to cassandra, the project making things go faster. IM

Re: Recommendation for hosting multi tenant clusters

2013-08-14 Thread Aaron Morton
I also saw somewhere (may have been twitter) that the reads they benchmark against for provisioned IOPS is something like 4KB or 8KB. It was something small, smaller than the page size memmapping will use anyway. Cheers - Aaron Morton Cassandra Consultant New Zealand @aaronmor

Re: understanding memory footprint

2013-08-14 Thread Aaron Morton
> "Does the number of column families still significantly impact the memory > footprint? If so, what is the incremental cost of a column family/table?" IMHO there would be little difference in memory use for a node with zero data that had 10 CF's and one that had 100 CF's. When you start putting

Re: Data modelling for range retrieval. Was: Re: Hadoop/Cassandra for data transformation (rather than analysis)?

2013-08-14 Thread Aaron Morton
> Is it good practice then to find an attribute in my data that would allow me > to form wide row row keys with aprox. 1000 values each? You can do that using get_range_slice() via thrift. And via CQL 3 you use the token() function and Limit with a select statement. Check the DS docs for more in

RE: Vnodes, adding a node ?

2013-08-14 Thread Andrew Cobley
That looks like the problem. I added the node with that machine as a seed, realized my mistake and restarted the machine with the correct seed. it joined the ring but without streaming. Nodetool rebuild however doesn't seem to be fixing the situation. I'll remove the node and try re-adding i

Re: Vnodes, adding a node ?

2013-08-14 Thread Richard Low
On 14 August 2013 20:02, Andrew Cobley wrote: > I have small test cluster of 2 nodes. I ran a stress test on it and with > nodetool status received the following: > > /usr/local/bin/apache-cassandra-2.0.0-rc1/log $ ../bin/nodetool status > Datacenter: datacenter1 > === > Sta

Vnodes, adding a node ?

2013-08-14 Thread Andrew Cobley
I have small test cluster of 2 nodes. I ran a stress test on it and with nodetool status received the following: /usr/local/bin/apache-cassandra-2.0.0-rc1/log $ ../bin/nodetool status Datacenter: datacenter1 === Status=Up/Down |/ State=Normal/Leaving/Joining/Moving -- Addre

Re: Vnodes, adding a node ?

2013-08-14 Thread sankalp kohli
What steps did you follow to add the 3 node? On Wed, Aug 14, 2013 at 12:02 PM, Andrew Cobley wrote: > I have small test cluster of 2 nodes. I ran a stress test on it and with > nodetool status received the following: > > /usr/local/bin/apache-cassandra-2.0.0-rc1/log $ ../bin/nodetool status >

Re: Cassandra HANGS after some writes

2013-08-14 Thread Robert Coli
On Tue, Aug 13, 2013 at 10:39 PM, Naresh Yadav wrote: > I made one single change in default cassandra.yaml, just to experiment. > > native_transport_min_threads: *1* > native_transport_max_threads: *1* > > with max one single thread for native protocol requests i noticed some > improvement, earli

Re: HsHa

2013-08-14 Thread Jake Luciani
This is technically a Thrift message not Cassandra, it happens when a client hangs up without closing the socket. You should be able to silence it by raising the class specific log level see log4j-server.properties as an example On Wed, Aug 14, 2013 at 9:59 AM, Alain RODRIGUEZ wrote: > @Commite

Re: HsHa

2013-08-14 Thread Alain RODRIGUEZ
@Commiters/Experts, Does this sound like a bug or like 4 PEBCAKs to you ? Should we raise a JIRA ? Alain 2013/8/14 Keith Wright > Same here on 1.2.4. > > From: Romain HARDOUIN > Reply-To: "user@cassandra.apache.org" > Date: Wednesday, August 14, 2013 3:36 AM > To: "user@cassandra.apache.org

Re: HsHa

2013-08-14 Thread Keith Wright
Same here on 1.2.4. From: Romain HARDOUIN mailto:romain.hardo...@urssaf.fr>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra.apache.org>> Date: Wednesday, August 14, 2013 3:36 AM To: "user@cassandra.apache.org" mail

Re: HsHa

2013-08-14 Thread Romain HARDOUIN
The same goes for us. Romain Alain RODRIGUEZ a écrit sur 13/08/2013 18:10:05 : > De : Alain RODRIGUEZ > A : user@cassandra.apache.org, > Date : 13/08/2013 18:10 > Objet : Re: HsHa > > I have this anytime I try to switch to "hsha" since 0.8. > > Always kept "sync" for this reason. Thought I