Re: HsHa

2013-08-21 Thread Alain RODRIGUEZ
@Christopher, not sure if you noticed it but, CASSANDRA-4573 is now fixed in C*2.0.0 RC2 => http://goo.gl/AGVTOF No idea if this could fix our issue Alain 2013/8/14 Jake Luciani > This is technically a Thrift message not Cassandra, it happens when a > client hangs up without closing the socke

Compilation errors when setting up Cassandra as a project

2013-08-21 Thread Nipuni Piyabasi Perera
Hi, I am new to Cassandra. have followed the guide here [1] and imported Cassandra as an IntelliJIdea project. But when I tried to build using 'Ant Build', it gives compilation errors. When I tried to open the java classes in the workspace, all most all of the errors are due to missing libraries.

Re: Compilation errors when setting up Cassandra as a project

2013-08-21 Thread Nipuni Piyabasi Perera
Hi, This was a dependency issue, and now it is fixed. Thanks, Nipuni On Wed, Aug 21, 2013 at 2:15 PM, Nipuni Piyabasi Perera < nipuni880...@gmail.com> wrote: > Hi, > > I am new to Cassandra. have followed the guide here [1] and imported > Cassandra as an IntelliJIdea project. But when I tried

Re: Nodes get stuck

2013-08-21 Thread Sylvain Lebresne
A thread dump on one of the machine that has a suspiciously high CPU might help figuring out what it is that is taking all that CPU. On Wed, Aug 21, 2013 at 8:57 AM, Keith Wright wrote: > Some last minute info on this to hopefully enlighten. We are doing ~200 > reads and writes across our 7 no

Moving a cluster between networks.

2013-08-21 Thread Tim Wintle
Hi, Suppose we have two networks: 10.1.0.0/16 and 10.2.0.0/16. It is not possible to route packets between the two networks, but all nodes have interfaces on both networks, so any node can communicate with any address on either network. We are currently running our all nodes on one network, but

bug in Pig LOAD with cqlStorage and param columns? - cassandra 1.2.8 - pig 0.11.1

2013-08-21 Thread Miguel Angel Martin junquera
hi: I am using pig 0.11.1 and cassandra 1.2.8. i try this http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html and... *rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();* *dump rows;* works fine if I skip t

Re: bug in Pig LOAD with cqlStorage and param columns? - cassandra 1.2.8 - pig 0.11.1

2013-08-21 Thread Jeremy Hanna
In order to narrow down the problem, I would start without the request parameters and see if that works. Then I would add the request parameters one at a time to see what breaks things. Often pig is not very helpful with its error messages, so I've had to use this method a lot. On 21 Aug 2013

Re: Secondary Index Question

2013-08-21 Thread Hiller, Dean
Yup, there are other types of indexing like that in PlayOrm which do it differently so all nodes are not hit so it works better for instance if you are partitioning your data and you query into just a single partition so it doesn't put load on all the nodes. (of course, you have to have a parti

Re: bug in Pig LOAD with cqlStorage and param columns? - cassandra 1.2.8 - pig 0.11.1

2013-08-21 Thread Shamim
Hello, When i have tested the issue CASSANDRA-5234, it's work with the following query rows = LOAD 'cql://keyspace1/test?page_size=1&columns=title,age&split_size=4&where_clause=age%3D30' USING CqlStorage(); there was no problem with param columns. May be something goes wrong with version 1.2.8

Automatic tombstone compaction

2013-08-21 Thread tamas.foldesi
Hi, After upgrading from 1.0 to 1.2, I wanted to make use of the automatic tombstone compaction feature, so using CQL3 I issued: ALTER TABLE versions WITH compaction = {'class' : 'SizeTieredCompactionStrategy', 'min_threshold' : 4, 'max_threshold' : 32, 'tombstone_compaction_interval' : 1, 'to

Re: Automatic tombstone compaction

2013-08-21 Thread Haithem Jarraya
Hi, do you mean LeveledCompactionStrategy? Also you will need to run nodetool upgradesstables [keyspace][cf_name] after changing the compaction strategy. Thanks, Haithem Jarraya On 21 Aug 2013, at 15:15, tamas.fold...@thomsonreuters.com wrote: Hi, A

RE: Automatic tombstone compaction

2013-08-21 Thread tamas.foldesi
Hi, I ran upgradesstables as part of the Cassandra upgrade, before issuing the CQL alter command. According to the docs, SizeTieredCompactionStrategy is fine (that is what I used, and plan on continue using), and automatic tombstone compaction is available for it: http://www.datastax.com/docume

Cassandra Secondary Index implementation

2013-08-21 Thread Nipuni Piyabasi Perera
Hi, We are interested in secondary index implementation of Cassandra. What are the classes that we need to approach in order to get an understanding on secondary index implementation. We could download and setup the basic configuration to run Cassandra.We also could setup Cassandra as a project.

RE: Secondary Index Question

2013-08-21 Thread Kanwar Sangha
Thanks Dean. Any reason why it is sequential ? It is to avoid loading all the nodes and see if one node can return the desired results ? -Original Message- From: Hiller, Dean [mailto:dean.hil...@nrel.gov] Sent: 21 August 2013 07:36 To: user@cassandra.apache.org Subject: Re: Secondary In

Re: Automatic tombstone compaction

2013-08-21 Thread Haithem Jarraya
I guess I didn't understand your question then, I thought you changed the compaction strategy. If that what you did, you have to run upgradesstaable again. On 21 Aug 2013, at 15:33, tamas.fold...@thomsonreuters.com wrote: Hi, I ran upgradesstables as p

Re: Automatic tombstone compaction

2013-08-21 Thread Yuki Morishita
Tamas, If there are rows with the same key in other SSTables, that rows won't be deleted. Tombstone compaction make guess if it can actually drop "safely" by scanning overlap with other SSTables. Do you have many rows in your large SSTable? If you don't, then chance to run tombstone compaction may

Re: Secondary Index Question

2013-08-21 Thread Hiller, Dean
Sorry, I forget why. Someone told me at the cassandra conference. It might be to not overload the entire cluster at once so if you have 1000 nodes and you run just 5 queries, you could take out your cluster. (This is why I use playorm's querying and in tons of use cases, you don't want to query

Re: Secondary Index Question

2013-08-21 Thread Hiller, Dean
Oh, I do know it is not "see if one node can return the desired results" as each node will have different results for your client and you get results from the first node, then results from second node, etc. etc. (I remember having this discussion but for the life of me can't remember why it is seq

Re: how-to scan a table using CQL3

2013-08-21 Thread Alain RODRIGUEZ
Hi, I am sorry about digging this up but I was in search of this kind of information and read this thread. How to make sure that the first rowkey you select has the smaller token ? I mean when you perform "select rowkey from my_table limit N;" can you have any data with any token or is data token

Merge C* Cluster

2013-08-21 Thread Julio Quierati
Hello, I have 3 clusters in aws, need to turn in a single cluster. My current infrastructure: Cluster 1: 3 xlarge, replication factor 3, with 80% disk usage, 4 ephemeral raid0 xfs 1.7TB Cluster 2: 3 xlarge, replication factor 2, with 40% disk usage, 4 ephemeral raid0 xfs 1.7TB Cluster 3: 2 xlarge,

High CPU usage with authentication enabled

2013-08-21 Thread Joshua M. Thompson
Hello, I've been tasked with tuning a Cassandra-based app for eventual production deployment and I'm running into an issue I can't seem to solve when I run my load tests. I'm still relatively new to Cassandra so i'm hoping there is something obvious I'm missing here. Basically, everything runs gr

Re: Secondary Index Question

2013-08-21 Thread Robert Coli
On Tue, Aug 20, 2013 at 5:57 PM, Kanwar Sangha wrote: > Hi – I was reading some blogs on implementation of secondary indexes in > Cassandra and they say that “the read requests are sent sequentially to all > the nodes” ? > > ** ** > > So if I have a query to fetch ALL records with the secon

Re: Moving a cluster between networks.

2013-08-21 Thread Robert Coli
On Wed, Aug 21, 2013 at 3:58 AM, Tim Wintle wrote: > What would the best way to achieve this? (We can tolerate a fairly short > period of downtime). > I think this would work, but may require a full cluster shutdown. 1) stop nodes on old network 2) set auto_bootstrap to false in the conf file (

Re: Automatic tombstone compaction

2013-08-21 Thread Robert Coli
On Wed, Aug 21, 2013 at 8:23 AM, Yuki Morishita wrote: > If there are rows with the same key in other SSTables, that rows won't > be deleted. > Tombstone compaction make guess if it can actually drop "safely" by > scanning overlap with other SSTables. > Background @ : https://issues.apache.org/

Re: Nodes get stuck

2013-08-21 Thread Robert Coli
On Tue, Aug 20, 2013 at 11:35 PM, Keith Wright wrote: > Still looking for help! We have stopped almost ALL traffic to the cluster > and still some nodes are showing almost 1000% CPU for cassandra with no > iostat activity. We were running cleanup on one of the nodes that was not > showing load

Re: Nodes get stuck

2013-08-21 Thread Robert Coli
On Wed, Aug 21, 2013 at 10:47 AM, Robert Coli wrote: > On Tue, Aug 20, 2013 at 11:35 PM, Keith Wright wrote: > >> Still looking for help! We have stopped almost ALL traffic to the >> cluster and still some nodes are showing almost 1000% CPU for cassandra >> with no iostat activity. We were run

RE: Automatic tombstone compaction

2013-08-21 Thread tamas.foldesi
Well, these tables are somewhat similar to a 'cache' - we insert rows, then leave them for a week using TTL (usually untouched, read only), and then we need to compact them away. If I understand correctly, they should not be affected by the below issue... The question is rather if the setup is c

Re: Automatic tombstone compaction

2013-08-21 Thread Nate McCall
In the context of Yuki's response, if you are using the same key for the cache, then your rows will get increasingly fragmented. On Wed, Aug 21, 2013 at 1:09 PM, wrote: > Well, these tables are somewhat similar to a 'cache' - we insert rows, > then leave them for a week using TTL (usually unto

Re: insert performance (1.2.8)

2013-08-21 Thread Keith Freeman
Building the giant batch string wasn't as bad as I thought, and at first I had great(!) results (using "unlogged" batches): 2500 rows/sec (batches of 100 in 48 threads) ran very smoothly, and the load on the cassandra server nodes averaged about 1.0 or less continuously. But then I upped it to

Re: High CPU usage with authentication enabled

2013-08-21 Thread Nate McCall
The bcrypt rounds are indeed expensive and ClientState should hold the result for the active connection. So it sounds like you are creating a lot of new connections and thus hitting that bcrypt penalty. On Wed, Aug 21, 2013 at 12:28 PM, Joshua M. Thompson < joshua.thomp...@gmail.com> wrote: > He

Re: insert performance (1.2.8)

2013-08-21 Thread Nate McCall
The only thing I can think to suggest at this point is upping that batch size - say to 500 and see what happens. Do you have any monitoring on this cluster? If not, what do you see as the output of 'nodetool tpstats' while you run this test? On Wed, Aug 21, 2013 at 1:40 PM, Keith Freeman <8fo...

RE: Automatic tombstone compaction

2013-08-21 Thread tamas.foldesi
Actually each column in a row has the same TTL, so I guess that will not cause fragmentation...? Anyway - any feedback on the questions in my previous mail? From: Nate McCall [n...@thelastpickle.com] Sent: Wednesday, August 21, 2013 8:26 PM To: Cassandra Users Subj

Re: High CPU usage with authentication enabled

2013-08-21 Thread Joshua M. Thompson
On Wed, Aug 21, 2013 at 2:46 PM, Nate McCall wrote: > The bcrypt rounds are indeed expensive and ClientState should hold the > result for the active connection. So it sounds like you are creating a lot > of new connections and thus hitting that bcrypt penalty. > > Thanks, that was my thought too

Re: insert performance (1.2.8)

2013-08-21 Thread Nate McCall
What's the disk setup like on these system? You have some pending tasks in MemtablePostFlusher and FlushWriter which may mean there is contention on flushing discarded segments from the commit log. On Wed, Aug 21, 2013 at 5:14 PM, Keith Freeman <8fo...@gmail.com> wrote: > Ok, I tried batching 5

[ANNOUNCE] storm-cassandra 0.4.0-rc2

2013-08-21 Thread P. Taylor Goetz
I'm pleased to announce the release of storm-cassandra 0.4.0-rc2. https://github.com/hmsonline/storm-cassandra Changes in this release: The CassandraMapState implementation now uses row keys instead of column keys to improve performance (Thanks @sorenmacbeth) The CassandraMapState implementatio

Re: HsHa

2013-08-21 Thread Jason Wee
Nature of issue CASSANDRA-4573 compare to "Read an invalid frame size of 0." looks different, nevertheless if someone can test the issue fix would include invalid frame size.. would be awesome! Jason On Wed, Aug 21, 2013 at 4:08 PM, Alain RODRIGUEZ wrote: > @Christopher, not sure if you notice