Re: Secondary indexes performance

2011-06-22 Thread Wojciech Pietrzok
OK, got some results (below). 2 nodes, one on localhost, second on LAN, reading with ConsistencyLevel.ONE, buffer_size=512 rows (that's how many rows pycassa will get on one connection, than it will use last row_id as start row for next query) Queries types: 1) get_range - just added limit of 1024

Secondary indexes performance

2011-06-20 Thread Wojciech Pietrzok
Hello, I've noticed that queries using secondary indexes seems to be getting rather slow. Right now I've got an Column Family with 4 indexed columns (plus 5-6 non indexed columns, column values are small), and around 1,5-2 millions of rows. I'm using pycassa client and query using get_indexed_slic

Re: Inconsistent results using secondary indexes between two DC

2011-05-25 Thread Wojciech Pietrzok
2011/5/23 Jonathan Ellis : >> It was installed as 0.7.2 and upgraded with each new official release. > > I bet that's the problem, then. > https://issues.apache.org/jira/browse/CASSANDRA-2244 could cause > indexes to not be updated for releases < 0.7.4.  You'll want to > rebuild the index. > >> By

Re: Inconsistent results using secondary indexes between two DC

2011-05-23 Thread Wojciech Pietrzok
It was installed as 0.7.2 and upgraded with each new official release. As I wrote in another message in this thread, now nodes are upgraded to 0.7.6 but it still seems that one of the problematic nodes returns inconsistent data. By the way - is it possible to force the rebuild of the secondary ind

Re: Inconsistent results using secondary indexes between two DC

2011-05-22 Thread Wojciech Pietrzok
I've already tried running nodetool repair severail times before but it didn't seem to help. Now I've upgraded Cassandra to 0.7.6, run nodetool scrub, and nodetool repair (twice). One of the problematic nodes seems to return correct results now. But the second one still returns inconsistent data.

Re: Inconsistent results using secondary indexes between two DC

2011-05-19 Thread Wojciech Pietrzok
ing rows are in the > main data CF -- i.e., are they just unindexed, or are they missing > completely? > > On Sun, May 15, 2011 at 4:33 PM, Wojciech Pietrzok wrote: >> Hello, >> >> I've noticed strange behaviour of Cassandra when using secondary indexes. >> T

Inconsistent results using secondary indexes between two DC

2011-05-15 Thread Wojciech Pietrzok
Hello, I've noticed strange behaviour of Cassandra when using secondary indexes. There are 2 Data Centers, each with 2 nodes, RF=4, on all nodes Cassandra 0.7.5 is installed. When I connect to one of the nodes in DC1 and perform query using secondary indexes ("get ColumnFamily where column = 'foo'