Re: [SPAM] Re: slow insertion rate with secondary index

Donal Zang Mon, 06 Jun 2011 04:25:47 -0700

On 06/06/2011 10:15, David Boxenhorn wrote:

Is there really a 10x difference between indexed CFs and non-indexed CFs?

Well, as for my test, it is!

I'm using 0.7.6-2, 9 nodes, 3 replicas, write_consistency_level QUORUM,about 90,000,000 rows (~ 1K per row)

I use 20 process, 20rows for each insertion.
the insertion time for the whole row is about 0.02 seconds without index

and then I add a secondary index, and update every row with the indexedcolumn, the insertion time is about 2 seconds

and if I remove the index, and update the column, the time is about 0.002

Another thing I noticed is : if you first do insertion, and then buildthe secondary index use "update column family ...", and then do selectbased on the index, the result is not right (seems the index is stillbeing built though the "update" commands returns quickly). And after awhile, the get_indexed_slices() goes time out from time to time (withpycassa.ConnectionPool('keyspace1', ['host1','host2'], timeout=600,pool_size=1) ).


Does some one else have some same experiences using the secondary indexes?

--
Donal Zang
Computing Center, IHEP
19B YuquanLu, Shijingshan District,Beijing, 100049
zan...@ihep.ac.cn
86 010 8823 6018

Re: [SPAM] Re: slow insertion rate with secondary index

Reply via email to