Re: batch mutates & throughput

2011-08-08 Thread Philippe
> > Hmm, can you create a ticket with a simple way to reproduce that? We > should be giving back an InvalidRequestException for > multiple-mutations-on-same-key instead of erroring out later and > causing timeouts. > Humm... this is actually quite confusing. When I look at the error, I don't see t

Re: batch mutates & throughput

2011-08-08 Thread Philippe
> > Hmm, can you create a ticket with a simple way to reproduce that? We > should be giving back an InvalidRequestException for > multiple-mutations-on-same-key instead of erroring out later and > causing timeouts. > Will do but I'm supposed to be on vacation for a couple weeks now so it will take

Re: batch mutates & throughput

2011-08-08 Thread Jonathan Ellis
Hmm, can you create a ticket with a simple way to reproduce that? We should be giving back an InvalidRequestException for multiple-mutations-on-same-key instead of erroring out later and causing timeouts. On Mon, Aug 8, 2011 at 12:34 AM, Philippe wrote: > Hi Boris, > Thanks for the suggestion, I

Re: batch mutates & throughput

2011-08-07 Thread Philippe
Hi Boris, Thanks for the suggestion, I didn't know there was one. I believe have finally figured it out and it turns out my last two questions are related. First, my batch loading was ignoring a bunch of rows when reading the first file (so it took hundreds of potential mutations for the problem

Re: batch mutates & throughput

2011-08-07 Thread Boris Yen
Maybe you could try to adjust the setting "cassandraThriftSocketTimeout" of hector. https://github.com/rantav/hector/wiki/User-Guide On Mon, Aug 8, 2011 at 6:54 AM, Philippe wrote: > Quick followup. > I have pushed the RPC timeout to 30s. Using Hector, I'm doing 1 thread > doing batches of 10 m

Re: batch mutates & throughput

2011-08-07 Thread Philippe
Quick followup. I have pushed the RPC timeout to 30s. Using Hector, I'm doing 1 thread doing batches of 10 mutates at a time so that's even slower than when I was doing 16 threads in parallel doing non-batched mutations. After a couple hundred execute() calls, I get a timeout for every node; I have

batch mutates & throughput

2011-08-07 Thread Philippe
A question regarding batch mutates and how others might be throttling the system to prevent timeouts. My 3-node, RF=3 cluster has been performing ok while bulk loading data (applying counter updates). I've been able to run 16 threads in parallel that each perform about 400 mutates/s on a loaded cl