Hi Viktor,

As i mentioned, my goal is to finally achieve 100 reads per second
throughput, its not a batch size of 100.

The writes are already done, and i am not doing them anymore. I loaded the
system with about 130 million keys.

I am just doing a read workload in my experiment as of now..

1. no invalidation of cache happenning because no writes are happenning. GC
is not an issue, its an off-heap native cache by  default in 1.0.9
2. reads are with batch size 1
3. read qps 25

I am keeping max read qps constant, and just varying the number of threads
doing the reads.

row cache hit ratio = 0.66

Observations:

1. With 20 threads doing reads, avg latency is 50 ms
2. With 6 threads doing reads, avg latency is 30 ms
3. With 2 threads doing reads, avg latency is 15 ms
4. With 3 threads, latency is 20 ms

Looks like the number of disks (2) are limiting the concurrency of the
system here. any other explanations?
/G


On Thu, May 17, 2012 at 10:49 PM, Viktor Jevdokimov <
viktor.jevdoki...@adform.com> wrote:

>  Row cache is ok until keys are not heavily updated, otherwise it
> frequently invalidates and pressures GC.****
>
> ** **
>
> The high latency is from your batch of 100 keys. Review your data model to
> avoid such reads, if you need low latency.****
>
> ** **
>
> 500M rows on one node, or on the cluster? Reading 100 random rows at total
> of 40KB data from a data set of 180GB uncompressed under 30ms is not an
> easy task.****
>
> ** **
>
> ** **
>
>
>    Best regards / Pagarbiai
> *Viktor Jevdokimov*
> Senior Developer
>
> Email: viktor.jevdoki...@adform.com
> Phone: +370 5 212 3063, Fax +370 5 261 0453
> J. Jasinskio 16C, LT-01112 Vilnius, Lithuania
> Follow us on Twitter: @adforminsider <http://twitter.com/#!/adforminsider>
> What is Adform: watch this short video <http://vimeo.com/adform/display>
>  [image: Adform News] <http://www.adform.com>
>
> Disclaimer: The information contained in this message and attachments is
> intended solely for the attention and use of the named addressee and may be
> confidential. If you are not the intended recipient, you are reminded that
> the information remains the property of the sender. You must not use,
> disclose, distribute, copy, print or rely on this e-mail. If you have
> received this message in error, please contact the sender immediately and
> irrevocably delete this message and any copies.
>
>   *From:* Gurpreet Singh [mailto:gurpreet.si...@gmail.com]
> *Sent:* Thursday, May 17, 2012 20:24
> *To:* user@cassandra.apache.org
> *Subject:* Re: cassandra read latency help****
>
> ** **
>
> Thanks Viktor for the advice.****
>
> Right now, i just have 1 node that i am testing against and i am using CL
> one.****
>
> Are you suggesting that the page cache might be doing better than the row
> cache?
> I am getting row cache hit of 0.66 right now.****
>
> ** **
>
> /G****
>
> ** **
>
> On Thu, May 17, 2012 at 12:26 AM, Viktor Jevdokimov <
> viktor.jevdoki...@adform.com> wrote:****
>
> > Gurpreet Singh wrote:
> > Any ideas on what could help here bring down the read latency even more ?
> ****
>
> Avoid Cassandra forwarding request to other nodes:
> - Use consistency level ONE;
> - Create data model to do single request with single key, since different
> keys may belong to different nodes and requires forwarding requests to them;
> - Use smart client to calculate token for key and select appropriate node
> (primary or replica) by token range;
> - Turn off Dynamic Snitch (it may forward request to other replica even it
> has the data);
> - Have all or hot data in page cache (no HDD disk IO) or use SSD;
> - If you do regular updates to key, do not use row cache, otherwise you
> may try.
>
>
>
>
> Best regards / Pagarbiai
>
> Viktor Jevdokimov
> Senior Developer
>
> Email: viktor.jevdoki...@adform.com
> Phone: +370 5 212 3063
> Fax: +370 5 261 0453
>
> J. Jasinskio 16C,
> LT-01112 Vilnius,
> Lithuania
>
>
>
> Disclaimer: The information contained in this message and attachments is
> intended solely for the attention and use of the named addressee and may be
> confidential. If you are not the intended recipient, you are reminded that
> the information remains the property of the sender. You must not use,
> disclose, distribute, copy, print or rely on this e-mail. If you have
> received this message in error, please contact the sender immediately and
> irrevocably delete this message and any copies.****
>
> ** **
>

<<signature-logo29.png>>

Reply via email to