Re: horizontal query scaling issues follow on

Jonathan Haddad Thu, 17 Jul 2014 22:03:38 -0700

The problem with starting without vnodes is moving to them is a bit
hairy.  In particular, nodetool shuffle has been reported to take an
extremely long time (days, weeks).  I would start with vnodes if you
have any intent on using them.


On Thu, Jul 17, 2014 at 6:03 PM, Robert Coli <rc...@eventbrite.com> wrote:
> On Thu, Jul 17, 2014 at 5:16 PM, Diane Griffith <dfgriff...@gmail.com>
> wrote:
>>
>> I did tests comparing 1, 2, 10, 20, 50, 100 clients spawned all querying.
>> Performance on 2 nodes starts to degrade from 10 clients on.  I saw similar
>> behavior on 4 nodes but haven't done the official runs on that yet.
>
>
> Ok, if you've multi-threaded your client, then you aren't starving for
> client thread paralellism, and that rules out another scalability
> bottleneck.
>
> As a brief aside, you "only lose" from vnodes until your cluster is larger
> than a certain sizes, and then only when adding or removing nodes from a
> cluster. Perhaps if you are ramping up and scientifically testing smaller
> cluster sizes, you should start at first with a token per range, ie
> "pre-vnodes" operation?
>
>> I basically did the command and it was outputting 256 tokens on each node
>> and comma separated.  So I tried taking that string and setting that as the
>> value to initial_token but the node wouldn't start up.
>>
>> Not sure if I maybe had a carriage return in there and that was the
>> problem.
>
>
> It should take a comma delimited list of tokens, did the failed node startup
> log any error?
>
>>
>> And if I do that do I need to do more than comment out num_tokens?
>
>
> No, though you probably should anyway in order to be unambiguous.
>
> =Rob
>



-- 
Jon Haddad
http://www.rustyrazorblade.com
skype: rustyrazorblade

Re: horizontal query scaling issues follow on

Reply via email to