On Tue, Oct 29, 2013 at 4:39 PM, Robert Coli <rc...@eventbrite.com> wrote:

> On Tue, Oct 29, 2013 at 4:09 PM, Piavlo <lolitus...@gmail.com> wrote:
>
>> I can't understand how statistically it's possible that with random
>> partitioner some nodes have x2 more keys than others with 50-100 millions
>> of keys per node.
>>
>
> 1) you have very high row variance and really really bad luck with the
> distribution of your super-huge rows
>

Oh, herp, you specified that you have a number-of-keys difference. So it
can't be this one.

But..

First, that number of keys is the number of keys *SEEN BY COMPACTION SINCE
NODE STARTUP*, so nodes with different uptimes can report skewed numbers.

Second, that number counts a key once for every sstable it appears in, not
once per distinct key. A single row key with fragments in 5 sstables = "5".
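
To make the second point concrete, here is a minimal sketch in Python. It is
not Cassandra code: the sstables are modeled as plain sets of row keys, and
the key names and function names are made up for illustration.

```python
# Hypothetical data: each set holds the row keys present in one sstable
# on a single node. "user:1" has fragments in all 5 sstables.
sstables = [
    {"user:1", "user:2"},
    {"user:1", "user:3"},
    {"user:1"},
    {"user:1", "user:2"},
    {"user:1"},
]

def per_sstable_key_count(tables):
    """Sum the keys of every sstable: a key in N sstables counts N times."""
    return sum(len(t) for t in tables)

def distinct_key_count(tables):
    """Count unique row keys across all sstables."""
    return len(set().union(*tables))

reported = per_sstable_key_count(sstables)  # counts "user:1" five times
distinct = distinct_key_count(sstables)     # counts "user:1" once

print("per-sstable count:", reported)
print("distinct keys:", distinct)
```

With this toy data the per-sstable count is 8 while the node holds only 3
distinct keys, so two nodes with the same keys but different numbers of
sstables can report very different totals.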

Have you added nodes to the cluster and not run a cleanup compaction? Until
cleanup runs, the old nodes keep the keys for ranges they no longer own.

=Rob
