Re: Why data is not even distributed.

2012-10-08 Thread Andrey Ilinykh
The problem was - I calculated 3 tokens for random partitioner but used them with BOP, so nodes were not supposed to be loaded evenly. That's ok, I got it. But what I don't understand, why nodetool ring shows equal ownership. This is an example: I created small cluster with BOP and three tokens 00

Re: Why data is not even distributed.

2012-10-08 Thread aaron morton
This is an issue with using the BOP. If you are just starting out stick with the Random Partitioner. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 5/10/2012, at 10:33 AM, Andrey Ilinykh wrote: > It was my first thought. > Then I md5

Re: Why data is not even distributed.

2012-10-04 Thread Andrey Ilinykh
It was my first thought. Then I md5 uuid and used the digest as a key: MessageDigest md = MessageDigest.getInstance("MD5"); //in the loop UUID uuid = UUID.randomUUID(); byte[] bytes = md.digest(asByteArray(uuid)); the result is exactly the same, first node takes 66%, second 33% and third one is

Re: Why data is not even distributed.

2012-10-04 Thread Tom
Hi Andrey, while the data values you generated might be following a true random distribution, your row key, UUID, is not (because it is created on the same machines by the same software with a certain window of time) For example, if you were using the UUID class in Java, these would be composed f

Why data is not even distributed.

2012-10-03 Thread Andrey Ilinykh
Hello, everybody! I'm observing very strange behavior. I have 3 node cluster with ByteOrderPartitioner. (I run 1.1.5) I created a key space with replication factor of 1. Then I created one column family and populated it with random data. I use UUID as a row key, and Integer as a column name. Row k