Re: RFC: Cassandra Virtual Nodes

2012-03-22 Thread Zhu Han
On Fri, Mar 23, 2012 at 6:54 AM, Peter Schuller wrote: > > You would have to iterate through all sstables on the system to repair > one > > vnode, yes: but building the tree for just one range of the data means > that > > huge portions of the sstables files can be skipped. It should scale down >

Re: RFC: Cassandra Virtual Nodes

2012-03-22 Thread Peter Schuller
> You would have to iterate through all sstables on the system to repair one > vnode, yes: but building the tree for just one range of the data means that > huge portions of the sstables files can be skipped. It should scale down > linearly as the number of vnodes increases (ie, with 100 vnodes, it

Re: RFC: Cassandra Virtual Nodes

2012-03-22 Thread Stu Hood
> > Does the new scheme still require the node to re-iterate all sstables to > build the merkle tree or streaming data for partition level > repair and move? You would have to iterate through all sstables on the system to repair one vnode, yes: but building the tree for just one range of the data

Re: RFC: Cassandra Virtual Nodes

2012-03-22 Thread Zhu Han
On Thu, Mar 22, 2012 at 6:20 PM, Richard Low wrote: > On 22 March 2012 05:48, Zhu Han wrote: > > > I second it. > > > > Is there some goals we missed which can not be achieved by assigning > > multiple tokens to a single node? > > This is exactly the proposed solution. The discussion is about h

Re: RFC: Cassandra Virtual Nodes

2012-03-22 Thread Richard Low
On 22 March 2012 05:48, Zhu Han wrote: > I second it. > > Is there some goals we missed which can not be achieved by assigning > multiple tokens to a single node? This is exactly the proposed solution. The discussion is about how to implement this, and the methods of choosing tokens and replica