ah, my flippant comments at the end. 

Instead of "Single point of failure" perhaps I should have said "specialist 
nodes are a bad idea as they may reduce the overall availability of the cluster 
to the availability any one sub group." e.g. a cluster of 10 nodes, where 8 are 
data and 2 are connections may be down for 100% of the keys after the loss of 2 
nodes if they happen to be the connection nodes.  

WRT the partitioner I was thinking of a situation such as 

node 1 : 33.3% of the ring
node 2 : 33.3% of the ring
node 3 : 33.3% of the ring
node 4 : 0.1% of the ring 

My point was that giving the node a small token range would not be enough to 
reduce it's data load. If node 4 was a functioning node in the ring then at RF 
3 it will be asked to be a replica for the data from nodes 2 and 3. Unless the 
replica strategy excluded the node from the list of natural endpoints for all 
but the token range it was responsible for. 

Aaron


On 21 Mar 2011, at 10:28, Robert Coli wrote:

> On Sun, Mar 20, 2011 at 1:20 PM, aaron morton <aa...@thelastpickle.com> wrote:
>> Even if the node is only
>> responsible for a small about of the ring, it would normally still get data
>> handed to it and read from it as a replica. You would need to use a Replica
>> Placement Strategy that knew it ignore the "connection only" nodes.
>> IMHO it's a bad idea: Single point of failure, wasted compute resources,
>> imbalance between "connection" and "worker" nodes.
> 
> As I understand what is being proposed, the node could only be
> responsible for a single token, and presumably would perform very well
> indeed when reading or writing that token. I don't see why you would
> need to avoid placing a single token's worth of data on a node, or why
> it would become a single point of failure if you did.. is there
> something I'm missing.. ?
> 
> =Rob

Reply via email to