Hi,
 
If we think it the perspective of  column family (table), its rows are split 
into different nodes(Sharding) based on ring concept in Cassandra.
 
But the core unit of data storage (rows) id not spit across nodes, only copy is 
maintained in different rows. All column associated to a single row (based on 
row key) is stored in a single node , but Cassandra  
keep its copy (based on our configuration) another nodes for fault tolerance.
 
Hadoop(HDFS) does data sharding and replication . It split a file into multiple 
pieces and also copy these pieces into multiple nodes for fault tolerance.
 
Thanks & Regards
 
Job M Thomas| Platform & Technology
SunTec Business Solutions Pvt. Ltd
Mob : 7560885748

________________________________

From: Adaryl "Bob" Wakefield, MBA [mailto:adaryl.wakefi...@hotmail.com]
Sent: Tue 1/20/2015 9:57 AM
To: user@cassandra.apache.org
Subject: sharding vs what cassandra does


It’s my understanding that the way Cassandra replicates data across nodes is 
NOT sharding. Can someone provide a better explanation or correct my 
understanding?
B.

<<winmail.dat>>

Reply via email to