Hi,

From the perspective of a column family (table), its rows are distributed across different nodes (sharding) based on Cassandra's ring concept. But the core unit of data storage, the row, is not split across nodes; only copies of it are maintained on different nodes. All columns associated with a single row (identified by its row key) are stored on a single node, and Cassandra keeps copies of that row (according to the configured replication factor) on other nodes for fault tolerance.

Hadoop (HDFS) does both data sharding and replication. It splits a file into multiple pieces (blocks) and also copies these pieces to multiple nodes for fault tolerance.

Thanks & Regards
Job M Thomas | Platform & Technology
SunTec Business Solutions Pvt. Ltd
Mob : 7560885748
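P.S. To make the ring idea concrete, here is a minimal Python sketch. It is only an illustration under simplifying assumptions, not how Cassandra is actually implemented: the `token` function, the `Ring` class, and the node names are all hypothetical, each node gets a single token (no virtual nodes), MD5 stands in for the real partitioner, and replicas are simply the next nodes clockwise on the ring. It shows the two points above: different row keys land on different nodes (sharding), while each whole row is copied to `replication_factor` nodes (replication).

```python
import bisect
import hashlib


def token(key: str) -> int:
    """Hypothetical partitioner: map a string to a position on the ring via MD5."""
    return int(hashlib.md5(key.encode("utf-8")).hexdigest(), 16)


class Ring:
    """Toy token ring: one token per node, replicas placed clockwise."""

    def __init__(self, nodes, replication_factor):
        if not 1 <= replication_factor <= len(nodes):
            raise ValueError("replication_factor must be between 1 and the node count")
        self.replication_factor = replication_factor
        # Sort nodes by their token so the list represents the ring in order.
        self._ring = sorted((token(node), node) for node in nodes)
        self._tokens = [t for t, _ in self._ring]

    def replicas(self, row_key: str):
        """Return the nodes that hold a full copy of the row for row_key."""
        # First node whose token is >= the row's token; wrap around at the end.
        start = bisect.bisect_left(self._tokens, token(row_key)) % len(self._ring)
        # The row is never split: each listed node stores the entire row.
        return [
            self._ring[(start + i) % len(self._ring)][1]
            for i in range(self.replication_factor)
        ]


ring = Ring(["node-a", "node-b", "node-c", "node-d"], replication_factor=2)
for row_key in ("user:1", "user:2", "user:3"):
    # The first node in each list is the primary owner; the rest hold copies.
    print(row_key, "->", ring.replicas(row_key))
```

With `replication_factor=2`, every row key maps to exactly two distinct nodes, and the same key always maps to the same two nodes.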
________________________________
From: Adaryl "Bob" Wakefield, MBA [mailto:adaryl.wakefi...@hotmail.com]
Sent: Tue 1/20/2015 9:57 AM
To: user@cassandra.apache.org
Subject: sharding vs what cassandra does

It’s my understanding that the way Cassandra replicates data across nodes is NOT sharding. Can someone provide a better explanation or correct my understanding?

B.