On Thu, Jul 21, 2011 at 9:14 AM, Jonathan Colby
<jonathan.co...@gmail.com> wrote:
> I regularly run repair on my Cassandra cluster. However, I often see that
> during the repair operation very large amounts of data are transferred to
> other nodes.

https://issues.apache.org/jira/browse/CASSANDRA-2280
https://issues.apache.org/jira/browse/CASSANDRA-2816

> My question is, if only some data is out of sync, why are entire Data files
> being transferred?

Repair streams ranges of files as a unit (which becomes a new file on
the target node) rather than using the normal write path.
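
To illustrate why range-level streaming can move far more data than is
actually out of sync, here is a toy model in Python. It is not Cassandra
code: the helper names (range_digest, ranges_to_stream, rows_streamed), the
range ids, and the use of one SHA-256 digest per range are all assumptions
made for the sketch. The only point it shows is that when replicas are
compared per range, a single differing row marks its whole range for
streaming.

```python
import hashlib


def range_digest(rows):
    """Hash every (key, value) pair of one range into a single digest."""
    h = hashlib.sha256()
    for key, value in sorted(rows.items()):
        h.update(f"{key}={value};".encode())
    return h.hexdigest()


def ranges_to_stream(local, remote):
    """Return ids of ranges whose digests differ between two replicas."""
    return [
        rid
        for rid in local
        if range_digest(local[rid]) != range_digest(remote.get(rid, {}))
    ]


def rows_streamed(local, mismatched):
    """Count the rows sent when each mismatched range is streamed whole."""
    return sum(len(local[rid]) for rid in mismatched)


# Two hypothetical ranges of 1000 rows each (illustrative data only).
local = {
    "range-0": {f"k{i}": "v" for i in range(1000)},
    "range-1": {f"k{i}": "v" for i in range(1000, 2000)},
}
remote = {rid: dict(rows) for rid, rows in local.items()}
remote["range-1"]["k1500"] = "stale"  # exactly one row is out of sync

mismatched = ranges_to_stream(local, remote)
print(mismatched, rows_streamed(local, mismatched))
```

In this sketch one stale row causes all 1000 rows of its range to be sent,
so the amount transferred depends on how coarse the compared ranges are, not
on how many rows actually differ.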

-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com
