On Thu, Jul 21, 2011 at 9:14 AM, Jonathan Colby <jonathan.co...@gmail.com> wrote: > I regularly run repair on my cassandra cluster. However, I often seen that > during the repair operation very large amounts of data are transferred to > other nodes.
https://issues.apache.org/jira/browse/CASSANDRA-2280 https://issues.apache.org/jira/browse/CASSANDRA-2816 > My questions is, if only some data is out of sync, why are entire Data files > being transferred? Repair streams ranges of files as a unit (which becomes a new file on the target node) rather than using the normal write path. -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of DataStax, the source for professional Cassandra support http://www.datastax.com