On Fri, May 17, 2013 at 11:13 AM, aaron morton <aa...@thelastpickle.com> wrote:
> Bootstrapping a new node into the cluster has a small impact on the existing
> nodes and the new nodes to have all the data they need when the finish the
> process.

Sorry for the pedantry, but bootstrapping from existing replicas cannot guarantee that the new nodes have "all" the data they need when they finish the process. There is a non-zero chance that the failed node contained the single under-replicated copy of a given datum. In practice, if your RF is >= 2, you are unlikely to experience this type of data loss. But restore-a-backup-then-repair protects you against this unlikely case.

=Rob
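
To make the failure case concrete, here is a toy model of the argument. It is not Cassandra code: the node names, keys, and function names are all made up for illustration, and it only tracks which nodes currently hold a copy of each key. Bootstrap can stream a key only if some surviving node still has it.

```python
# Toy model (hypothetical, not Cassandra internals):
# for each key, the set of nodes that currently hold a copy.
copies = {
    "k1": {"A", "B", "C"},  # fully replicated at RF=3
    "k2": {"A", "B"},       # one replica missed a write
    "k3": {"A"},            # under-replicated: only node A has it
}


def recoverable_by_bootstrap(copies, failed_node):
    """Keys a replacement node could stream from surviving replicas."""
    return {
        key
        for key, holders in copies.items()
        if holders - {failed_node}  # at least one other node has a copy
    }


def lost_without_backup(copies, failed_node):
    """Keys whose only copy lived on the failed node."""
    return set(copies) - recoverable_by_bootstrap(copies, failed_node)


if __name__ == "__main__":
    print(sorted(recoverable_by_bootstrap(copies, "A")))
    print(sorted(lost_without_backup(copies, "A")))
```

If node A fails, k1 and k2 can be streamed from B or C, but k3 exists nowhere else, so only a backup of A taken after k3 was written would bring it back.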