Note that incremental repair strategies (2.1+) run anti-compaction against sstables in the range being repaired, so this will prevent overstreaming based on the ranges in the repair session.
On Mon, 9 May 2016 at 10:31 Ben Bromhead <b...@instaclustr.com> wrote: > Yup, with repair and particularly bootstrap is there is a decent amount of > "over streaming" of data due to the fact it's just sending an sstable. > > On Fri, 6 May 2016 at 14:49 Anubhav Kale <anubhav.k...@microsoft.com> > wrote: > >> Does repair really send SS Table files as is ? Wouldn’t data for tokens >> be distributed across SS Tables ? >> >> >> >> *From:* Jeff Jirsa [mailto:jeff.ji...@crowdstrike.com] >> *Sent:* Friday, May 6, 2016 2:12 PM >> >> >> *To:* user@cassandra.apache.org >> *Subject:* Re: SS Tables Files Streaming >> >> >> >> Also probably sstableloader / bulk loading interface >> >> >> >> >> >> >> >> >> >> (I don’t think any of these necessarily stream “as-is”, but that’s a >> different conversation I suspect) >> >> >> >> >> >> *From: *Jonathan Haddad >> *Reply-To: *"user@cassandra.apache.org" >> *Date: *Friday, May 6, 2016 at 1:52 PM >> *To: *"user@cassandra.apache.org" >> *Subject: *Re: SS Tables Files Streaming >> >> >> >> Repairs, bootstamp, decommission. >> >> >> >> On Fri, May 6, 2016 at 1:16 PM Anubhav Kale <anubhav.k...@microsoft.com> >> wrote: >> >> Hello, >> >> >> >> In what scenarios can SS Table files on disk from Node 1 go to Node 2 as >> is ? I’m aware this happens in *nodetool rebuild* and I am assuming >> this does *not* happen in repairs. Can someone confirm ? >> >> >> >> The reason I ask is I am working on a solution for backup / restore and I >> need to be sure if I boot a node, start copying over backed up files then >> those files won’t get overwritten by something coming from other nodes. >> >> >> >> Thanks ! >> >> -- > Ben Bromhead > CTO | Instaclustr <https://www.instaclustr.com/> > +1 650 284 9692 > Managed Cassandra / Spark on AWS, Azure and Softlayer > -- Ben Bromhead CTO | Instaclustr <https://www.instaclustr.com/> +1 650 284 9692 Managed Cassandra / Spark on AWS, Azure and Softlayer