Yup, with repair and particularly bootstrap there is a decent amount of "over streaming" of data, due to the fact that it's just sending an sstable.

On Fri, 6 May 2016 at 14:49 Anubhav Kale <anubhav.k...@microsoft.com> wrote:

> Does repair really send SS Table files as is ? Wouldn’t data for tokens be
> distributed across SS Tables ?
>
> *From:* Jeff Jirsa [mailto:jeff.ji...@crowdstrike.com]
> *Sent:* Friday, May 6, 2016 2:12 PM
> *To:* user@cassandra.apache.org
> *Subject:* Re: SS Tables Files Streaming
>
> Also probably sstableloader / bulk loading interface
>
> (I don’t think any of these necessarily stream “as-is”, but that’s a
> different conversation I suspect)
>
> *From:* Jonathan Haddad
> *Reply-To:* "user@cassandra.apache.org"
> *Date:* Friday, May 6, 2016 at 1:52 PM
> *To:* "user@cassandra.apache.org"
> *Subject:* Re: SS Tables Files Streaming
>
> Repairs, bootstrap, decommission.
>
> On Fri, May 6, 2016 at 1:16 PM Anubhav Kale <anubhav.k...@microsoft.com>
> wrote:
>
> Hello,
>
> In what scenarios can SS Table files on disk from Node 1 go to Node 2 as
> is ? I’m aware this happens in *nodetool rebuild* and I am assuming this
> does *not* happen in repairs. Can someone confirm ?
>
> The reason I ask is I am working on a solution for backup / restore and I
> need to be sure that if I boot a node and start copying over backed up
> files, those files won’t get overwritten by something coming from other
> nodes.
>
> Thanks !

--
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Managed Cassandra / Spark on AWS, Azure and Softlayer