We also run a nightly "nodetool snapshot" on all nodes, and use duplicity to sync the snapshot to S3, keeping 7 days' worth of backups.
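A rough sketch of what such a nightly job could look like, in case it helps. This is not our actual script: the data directory, bucket name, snapshot tag and the weekly full backup are placeholder assumptions, and the S3 URL scheme differs between duplicity versions. It only builds and prints the commands unless the dry run is switched off.

```python
import subprocess

# All four values below are hypothetical placeholders, not taken from our setup.
DATA_DIR = "/var/lib/cassandra/data"            # assumed Cassandra data directory
TARGET = "s3://example-bucket/cassandra/node1"  # assumed bucket/prefix; scheme varies by duplicity version
TAG = "nightly"                                 # assumed fixed snapshot tag, reused every night
KEEP = "7D"                                     # duplicity time spec for seven days


def build_commands(data_dir=DATA_DIR, target=TARGET, tag=TAG, keep=KEEP):
    """Return the nightly backup commands, in execution order, as argv lists."""
    # Snapshots live under <data_dir>/<keyspace>/<table>/snapshots/<tag>.
    snapshot_glob = f"{data_dir}/*/*/snapshots/{tag}"
    return [
        # Hard-link the current sstables of all keyspaces under snapshots/<tag>.
        ["nodetool", "snapshot", "-t", tag],
        # Upload only the snapshot directories. duplicity sends an incremental
        # unless the last full backup is older than `keep` (an assumption here,
        # so that old backup chains can eventually be expired).
        ["duplicity", "--full-if-older-than", keep,
         "--include", snapshot_glob, "--exclude", "**",
         data_dir, target],
        # Remove the local snapshot so the tag can be reused tomorrow.
        ["nodetool", "clearsnapshot", "-t", tag],
        # Expire remote backup sets older than `keep`; duplicity keeps any set
        # that a newer incremental still depends on.
        ["duplicity", "remove-older-than", keep, "--force", target],
    ]


def run_backup(dry_run=True):
    """Print the commands (dry run) or execute them, stopping on the first failure."""
    for argv in build_commands():
        if dry_run:
            print(" ".join(argv))
        else:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    run_backup(dry_run=True)
```

AWS credentials and duplicity's encryption settings are left out on purpose; they would need to be supplied through the environment before running this for real.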
Since duplicity tracks incremental changes, this gives you the benefit of point-in-time snapshots without duplicating sstables that are common across multiple backups. It also makes it easy to revert all nodes' state to X days ago in case of accidental or malicious data corruption.

On Thu, Jul 24, 2014 at 12:17 PM, Robert Coli <rc...@eventbrite.com> wrote:

> On Wed, Jul 23, 2014 at 4:12 PM, Hao Cheng <br...@critica.io> wrote:
>
>> 3. Using a backup system, either manually via rsync or through something
>> like Priam, to directly push backups of the data on ephemeral storage to S3.
>>
>
> https://github.com/JeremyGrosser/tablesnap
>
> =Rob