On Tue, Oct 30, 2012 at 4:18 AM, 張 睿 <chou...@cyberagent.co.jp> wrote: > Does anyone here know if there is an efficient way to migrate multiple > cassandra clusters' data > to a single cassandra cluster without any dataloss.
Yes. 1) create schema which is superset of all columnfamilies and all keyspaces 2) if all source clusters were the same fixed number of nodes, create a new cluster with the same fixed number of nodes 3) nodetool drain and shut down all nodes on all participating clusters 4) copy sstables from old clusters, maintaining that data from source node [x] ends up on target node [x] 5) start cassandra However without more details as to your old clusters, new clusters, and availability requirements, I can't give you a more useful answer. Here's some background on bulk loading, including "copy-the-sstables." http://palominodb.com/blog/2012/09/25/bulk-loading-options-cassandra =Rob -- =Robert Coli AIM>ALK - rc...@palominodb.com YAHOO - rcoli.palominob SKYPE - rcoli_palominodb