> * In the design discussed it is perfectly reasonable for data not to be on
> the archive node.
>
> You mean when having the 2 DC setup I mentioned and using TTL? In case I have
> the 2 DC setup but don't use TTL I don't understand why data wouldn't be on
> the archive node?
Originally you we
Hi,
On Wed, Jan 4, 2012 at 9:54 PM, aaron morton wrote:
> Some thoughts on the plan:
>
> * You are monkeying around with things, do not be surprised when
> surprising things happen.
>
I am just trying to explore different solutions for solving my problem.
> * Deliberately unbalancing the clust
Some thoughts on the plan:
* You are monkeying around with things, do not be surprised when surprising
things happen.
* Deliberately unbalancing the cluster may lead to Bad Things happening.
* In the design discussed it is perfectly reasonable for data not to be on the
archive node.
* Truncat
Hi,
On Tue, Jan 3, 2012 at 8:19 PM, aaron morton wrote:
> Running a time-based rolling window of data can be done using the TTL.
> Backing up the nodes for disaster recovery can be done using snapshots.
> Restoring any point in time will be tricky because you may restore columns
> where the TTL h
Running a time-based rolling window of data can be done using the TTL. Backing
up the nodes for disaster recovery can be done using snapshots. Restoring any
point in time will be tricky because you may restore columns where the TTL has
expired.
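
To illustrate why a restore interacts badly with TTLs, here is a rough sketch in
plain Python. It does not use any Cassandra API. The Column class, the
live_columns helper and the 30-day window are made up for the example, and the
expiry rule (write time plus TTL) is a simplified model of what the server does.

```python
from dataclasses import dataclass
from typing import List, Optional

DAY = 86400  # seconds in a day


@dataclass(frozen=True)
class Column:
    """Hypothetical stand-in for a stored column, not a Cassandra type."""
    name: str
    written_at: int          # write time, in seconds
    ttl: Optional[int]       # time to live in seconds, None means no expiry

    def is_expired(self, now: int) -> bool:
        # Simplified model: a column is dead once now reaches write time + TTL.
        return self.ttl is not None and now >= self.written_at + self.ttl


def live_columns(snapshot: List[Column], now: int) -> List[str]:
    """Names of the columns in a restored snapshot that are still readable at `now`."""
    return [c.name for c in snapshot if not c.is_expired(now)]


# A snapshot taken on day 25 of a 30-day rolling window.
snapshot = [
    Column("old", written_at=0, ttl=30 * DAY),
    Column("recent", written_at=20 * DAY, ttl=30 * DAY),
    Column("permanent", written_at=0, ttl=None),
]

at_snapshot_time = live_columns(snapshot, now=25 * DAY)
restored_later = live_columns(snapshot, now=40 * DAY)
restored_much_later = live_columns(snapshot, now=60 * DAY)
```

Under these assumptions, everything in the snapshot is readable when it is
taken, but restoring it on day 40 brings back "old" already expired, and on day
60 only the column without a TTL is still live.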
> Will I get a single copy of the data in the remot
Hi,
I need to build a system that stores data for years, so yes, I am backing
up data in another mass storage system from where it could be later
accessed. The data that I successfully back up has to be deleted from my
cluster to make space for new data coming in.
I was aware of the snapshotti
That sounds a little complicated.
Do you want to get the data out for an off-node backup, or is it for processing
in another system?
You may get by using:
* TTL to expire data via compaction
* snapshots for backups
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http:
Hi everyone and Happy New Year!
I need advice on organizing data flow out of my 3-node Cassandra 0.8.6
cluster. I am configuring my keyspace to use the NetworkTopologyStrategy. I
have 2 data centers, each with a replication factor of 1 (i.e. DC1:1; DC2:1)
the configuration of the PropertyFileSnit