> I'm trying to understand what will happen when we start deleting the old data.
Are you going to delete data or use the TTL?

> With size tiered compaction, suppose we have one 160Gb sstable and some 
> smaller tables totalling 40Gb.
Not sure on that, it depends on the work load. 

>  My understanding is that, even if we start deleting, we will have to wait 
> for 3 more 160Gb tables to appear, in order to have the first sstable 
> compacted and the disk space freed. 
v1.2 will run compactions on single SSTables that have a high number of 
tombstones 
https://issues.apache.org/jira/browse/CASSANDRA-3442
https://issues.apache.org/jira/browse/CASSANDRA-4234

>  So although we need to store 200Gb worth of data, we'll need something like 
> 800Gb disk space in order to be on the safe side, right?

You want to keep the disks below 75% capacity, and want to have free space to 
handle node moves etc.  
I do not think you need 800GB because of tombstones deletions. 

> What would happen instead with leveled compaction? 
Levelled compaction is more suited to workloads that have a high insert/delete 
ratio. In your case, write once read many data is will suited to Sized Tiered. 

Cheers
 
-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 8/03/2013, at 9:13 AM, Flavio Baronti <f.baro...@list-group.com> wrote:

> Hello,
> 
> we are using Cassandra for storing time series data. We never update, only 
> append; we plan to store 1 year worth of data, occupying something around 
> 200Gb. I'm trying to understand what will happen when we start deleting the 
> old data.
> 
> With size tiered compaction, suppose we have one 160Gb sstable and some 
> smaller tables totalling 40Gb. My understanding is that, even if we start 
> deleting, we will have to wait for 3 more 160Gb tables to appear, in order to 
> have the first sstable compacted and the disk space freed. So although we 
> need to store 200Gb worth of data, we'll need something like 800Gb disk space 
> in order to be on the safe side, right?
> 
> What would happen instead with leveled compaction? And why is the default 
> sstable size so small (5Mb)? If we need to store 200Gb, this means we will 
> have 40k sstables; since each one makes 5 files, we'll have 200k files in a 
> single directory, which we'm afraid will undermine the stability of the file 
> system.
> 
> Thank you for your suggestions!
> 
> Flavio
> 

Reply via email to