RE: Best compaction strategy for rarely used data

2023-01-06 Thread onmstester onmstester via user
o:l...@lapo.it> Sent: Friday, December 30, 2022 4:12 AM To: mailto:user@cassandra.apache.org Subject: [EXTERNAL] Re: Best compaction strategy for rarely used data   On 2022-12-29 21: 54, Durity, Sean R via user wrote: > At some point you will end up with large sstables (like 1 TB) t

RE: Best compaction strategy for rarely used data

2022-12-30 Thread Durity, Sean R via user
: [EXTERNAL] Re: Best compaction strategy for rarely used data On 2022-12-29 21: 54, Durity, Sean R via user wrote: > At some point you will end up with large sstables (like 1 TB) that won’t > compact because there are not 4 similar-sized ones able to be compacted Yes, that's exa

Re: Best compaction strategy for rarely used data

2022-12-30 Thread Lapo Luchini
On 2022-12-29 21:54, Durity, Sean R via user wrote: At some point you will end up with large sstables (like 1 TB) that won’t compact because there are not 4 similar-sized ones able to be compacted Yes, that's exactly what's happening. I'll see maybe just one more compaction, since the biggest

RE: Best compaction strategy for rarely used data

2022-12-29 Thread Durity, Sean R via user
, December 29, 2022 4:51 AM To: user@cassandra.apache.org Subject: [EXTERNAL] Re: Best compaction strategy for rarely used data Hi Lapo Take a look at TWCS, I think that could help your use case: https:  //thelastpickle. com/blog/2016/12/08/TWCS-part1. html [thelastpickle. com] Regards Paul Chandler

Re: Best compaction strategy for rarely used data

2022-12-29 Thread Paul Chandler
Hi Lapo Take a look at TWCS, I think that could help your use case: https://thelastpickle.com/blog/2016/12/08/TWCS-part1.html Regards Paul Chandler Sent from my iPhone > On 29 Dec 2022, at 08:55, Lapo Luchini wrote: > > Hi, I have a table which gets (a lot of) data that is written once an

Best compaction strategy for rarely used data

2022-12-29 Thread Lapo Luchini
Hi, I have a table which gets (a lot of) data that is written once and very rarely read (it is used for data that is mandatory for regulatory reasons), and almost never deleted. I'm using the default SCTS as at the time I didn't know any better, but SSTables size are getting huge, which is a p

Re: Best compaction strategy

2018-10-26 Thread Jeff Jirsa
> within the TTL it's the wrong strategy. > This diagram is a good rule of the thumb > > > >> On Thu, Oct 25, 2018 at 6:28 AM Alexander Dejanovski >> wrote: >> Hi Raman, >> >> TWCS is the best compaction strategy for TTL data, even if you

Re: Best compaction strategy

2018-10-25 Thread Jonathan Haddad
To add to what Alex suggested, if you know what keys use what TTL you could store them in different tables, with different window settings. Jon On Fri, Oct 26, 2018 at 1:28 AM Alexander Dejanovski wrote: > Hi Raman, > > TWCS is the best compaction strategy for TTL data, even if

Re: Best compaction strategy

2018-10-25 Thread Alexander Dejanovski
Hi Raman, TWCS is the best compaction strategy for TTL data, even if you have different TTLs (set the time window based on your largest TTL, so it would be 1 day in your case). Enable unchecked tombstone compaction to clear the data with 2 days TTL along the way. This is done by setting : ALTER

Best compaction strategy

2018-10-25 Thread raman gugnani
Hi All, I have one table in which i have some data which has TTL of 2days and some data which has TTL of 60 days. What compaction strategy will suits the most. 1. LeveledCompactionStrategy (LCS) 2. SizeTieredCompactionStrategy (STCS) 3. TimeWindowCompactionStrategy (TWCS) -- Raman Gug

Re: Best compaction strategy for counters tables

2018-01-18 Thread Octavian Rinciog
Hi Alain, Thank you for your response. In my case, counter is the main table, having almost 40% of all data. Thank you for the recommendation about testing on one node. 2018-01-18 13:02 GMT+02:00 Alain RODRIGUEZ : > Hello, > > I believe there is not a really specifically good strategy for counte

Re: Best compaction strategy for counters tables

2018-01-18 Thread Alain RODRIGUEZ
Hello, I believe there is not a really specifically good strategy for counters. Often counter tables size is relatively low (compared to events / raw data). So depending on the workload you might want to pick one or the other. Given the high number of reads the table will have to face (during read

Best compaction strategy for counters tables

2018-01-17 Thread Octavian Rinciog
Hello! I am using Cassandra 3.10. I have a counter table, with the following schema and RF=1 CREATE TABLE edges ( src_id text, src_type text, source text weight counter, PRIMARY KEY ((src_id, src_type), source) ); SELECT vs UPDATE requests ratio for this table is 0.1 READ vs W