Re: scheduled work compaction strategy

2018-02-16 Thread Jeff Jirsa
There’s a company using TWCS in this config - I’m not going to out them, but I think they do it (or used to) with aggressive tombstone sub properties. They may have since extended/enhanced it somewhat. -- Jeff Jirsa > On Feb 16, 2018, at 2:24 PM, Carl Mueller > wrote: > > Oh and as a furth

Re: scheduled work compaction strategy

2018-02-16 Thread Carl Mueller
An even MORE complicated version could address the case where the TTLs are at the column key rather than the row key. That would divide the row across sstables by the rowkey, in essence the opposite of what most compaction strategies try to do: eventually centralize the data for a rowkey in one sst

Re: scheduled work compaction strategy

2018-02-16 Thread Carl Mueller
Oh and as a further refinement outside of our use case. If we could group/organize the sstables by the rowkey time value or inherent TTL value, the naive version would be evenly distributed buckets into the future. But many/most data patterns like this have "busy" data in the near term. Far out s

scheduled work compaction strategy

2018-02-16 Thread Carl Mueller
We have a scheduler app here at smartthings, where we track per-second tasks to be executed. These are all TTL'd to be destroyed after the second the event was registered with has passed. If the scheduling window was sufficiently small, say, 1 day, we could probably use a time window compaction s

Re: row tombstones as a separate sstable citizen

2018-02-16 Thread Carl Mueller
re: the tombstone sstables being read-only inputs to compaction, there would be one case the non-tombstone sstables would input to the compaction of the row tombstones: when the row no longer exists in any of the data sstables with respect to the row tombstone timestamp. There may be other opportu

Re: Release votes

2018-02-16 Thread Ariel Weisberg
Hi, I created https://issues.apache.org/jira/browse/CASSANDRA-14241 for this issue. You are right there is a solid chunk of failing tests on Apache infrastructure that don't fail on CircleCI. I'll find someone to get it done. I think that fix before commit is only going to happen if we go all t

[RELEASE] Apache Cassandra 2.2.12 released - PLEASE READ NOTICE

2018-02-16 Thread Michael Shuler
PLEASE READ: MAXIMUM TTL EXPIRATION DATE NOTICE (CASSANDRA-14092) -- The maximum expiration timestamp that can be represented by the storage engine is 2038-01-19T03:14:06+00:00, which means that inserts with TTL thatl expire after thi

[RELEASE] Apache Cassandra 2.1.20 released - PLEASE READ NOTICE

2018-02-16 Thread Michael Shuler
PLEASE READ: MAXIMUM TTL EXPIRATION DATE NOTICE (CASSANDRA-14092) -- The maximum expiration timestamp that can be represented by the storage engine is 2038-01-19T03:14:06+00:00, which means that inserts with TTL thatl expire after thi

Re: Release votes

2018-02-16 Thread Josh McKenzie
It stands out to me that Google a) does pre-submit test runs, b) has a dedicated team to identify and work with flaky tests, c) has some kind of quarantining, and d) doesn't report on a test being flaky unless it fails 3x in a row if it's marked as flaky. While we obviously can't pursue b since we'

[VOTE PASSED] Release Apache Cassandra 2.2.12

2018-02-16 Thread Michael Shuler
With 10 binding +1, 1 non-binding +1, and no other votes, this vote for 2.2.12 passes. I'll upload the artifacts today. -- Kind regards, Michael On 02/12/2018 02:30 PM, Michael Shuler wrote: > I propose the following artifacts for release as 2.2.12. > > sha1: 1602e606348959aead18531cb8027afb15f

[VOTE PASSED] Release Apache Cassandra 2.1.20

2018-02-16 Thread Michael Shuler
With 9 binding +1 and no other votes, the 2.1.20 release passes. I will get the artifacts uploaded today. -- Kind regards, Michael On 02/12/2018 02:30 PM, Michael Shuler wrote: > I propose the following artifacts for release as 2.1.20. > > sha1: b2949439ec62077128103540e42570238520f4ee > Git: >

Re: Release votes

2018-02-16 Thread Jason Brown
Hi, I'm ecstatic others are now running the tests and, more importantly, that we're having the conversation. I've become convinced we cannot always have 100% green tests. I am reminded of this [1] blog post from Google when thinking about flaky tests. The TL;DR is "flakiness happens", to the tune

Re: [VOTE] (Take 2) Release Apache Cassandra 3.0.16

2018-02-16 Thread Tommy Stendahl
+1 On 2018-02-14 21:40, Michael Shuler wrote: I propose the following artifacts for release as 3.0.16. sha1: 890f319142ddd3cf2692ff45ff28e71001365e96 Git: http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/3.0.16-tentative Artifacts: https://repository.apache.org/con

Re: [VOTE] (Take 3) Release Apache Cassandra 3.11.2

2018-02-16 Thread Tommy Stendahl
+1 On 2018-02-14 22:09, Michael Shuler wrote: I propose the following artifacts for release as 3.11.2. sha1: 1d506f9d09c880ff2b2693e3e27fa58c02ecf398 Git: http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/3.11.2-tentative Artifacts: https://repository.apache.org/con

Re: Release votes

2018-02-16 Thread Dinesh Joshi
I'm new to this project and here are my two cents. If there are tests that are constantly failing or flaky and you have had releases despite their failures, then they're not useful and can be disabled. They can always be reenabled if they are in fact valuable. Having 100% blue dashboard is not i