> They use a separate implementation of instance initialization and thus they test the test server rather than the real node.
This is actually my main concern. What is the real gap between the in-jvm test server instance and a server as run by the python dtests?

On Tue, 29 Mar 2022 at 00:08, bened...@apache.org <bened...@apache.org> wrote:

> Other than that, it can be problematic to test upgrades when the starting version must run with a different Java version than the end release

The python upgrade tests seem to be particularly limited (from a quick skim, primarily testing major upgrade points that are now long in the past), so I'm not sure how much of a penalty this is today in practice - but it might well become a problem.

There are several questions to answer, namely how many versions we want to:

- test upgrades across
- maintain backwards compatibility of the in-jvm dtest API across
- support a given JVM for

However, if we need to, we can probably use RMI to transparently support multiple JVMs for tests that require it. Since we already use serialization to cross the ClassLoader boundary, it might not even be very difficult.

From: Jacek Lewandowski <lewandowski.ja...@gmail.com>
Date: Monday, 28 March 2022 at 22:30
To: dev@cassandra.apache.org <dev@cassandra.apache.org>
Subject: Re: [DISCUSS] Should we deprecate / freeze python dtests

Although I like in-jvm dtests for many scenarios, I can see that they do not test the production code as it is. They use a separate implementation of instance initialization and thus they test the test server rather than the real node. Other than that, it can be problematic to test upgrades when the starting version must run with a different Java version than the end release. One more thing I've been observing sometimes is high consumption of metaspace, which does not seem to be cleaned up after individual test cases. Given that each started instance uses a dedicated class loader, some amount of garbage is left behind, and when there are a couple of multi-node test cases in a single test class, it sometimes happens that the tests fail with an out-of-memory (Metaspace) error.

Thanks,
Jacek

On Mon, Mar 28, 2022 at 10:06 PM David Capwell <dcapw...@apple.com> wrote:

I am back, and the work for trunk to support vnodes is at the last stage of review; I had not planned to backport the changes to other branches (i.e. older branches would only support a single token), so if someone would like to pick up this work it is rather LHF after CASSANDRA-17332 goes in (see the trunk patch GH PR: <https://github.com/apache/cassandra/pull/1432>).

I am in favor of deprecating python dtests, and agree we should figure out what the gaps are (once vnode support is merged) so we can either shrink them or special-case them to unfreeze (such as startup changes being allowed).

On Mar 14, 2022, at 6:13 AM, Josh McKenzie <jmcken...@apache.org> wrote:

vnode support for in-jvm dtests is in flight and fairly straightforward:

https://issues.apache.org/jira/browse/CASSANDRA-17332

David's OOO right now, but I suspect we can get this in some time in April.

On Mon, Mar 14, 2022, at 8:36 AM, bened...@apache.org wrote:

This is the limitation I mentioned. I think this is solely a question of supplying an initial config that uses vnodes, i.e. one that specifies multiple tokens for each node. It is not really a limitation – I believe a dtest could be written today using vnodes, by overriding the config's tokens. It does look like the token handling has been refactored since the initial implementation to make this a little uglier than should be necessary.

We should make this trivial, anyway, and perhaps offer a way to run all of the dtests with vnodes (and suitably annotating those that cannot be run with vnodes). This should be quite easy.
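To illustrate the point about overriding the config's tokens, here is a minimal sketch of what a vnode-configured in-jvm cluster setup might look like, assuming the org.apache.cassandra.distributed Cluster builder and IInstanceConfig.set; whether setting num_tokens alone is enough, or initial_token must also be overridden per instance, is an assumption rather than a confirmed recipe:

    // Sketch only: build a cluster whose instances request multiple tokens by
    // overriding the instance config. The config key/value ("num_tokens", 16)
    // is an assumption; per-instance "initial_token" overrides may also be needed.
    import org.apache.cassandra.distributed.Cluster;

    public class VnodeConfigSketch
    {
        public static void main(String[] args) throws Throwable
        {
            try (Cluster cluster = Cluster.build(3)
                                          .withConfig(config -> config.set("num_tokens", 16))
                                          .start())
            {
                // each node should now own several token ranges instead of a single one
                System.out.println("started " + cluster.size() + " vnode-configured instances");
            }
        }
    }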
From: Andrés de la Peña <adelap...@apache.org>
Date: Monday, 14 March 2022 at 12:28
To: dev@cassandra.apache.org <dev@cassandra.apache.org>
Subject: Re: [DISCUSS] Should we deprecate / freeze python dtests

Last time I checked there wasn't support for vnodes in the in-jvm dtests, which seems an important limitation.

On Mon, 14 Mar 2022 at 12:24, bened...@apache.org <bened...@apache.org> wrote:

I am strongly in favour of deprecating python dtests in all cases where they are currently superseded by in-jvm dtests. They are environmentally more challenging to work with, causing many problems on local and remote machines. They are harder to debug, slower, flakier, and mostly less sophisticated.

> all focus on getting the in-jvm framework robust enough to cover edge-cases

Would be great to collect the gaps. I think it's just vnodes, which is by no means a fundamental limitation? There may also be some stuff to do with startup/shutdown and environmental scripts, which may be a niche we retain something like python dtests for.

> people aren't familiar

I would be interested to hear from these folk to understand their concerns or problems using in-jvm dtests, if there is a cohort holding off for this reason.

> This is going to require documentation work from some of the original authors

I think a collection of template-like tests we can point people to would be a cheap initial effort. Cutting and pasting an existing test with the required functionality, then editing it to suit, should get most people who aren't familiar off to a quick start.

> Labor and process around revving new releases of the in-jvm dtest API

I think we need to revisit how we do this, as it is currently broken. We should consider either using ASF snapshots until we cut new releases of C* itself, or else using git subprojects. This will also become a problem for Accord's integration over time, and perhaps for other subprojects in future, so it is worth solving this properly.

I think this has been made worse than necessary by moving too many implementation details to the shared API project – some should be retained within the C* tree, with the API primarily serving as the shared API itself to ensure cross-version compatibility. However, this is far from a complete explanation of (or solution to) the problem.
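As a concrete example of the kind of template-like test that could be cut and pasted, a minimal sketch using the in-jvm dtest API (Cluster, schemaChange, and coordinator/execute); the keyspace and table names and the plain JUnit assertions are illustrative choices, not a prescribed pattern:

    // Sketch of a copy-paste starting point: write through one node's
    // coordinator, read through another's, and assert on the result.
    // Names (ks, t) and values are illustrative.
    import org.apache.cassandra.distributed.Cluster;
    import org.apache.cassandra.distributed.api.ConsistencyLevel;
    import org.junit.Assert;
    import org.junit.Test;

    public class ExampleInJvmDTest
    {
        @Test
        public void writeOnOneNodeReadOnAnother() throws Throwable
        {
            try (Cluster cluster = Cluster.build(2).start())
            {
                cluster.schemaChange("CREATE KEYSPACE ks WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 2}");
                cluster.schemaChange("CREATE TABLE ks.t (pk int PRIMARY KEY, v int)");

                // write through node 1, read through node 2, both at ALL
                cluster.coordinator(1).execute("INSERT INTO ks.t (pk, v) VALUES (?, ?)", ConsistencyLevel.ALL, 1, 42);
                Object[][] rows = cluster.coordinator(2).execute("SELECT v FROM ks.t WHERE pk = ?", ConsistencyLevel.ALL, 1);

                Assert.assertEquals(1, rows.length);
                Assert.assertEquals(42, rows[0][0]);
            }
        }
    }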
From: Josh McKenzie <jmcken...@apache.org>
Date: Monday, 14 March 2022 at 12:11
To: dev@cassandra.apache.org <dev@cassandra.apache.org>
Subject: [DISCUSS] Should we deprecate / freeze python dtests

I've been wrestling with the python dtests recently, and that led to some discussions with other contributors about whether we as a project should be writing new tests in the python dtest framework or the in-jvm framework. This discussion has come up tangentially on some other topics, including the lack of documentation / expertise on the in-jvm framework disincentivizing some folks from authoring new tests there, vs. the difficulty of debugging and maintaining timer-based, sleep-based, non-deterministic python dtests, etc.

I don't know of a place where we've formally discussed this and made a project-wide call on where we expect new distributed tests to be written; if I've missed an email about this, someone please link it on the thread here (and stop reading! ;))

At this time we don't specify a preference for where you write new multi-node distributed tests on the "development/testing" portion of the site and documentation:
https://cassandra.apache.org/_/development/testing.html

The primary tradeoffs as I understand them for moving from python-based multi-node testing to jdk-based are:

Pros:
1. Better debugging functionality (breakpoints, IDE integration, etc.)
2. Integration with the simulator
3. More deterministic runtime (anecdotally; python dtests _should_ be deterministic, but in practice they prove to be very prone to environmental disruption)
4. Test-time visibility into the internals of Cassandra

Cons:
1. The framework is not as mature as the python dtest framework (some functionality is missing)
2. Labor and process around revving new releases of the in-jvm dtest API
3. People aren't familiar with it yet and there's a learning curve

So my bid here: I personally think we as a project should freeze writing new tests in the python dtest framework and all focus on getting the in-jvm framework robust enough to cover the edge cases that might still be causing new tests to be written in the python framework. This is going to require documentation work from some of the original authors of the in-jvm framework, as well as folks currently familiar with it, and effort from those of us not yet intimately familiar with the API to get to know it; however, I believe the long-term benefits to the project will be well worth it.

We could institute a pre-commit check that warns when a commit increases our raw count of python dtests, to help provide process-based visibility into this change in direction for the project's testing (see the sketch below).

So: what do we think?
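As an illustration of the pre-commit guard suggested above, a hypothetical sketch that counts python dtest functions and warns when the count grows past a recorded baseline; the file-naming convention, baseline mechanism, and repository layout are all assumptions, not an existing tool:

    // Hypothetical pre-commit guard: count "def test_" occurrences under a
    // cassandra-dtest checkout and warn if the total exceeds a recorded baseline.
    import java.io.IOException;
    import java.io.UncheckedIOException;
    import java.nio.file.Files;
    import java.nio.file.Path;
    import java.nio.file.Paths;
    import java.util.regex.Pattern;
    import java.util.stream.Stream;

    public class PythonDtestCountCheck
    {
        private static final Pattern TEST_DEF = Pattern.compile("^\\s*def test_", Pattern.MULTILINE);

        public static void main(String[] args) throws IOException
        {
            Path dtestRepo = Paths.get(args[0]);      // path to a cassandra-dtest checkout (assumed layout)
            int baseline = Integer.parseInt(args[1]); // previously recorded count (assumed baseline source)

            long count;
            try (Stream<Path> files = Files.walk(dtestRepo))
            {
                count = files.filter(p -> p.toString().endsWith("_test.py"))
                             .mapToLong(PythonDtestCountCheck::countTestDefs)
                             .sum();
            }

            if (count > baseline)
                System.err.println("WARNING: python dtest count grew from " + baseline + " to " + count
                                   + "; please consider writing new multi-node tests as in-jvm dtests");
        }

        private static long countTestDefs(Path file)
        {
            try
            {
                return TEST_DEF.matcher(Files.readString(file)).results().count();
            }
            catch (IOException e)
            {
                throw new UncheckedIOException(e);
            }
        }
    }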