Seems like my email crashed with Andres’ one. My understanding is we will use the ticket CASSANDRA-17113 as placeholder, the work there will be rebased/reworked etc depending on what we agree with. I also agree with the other points he made. Sounds reasonable to me
On Mon, 24 Oct 2022 at 15:03, Ekaterina Dimitrova <e.dimitr...@gmail.com> wrote: > Thank you Josh > > So about push with/without a single click, I guess you mean to > parameterize whether the step build needs approval or not? Pre-commit the > new flag will use the “no-approval” version, but during development we > still will be able to push the tests without immediately starting all > tests, right? > - parallelism + -h being removed - just to confirm, that means we will not > use xlarge containers. As David confirmed, this is not needed for all jibs > and it is important as otherwise whoever uses paid account will burn their > credits time faster for very similar duration runs. > > CASSANDRA-17930 - I will use the opportunity also to mention that many of > the identified missing jobs in CircleCI will be soon there - Andres is > working on all variations unit tests, I am doing final testing on fixing > the Python upgrade tests (we weren’t using the right parameters and running > way more jobs then we should) and Derek is looking into the rest of the > Python test. I still need to check whether we need something regarding > in-jvm etc, the simulator ones are running only for jdk8 for now, > confirmed. All this should unblock us to be able to do next releases based > on CircleCI as we agreed. Then we move to do some > changes/additions/improvements to Jenkins. And of course, the future > improvements we agreed on. > > On Mon, 24 Oct 2022 at 14:10, Josh McKenzie <jmcken...@apache.org> wrote: > >> Auto-run on push? Can you elaborate? >> >> Yep - instead of having to go to circle and click, when you push your >> branch the circle hook picks it up and kicks off the top level job >> automatically. I tend to be paranoid and push a lot of incremental work >> that's not ready for CI remotely so it's not great for me, but I think >> having it be optional is the Right Thing. >> >> So here's the outstanding work I've distilled from this thread: >> - Create an epic for circleci improvement work (we have a lot of little >> augments to do here; keep it organized and try and avoid redundancy) >> - Include CASSANDRA-17600 in epic umbrella >> - Include CASSANDRA-17930 in epic umbrella >> - Ticket to tune parallelism per job >> - >> > def java_parallelism(src_dir, kind, num_file_in_worker, include = >> lambda a, b: True): >> > d = os.path.join(src_dir, 'test', kind) >> > num_files = 0 >> > for root, dirs, files in os.walk(d): >> > for f in files: >> > if f.endswith('Test.java') and >> include(os.path.join(root, f), f): >> > num_files += 1 >> > return math.floor(num_files / num_file_in_worker) >> > >> > def fix_parallelism(args, contents): >> > jobs = contents['jobs'] >> > >> > unit_parallelism = java_parallelism(args.src, >> 'unit', 20) >> > jvm_dtest_parallelism = java_parallelism(args.src, >> 'distributed', 4, lambda full, name: 'upgrade' not in full) >> > jvm_dtest_upgrade_parallelism = java_parallelism(args.src, >> 'distributed', 2, lambda full, name: 'upgrade' in full) >> - `TL;DR - I find all test files we are going to run, and based off a >> pre-defined variable that says “idea” number of files per worker, I then >> calculate how many workers we need. So unit tests are num_files / 20 ~= 35 >> workers. Can I be “smarter” by knowing which files have higher cost? >> Sure… but the “perfect” and the “average” are too similar that it wasn’t >> worth it...` >> - Ticket to combine pre-commit jobs into 1 pipeline for all JDK's >> - Path to activate all supported JDK's for pre-commit at root >> (one-click pre-merge full validation) >> - Path to activate per JDK below that (interim work partial >> validation) >> - Ticket to rename jobs in circleci >> - Reference comment: >> https://issues.apache.org/jira/browse/CASSANDRA-17939?focusedCommentId=17617016&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17617016 >> - (buildjdk)_(runjdk)_(testsuite) format: >> - j8_j8_jvm_dtests >> - j8_j11_jvm_dtests >> - j11_j11_jvm_dtest_vnode >> etc >> - Ticket for flag in generate.sh to support auto run on push (see >> response above) >> - Ticket for: remove -h, have -f and -p (free and paid) (probably >> intersects with https://issues.apache.org/jira/browse/CASSANDRA-17600) >> >> Anything wrong w/the above or anything missed? If not, I'll go do some >> JIRA'ing. >> >> >> ~Josh >> >> >> On Fri, Oct 21, 2022, at 3:50 PM, Josh McKenzie wrote: >> >> I am cool with removing circle if apache CI is stable and works, we do >> need to solve the non-committer issue but would argue that partially exists >> in circle today (you can be a non-commuter with a paid account, but you >> can’t be a non-committer with a free account) >> >> There's a few threads here: >> 1. non-committers should be able to run ci >> 2. People that have resources and want to run ci faster should be able to >> do so (assuming the ci of record could serve to be faster) >> 3. ci should be stable >> >> Thus far we haven't landed on 1 system that satisfies all 3. There's some >> background discussions brainstorming how to get there; when / if things >> come from that they'll as always be brought to the list for discussion. >> >> On Fri, Oct 21, 2022, at 1:44 PM, Ekaterina Dimitrova wrote: >> >> I agree with David with one caveat - last time I checked only some Python >> tests lack enough resources with the free tier. The rest run slower than >> with a paid account, but they do fine. In fact I use the free tier if I >> want to test only unit or in-jvm tests sometimes. I guess that is what he >> meant by partially but even being able to run the non-Python tests is a win >> IMHO. If we find a solution for all tests though… even better. >> @Derek your idea sounds interesting, I will be happy to see a proposal. >> Thank you >> >> On Fri, 21 Oct 2022 at 13:39, David Capwell <dcapw...@apple.com> wrote: >> >> I am cool with removing circle if apache CI is stable and works, we do >> need to solve the non-committer issue but would argue that partially exists >> in circle today (you can be a non-commuter with a paid account, but you >> can’t be a non-committer with a free account) >> >> >> >> On Oct 20, 2022, at 2:20 PM, Josh McKenzie <jmcken...@apache.org> wrote: >> >> I believe it's original intention to be just about CircleCI. >> >> It was but fwiw I'm good w/us exploring adjacent things regarding CI >> here. I'm planning on deep diving on the thread tomorrow and distilling a >> snapshot of the work we have a consensus on for circle and summarizing here >> so we don't lose that. Seems like it's fairly non-controversial. >> >> On Thu, Oct 20, 2022, at 5:14 PM, Mick Semb Wever wrote: >> >> >> >> On Thu, 20 Oct 2022 at 22:07, Derek Chen-Becker <de...@chen-becker.org> >> wrote: >> >> Would the preclusion of non-committers also prevent us from configuring >> Jenkins to auto-test on PR independent of who opens it? >> >> One of my current concerns is that we're maintaining 2x the CI for 1x the >> benefit, and I don't currently see an easy way to unify them (perhaps a >> lack of imagination?). I know there's a long history behind the choice of >> CircleCI, so I'm not trying to be hand-wavy about all of the thought that >> went into that decision, but that decision has costs beyond just a paid >> CircleCI account. My long term, probably naive, goals for CI would be to: >> >> 1. Have a CI system that is *fully* available to *any* contributor, >> modulo safeguards to prevent abuse >> >> >> >> This thread is going off-topic, as I believe it's original intention to >> be just about CircleCI. >> >> But on your point… our community CI won't be allowed (by ASF), nor have >> capacity (limited donated resources), to run pre-commit testing by anyone >> and everyone. >> >> Today, trusted contributors can be handed tokens to ci-cassandra.a.o >> (make sure to label them so they can be revoked easily), but we still face >> the issue that too many pre-commit runs impacts the throughput and quality >> of the post-commit runs (though this has improved recently). >> >> It's on my wishlist to be able to: with a single command line; spin up >> the ci-cassandra.a.o stack on any k8s cluster, run any git sha through it >> and collect results, and tear it down. Variations on this would solve >> non-committers being able to repeat, use, and work on their own (or a >> separately donated) CI system, and folk/companies with money to be able to >> run their own ci-cassandra.a.o stacks for faster pre-commit turnaround >> time. Having this reproducibility of the CI system would make testing >> changes to it easier as well, so I'd expect a positive feedback loop here. >> >> I have some rough ideas on how to get started on this, if anyone would >> like to buddy up on it. >> >> >> >>