Re: [DISCUSS] Replace airlift/airline library with Picocli

2024-07-17 Thread Claude Warren, Jr via dev
My CRA arguments basically revolve around the "Open Source Steward" from the CRA. As far as I recall, for open source software to be used in commercial projects it must be maintained by a steward. The definition of steward is being discussed but foundations generally meet the requirement, project

Re: [DISCUSS] Replace airlift/airline library with Picocli

2024-07-16 Thread Claude Warren, Jr via dev
There are several reasons to consider updating, foremost in my mind is the changes coming as part of CRA in Europe. IANAL, but I don't think that non-maintained code will meet the CRA requirements, nor will code maintained by a single individual. Our best approach may be to try to get picocli me

Re: [VOTE] Release Apache Cassandra 5.0-rc1

2024-07-03 Thread Claude Warren, Jr via dev
issues that will be > corrected before an RC. > > How does that differ from beta in our lifecycle? API stable but a test bed > to suss out issues like this. > > > On Mon, Jul 1, 2024, at 9:30 AM, Claude Warren, Jr via dev wrote: > > Perhaps we should consider a Mil

Re: [VOTE] Release Apache Cassandra 5.0-rc1

2024-07-01 Thread Claude Warren, Jr via dev
Perhaps we should consider a Milestone release. At least in some projects this is a way to provide a test bed with known issues that will be corrected before an RC. On Sun, Jun 30, 2024 at 9:50 PM Jon Haddad wrote: > This came in after our vote, but we might also have a problem with > performin

Re: [DISCUSS] Stream Pipelines on hot paths

2024-06-13 Thread Claude Warren, Jr via dev
I brought this topic to commons-collections because we use some streaming in the Bloom filter implementation where we are very sensitive to processing time. I received this answer over there and thought I would bring the information here: You need to test it with some realistic data for a benchma

Re: [DISCUSS] CEP-42: Constraints Framework

2024-06-12 Thread Claude Warren, Jr via dev
> > 2) > Is part of an enum is somehow suplying the lack of enum types. Constraint > could be something like CONSTRAINT belongsToEnum([list of valid values], > field): > CREATE TABLE keyspace.table ( > field text CONSTRAINT belongsToEnum(['foo', 'foo2'], field), > ... > ); > 3) > Similarly, we

Re: [DISCUSS] CEP-40: Data Transfer Using Cassandra Sidecar for Live Migrating Instances

2024-05-01 Thread Claude Warren, Jr via dev
Alex, you write: > We can implement CEP-40 using a similar approach: we can leave the source > node as both a read and write target, and allow the new node to be a target > for (pending) writes. Unfortunately, this does not help with availability > (in fact, it decreases write availability, sinc

Re: discuss: add to_human_size function

2024-04-25 Thread Claude Warren, Jr via dev
27;) > > I think this is more flexible and we should funnel this via > DataStorageSpec and similar as you mentioned. > > In the future, we might also add to_human_duration which would be > implemented against DurationSpec so similar conversions are possible. > > On Fri, Apr 19

Re: discuss: add to_human_size function

2024-04-19 Thread Claude Warren, Jr via dev
I like the idea. Is the intention to have the of the function be parsable by the config parsers like DataRateSpec, DataStorageSpec, or DurationSpec? Claude On Thu, Apr 18, 2024 at 9:47 PM Ariel Weisberg wrote: > Hi, > > I think it’s a good quality of life improvement, but I am someone who > b

Re: [DISCUSS] CEP-40: Data Transfer Using Cassandra Sidecar for Live Migrating Instances

2024-04-18 Thread Claude Warren, Jr via dev
I think this solution would solve one of the problems that Aiven has with node replacement currently. Though TCM will probably help as well. On Mon, Apr 15, 2024 at 11:47 PM German Eichberger via dev < dev@cassandra.apache.org> wrote: > Thanks for the proposal. I second Jordan that we need more

Re: Default table compression defined in yaml.

2024-03-21 Thread Claude Warren, Jr via dev
t;"sstable" key sounds good to me. > > - - -- --- - - > Jacek Lewandowski > > > czw., 21 mar 2024 o 08:32 Claude Warren, Jr via dev < > dev@cassandra.apache.org> napisał(a): > >> Jacek, >> >> I am a bit confused here.

Re: Default table compression defined in yaml.

2024-03-21 Thread Claude Warren, Jr via dev
123` format in the cassandra.yaml file is >>> deprecated, and the new format is `foobar: 123KiB`. Is there a need to >>> introduce new settings entries with the deprecated format only to be >>> removed at a later version? >>> >>> >>> On 18/03/202

Re: Default table compression defined in yaml.

2024-03-19 Thread Claude Warren, Jr via dev
t;> deprecated, and the new format is `foobar: 123KiB`. Is there a need to >> introduce new settings entries with the deprecated format only to be >> removed at a later version? >> >> >> On 18/03/2024 14:39, Claude Warren, Jr via dev wrote: >> >> After m

Re: Default table compression defined in yaml.

2024-03-19 Thread Claude Warren, Jr via dev
precated format only to be > removed at a later version? > > > On 18/03/2024 14:39, Claude Warren, Jr via dev wrote: > > After much work by several people, I have pulled together the changes to > define the default compression in the cassandra.yaml file and have created > a pull

Default table compression defined in yaml.

2024-03-18 Thread Claude Warren, Jr via dev
After much work by several people, I have pulled together the changes to define the default compression in the cassandra.yaml file and have created a pull request [1]. If you are interested this in topic, please take a look at the changes and give at least a cursory review. [1] https://github.co

Patently invalid Compression parameters in CompressedSequentialWriterTest

2024-03-15 Thread Claude Warren, Jr via dev
I have been working at cleaning up the Yaml configuration for default table compression settings and found that the CompressedSequentialWriterTest uses some parameters that are outside the acceptable limits (like bufferLength not a power of 2, or maxCompressedLength > bufferLength). I can understa

[DISCUSS] Update cassandra-stress to use Apache Commons CLI (CASSANDRA-18661)

2024-03-08 Thread Claude Warren, Jr via dev
I have been working on CASSANDRA-18661 to see if it is possible to migrate to the Apache commons-cli as noted in the ticket. It is possible to do so, and after several pull requests to commons-cli, I have managed to migrate the settings of the stress tool. We will have to wait for commons-cli 1.7

Re: Call for Presentations closing soon: Community over Code EU 2024

2024-01-09 Thread Claude Warren, Jr via dev
Additionally, if you have a talk about some underlying technology that could be applicable across multiple projects submit it or a poster based on it. We are looking for good cross-project presentations. Claude Chair, Community over Code, EU 2024. On Mon, Jan 8, 2024 at 8:24 PM Paulo Motta wrot

Re: [DISCUSS] Replace Sigar with OSHI (CASSANDRA-16565)

2023-12-18 Thread Claude Warren, Jr via dev
The pull request is : https://github.com/apache/cassandra/pull/2842 On Mon, Dec 18, 2023 at 10:26 AM Mick Semb Wever wrote: > > > Can I get an another review/approval for the pull request? >> https://github.com/apache/cassandra/pull/2842/files >> > > > It is not clear on the ticket what is being

Re: [DISCUSS] Replace Sigar with OSHI (CASSANDRA-16565)

2023-12-17 Thread Claude Warren, Jr via dev
Can I get an another review/approval for the pull request? https://github.com/apache/cassandra/pull/2842/files On Fri, Dec 15, 2023 at 4:04 AM guo Maxwell wrote: > +1 too > > Mick Semb Wever 于2023年12月15日周五 10:01写道: > >> >> >> >>> >>> Are there objections to making this switch and adding a new d

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-12-14 Thread Claude Warren
Is there still interest in this? Can we get some points down on electrons so that we all understand the issues? While it is fairly simple to redirect the read/write to something other than the local system for a single node this will not solve the problem for tiered storage. Tiered storage w

[DISCUSS] Replace Sigar with OSHI (CASSANDRA-16565)

2023-12-14 Thread Claude Warren, Jr via dev
Greetings, I have submitted a pull request[1] that replaces the unsupported Sigar library with the maintained OSHI library. OSHI is an MIT licensed library that provides information about the underlying OS much like Sigar did. The change adds a dependency on oshi-core at the following coordinate

Re: Custom FSError and CommitLog Error Handling

2023-12-12 Thread Claude Warren, Jr via dev
I can see this as a strong improvement in Cassandra management and support it. +1 non binding On Mon, Dec 11, 2023 at 8:28 PM Raymond Huffman wrote: > Hello All, > > On our fork of Cassandra, we've implemented some custom behavior for > handling CommitLog and SSTable Corruption errors. Specific

Re: [DISCUSS] CASSANDRA-19104: Standardize tablestats formatting and data units

2023-12-04 Thread Claude Warren, Jr via dev
Why not change the option so that -H will operate as it does now while -Hn (where n is a digit) will limit the number of decimal places to n. On Mon, Dec 4, 2023 at 5:11 PM Brad wrote: > Thanks, Jacek. Using three significant digits for disk space is a good > suggestion. > > On Mon, Dec 4, 2023

Re: Immediately Deprecated Code

2023-11-01 Thread Claude Warren, Jr via dev
> For offline upgrades, we are aiming to maintain all compatibility. > > Take care when removing code, there are various (serdes) classes that look > like they are for other components but are also used in the storage engine. > > > > On Tue, 31 Oct 2023 at 18:42, Claude Warr

Re: Immediately Deprecated Code

2023-10-31 Thread Claude Warren, Jr via dev
you upgrade >> logic was triggered. >> >> (1) >> https://github.com/apache/cassandra/blob/trunk/test/distributed/org/apache/cassandra/distributed/upgrade/UpgradeTestBase.java#L97-L108 >> >> >> From: Claude Warren, Jr

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-31 Thread Claude Warren, Jr via dev
t to 20TB / node, easy scale up / > down, etc, and always-ZCS for non-object store deployments. > > Jon > > On 2023/09/25 06:48:06 "Claude Warren, Jr via dev" wrote: > > I have just filed CEP-36 [1] to allow for keyspace/table storage outside > of > > the sta

Re: Immediately Deprecated Code

2023-10-31 Thread Claude Warren, Jr via dev
sandra/blob/trunk/test/distributed/org/apache/cassandra/distributed/upgrade/UpgradeTestBase.java#L97-L108 > > ____ > From: Claude Warren, Jr via dev > Sent: Tuesday, October 31, 2023 10:57 > To: dev > Cc: Claude Warren, Jr > Subject: Immediatel

Immediately Deprecated Code

2023-10-31 Thread Claude Warren, Jr via dev
I was thinking about code that is used to migrate from one version to another. For example the code that rewrote the order of the hash values used for Bloom filters. That code was necessary for the version it was coded in. But the next version does not need that code because the next version is

Re: CASSANDRA-18775 (Cassandra supported OSs)

2023-10-25 Thread Claude Warren, Jr via dev
I closed 18775 as it did not seem reasonable after discussions here. I have been working on 16565 and have a pull request [1] and an experimental suite to show the differences. [2] [1] https://github.com/apache/cassandra/pull/2842 [2] https://github.com/Aiven-Labs/compare_oshi_sigar On Wed, Oc

Development Dependencies documentation.

2023-10-25 Thread Claude Warren, Jr via dev
I just had to change dependencies in Cassandra for the first time and I think the documentation [1] is out of date. First I think most of the file edits are in the ".build" directory. Adding jars to the "lib" directory works until calling "ant realclean", so perhaps the instructions should inclu

Re: CASSANDRA-16565

2023-10-25 Thread Claude Warren, Jr via dev
I ended up posting the code at https://github.com/Aiven-Labs/compare_oshi_sigar if anyone wants to take a look and see if they get differing results on various systems. On Tue, Oct 24, 2023 at 4:59 PM Brandon Williams wrote: > On Tue, Oct 24, 2023 at 7:48 AM Claude Warren, Jr via dev >

CASSANDRA-16565

2023-10-24 Thread Claude Warren, Jr via dev
I am working on https://issues.apache.org/jira/browse/CASSANDRA-16565 and have a small testing program that executes the sigar and equivalent OSHI methods to verify that they are the same. I would like to have this run on various platforms. I have tgz with all the libraries and code as well as a

CASSANDRA-18775 (Cassandra supported OSs)

2023-10-20 Thread Claude Warren, Jr via dev
I am looking at https://issues.apache.org/jira/browse/CASSANDRA-18775 and want to ensure that I do not remove too many libraries. I think that preserving any sigar library where the file name contains the word "linux" or "macosx" should be acceptable. This will preserve: libsigar-amd64-linux.so l

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-19 Thread Claude Warren, Jr via dev
8, 2023 at 10:06 AM guo Maxwell wrote: > > If it is ok for Henrik to rebase the Astra implementation of this > functionality (FileSystemProvider) onto Cassandra trunk. > > Then we can create a jira to move this forward for a small step. > > Claude Warren, Jr 于2023年10月18日周三 15:05写

Re: [DISCUSS] CommitLog default disk access mode

2023-10-18 Thread Claude Warren, Jr via dev
I think introducing the feature is a good idea. I also think that it should _NOT_ be enabled by default for all the reasons stated above. Finding a cohort of users who are interested in turning it on would provide a nice testbed to shake out any issues without affecting everyone. On Tue, Oct 17,

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-18 Thread Claude Warren, Jr via dev
other FileSystem implementations. I look forward to hearing from you, Claude On Wed, Oct 18, 2023 at 9:00 AM Claude Warren, Jr wrote: > After a bit more analysis and some testing I have a new branch that I > think solves the problem. [1] I have also created a pull request internal > to

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-18 Thread Claude Warren, Jr via dev
:28 AM Claude Warren, Jr wrote: > I have been exploring adding a second Path to the Cassandra File object. > The original path being the path within the standard Cassandra directory > tree and the second being a translated path when there is what was called a > ChannelProxy in place. &

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-10 Thread Claude Warren, Jr via dev
i, Sep 29, 2023 at 8:07 AM Claude Warren, Jr wrote: > Sorry I was out sick and did not respond yesterday. > > Henrik, How does your system work? What is the design strategy? Also is > your code available somewhere? > > After looking at the code some more I think that the best solut

multiple ParameterizedClass objects?

2023-10-03 Thread Claude Warren, Jr via dev
I have a case where I would like to be able to specify a collection of ParameterizedClass objects in the configuration file. Is there a standard way to do this? If not, does anyone have a suggestion for a clean way to implement it? Claude

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-28 Thread Claude Warren, Jr via dev
ific behaviors that > show up above the `ChannelProxy` layer that will depend on the backend. > > > > Ideally there would be some config to specify several backend > filesystems and their individual configuration that can be used, as well as > configuration and support fo

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-26 Thread Claude Warren, Jr via dev
backend > filesystems and their individual configuration that can be used, as well as > configuration and support for a "backend file router" for file creation > (and opening) that can be used to route files to the backend most > appropriate. > > > > Regards,

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-26 Thread Claude Warren, Jr via dev
which feels a bit customized for a certain user > and is not universal enough.Am I right ? > > Claude Warren, Jr 于2023年9月26日周二 14:36写道: > >> My intention is to develop an S3 storage system using >> https://github.com/carlspring/s3fs-nio >> >> There are several issues

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-25 Thread Claude Warren, Jr via dev
e discussing the whole final state, and not an implementation >> detail of one part of the final state? >> >> >> >> >> >> >> On Sun, Sep 24, 2023 at 11:49 PM Claude Warren, Jr via dev < >> dev@cassandra.apache.org> wrote: >> >>

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-25 Thread Claude Warren, Jr via dev
de, SSD, HDD, and data placement based > on requests. After briefly browsing the proposals, it seems that there are > some differences. Can you help to do some explain ? Thanks 。 > > > Claude Warren, Jr via dev 于2023年9月25日周一 > 14:49写道: > >> I have just filed CEP-36 [1]

[DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-24 Thread Claude Warren, Jr via dev
I have just filed CEP-36 [1] to allow for keyspace/table storage outside of the standard storage space. There are two desires driving this change: 1. The ability to temporarily move some keyspaces/tables to storage outside the normal directory tree to other disk so that compaction can o

Fixes for UDF NPE during restart.

2023-08-15 Thread Claude Warren, Jr via dev
CASSANDRA-18739 describes a reproducible NPE on restart with some UDFs. The solution outlined in that ticket was not used and a much simpler solution provided by Stefan Miklosovic was implemented. There are 2 pull requests open for Cassandra 4.0 and 4.1 that have the fairly simple fix as well as a

Re: [DISCUSSION] Shall we remove ant javadoc task?

2023-08-03 Thread Claude Warren, Jr via dev
I think that we can get more developers interested if there are available javadocs. While many of the core classes are not going to be touched by someone just starting, being able to understand what the external touch points are and how they interact with other bits of the system can be invaluable

[DISCUSS] Tiered Storage

2023-07-24 Thread Claude Warren, Jr via dev
I have been thinking about tiered storage wherein infrequently used data can be moved off to slow (cold) storage (like S3). I think that CEP-17 in conjunction with CEP-21 provides an opportunity for an interesting approach. As I understand it CEP-17 clarified the SSTables interface(s) so that alt

Re: Bloom filter calculation

2023-07-11 Thread Claude Warren, Jr via dev
positive rate of a bloom > filter of 20 bits with 10 entries, which means only 2 bits per entry? > > On 11 Jul 2023, at 07:14, Claude Warren, Jr via dev < > dev@cassandra.apache.org> wrote: > >  > Can someone explain to me how the Bloom filter table in > BloomFilter

Bloom filter calculation

2023-07-10 Thread Claude Warren, Jr via dev
Can someone explain to me how the Bloom filter table in BloomFilterCalculations was derived and how it is supposed to work? As I read the table it seems to indicate that with 14 hashes and 20 bits you get a fp of 6.71e-05. But if you plug those numbers into the Bloom filter calculator [1], that

Re: [DISCUSS] Bring cassandra-harry in tree as a submodule

2023-05-25 Thread Claude Warren, Jr via dev
Since the talk was not accepted for Cassandra Summit, would it be possible to record it as a simple youtube video and publish it so that the detailed information about how to use Harry is not lost? On Thu, May 25, 2023 at 7:36 AM Alex Petrov wrote: > While we are at it, we may also want to pull

[DISCUSS] Standalone downgrader

2023-05-15 Thread Claude Warren, Jr via dev
I have an open pull request [1] to merge in a standalone downgrader. The problem has been that between v3 and v4 there was a breaking change in the system local table where the columns "broadcast_port", "listen_port", and "rpc_port" were added. The code (in the current pull request) provides fun

[COMPRESSION PARAMETERS] Question

2023-04-19 Thread Claude Warren, Jr via dev
Currently the compression parameters has an option called enable. When enable=false all the other options have to be removed. But it seems to me that we should support enabled=false without removing all the other parameters so that users can disable the compression for testing or problem resoluti

[DISCUSS] Initial implementation of cassandra-conf with nodetool example

2023-04-17 Thread Claude Warren, Jr via dev
The pull request [1] is a proposed fix for CASSANDRA-17773. I am looking for comments and a decision as to whether to move forward or not with this change. The goal is to remove much of the boiler-plate code from scripts without changing their functionality or arguments and to add the ability to

[DISCUSS] Moving standard boiler plate script blocks.

2023-03-22 Thread Claude Warren, Jr via dev
I would like to get some more eyes on https://github.com/apache/cassandra/pull/1950/files wich arises from CASSANDRA-17773 The basic idea is to: - Move the boiler plate script code to a single sourced file. - Add code to make debugging scripts easier, this in response to CASSANDRA-17773

Re: Downgradability

2023-02-23 Thread Claude Warren, Jr via dev
y the price for it. >> I simply prefer that we take the time to do things right. >> >> Thanks to Scott and you, downgradability got a much better visibility so >> no matter what approach we pick, I am convinced that we will get there. >> >> Le jeu. 23 févr.

[DISCUSS] Single boilerplate script

2023-02-23 Thread Claude Warren, Jr via dev
Pull request https://github.com/apache/cassandra/pull/1950/files is an attempt to move the boilerplate coding from the script files into a single maintainable file. This change does 4 things: 1. Moves the standard boiler plate from the standard scripts into a single maintainable script to b

Re: Downgradability

2023-02-23 Thread Claude Warren, Jr via dev
Broken downgrading can be fixed (I think) by modifying the SearializationHeader.toHeader() method where it currently throws an UnknownColumnException. If we can, instead of throwing the exception, create a dropped column for the unexpected column then I think the code will work. I realise that to

Re: Removing columns from sstables

2023-02-22 Thread Claude Warren, Jr via dev
Close. It is still in the table so the v3.x code that reads system.local will detect it and fail on an unknown column as that code appears to be looking at the actual on-disk format. It sounds like the short answer is that there is no way to physically remove the column from the on-disk format on

Removing columns from sstables

2023-02-22 Thread Claude Warren, Jr via dev
Greetings, I have been looking through the code and I can't find any place where columns are removed from an sstable. I have found that rows can be deleted. Columns can be marked as deleted. But I have found no place where the deleted cell is removed from the row. Is there the concept of comp

Re: Downgradability

2023-02-21 Thread Claude Warren, Jr via dev
My goal in implementing CASSANDRA-8928 was to be able to take the current version 4.x and write it as the earliest 3.x version possible. The reasoning being that if that was possible then whatever 3.x version was executed

downgrade sstables

2023-02-20 Thread Claude Warren, Jr via dev
I have been working on downgrading sstables for awhile now. I have the downgrader mostly working. The only issue is when downgrading system tables. Specifically during the 3.1 -> 4.0 changes a column broadcast_port was added to system/local. This means that 3.1 system can not read the table as

Re: Cassandra CI Status 2023-01-07

2023-02-10 Thread Claude Warren, Jr via dev
New Failures from Build Lead Week 5 *** CASSANDRA-18198 - "AttributeError: module 'py' has no attribute 'io'" reported in multiple tests - reported in 4.1, 3.11, and 3.0 - identified as a possible class loader issue associated with CASSANDRA-18150 *** CASSANDRA-18191 - Native Transport SSL tests

Re: [DISCUSSION] Framework for Internal Collection Exposure and Monitoring API Alignment

2023-01-30 Thread Claude Warren, Jr via dev
Actually, Maxim's proposal does not depend on JMX being present or not. What the proposal does is make it easier to create/sync multiple presentations of the same internal data: Virtual Tables, JMX, Metrics, next year's greatest data presentation strategy. Removing JMX from the mix just reduces

Re: [DISCUSSION] Cassandra's code style and source code analysis

2023-01-27 Thread Claude Warren, Jr via dev
Turn it on at warning (or lower) level now, so people have some idea of the size of change to their current code. On Wed, Jan 25, 2023 at 12:05 PM Miklosovic, Stefan < stefan.mikloso...@netapp.com> wrote: > Thank you Maxim for doing this. > > It is nice to see this effort materialized in a PR. >

Upgrading sstables and default partitioner.

2023-01-26 Thread Claude Warren, Jr via dev
Greetings, I am working on porting a fix for table upgrade order into V3.0 and have come across the following issue: ERROR 10:23:31 Cannot open /home/claude/apache/cassandra/build/test/cassandra/data/system_schema/keyspaces-abac5682dea631c5b535b3d6cffd0fb6/me-89-big; partitioner org.apache.cassan

Re: [DISCUSS] Clear rules about sstable versioning and downgrade support

2023-01-16 Thread Claude Warren, Jr via dev
What does this mean for the Trie sstable format? Would it perhaps make sense to version the sstable upgrader (and future downgrader) based on the highest version they understand? for example sstableupgrader version N will handle the n? versions so it can upgrade from m? while sstabledowngrader ve

Re: upgrade sstable selection

2023-01-10 Thread Claude Warren, Jr via dev
on > > On Tue, Jan 10, 2023 at 4:44 AM Claude Warren, Jr via dev > wrote: > > > > Greetings, > > > > I am working on the downgradesstables code and seem to have a problem > with ordering of the downgrade or perhaps the Directories.SSTableLister > > > >

upgrade sstable selection

2023-01-10 Thread Claude Warren, Jr via dev
Greetings, I am working on the downgradesstables code and seem to have a problem with ordering of the downgrade or perhaps the Directories.SSTableLister I lifted the code from upgradesstables to select the files to downgrade. The only difference in the code that selects the files to downgrade is

Weird results

2022-12-15 Thread Claude Warren, Jr via dev
I am working on a StandaloneDowngrader.java based on StandaloneUpgrader.java While working on the tests I had a problem with 2 test (testFlagArgs and testDefaultCall) that failed with: ERROR [main] 2022-12-14 10:35:20,051 SSTableReader.java:496 - Cannot open /home/claude/apache/cassandra/build/te

Re: Aggregate functions on collections, collection functions and MAXWRITETIME

2022-12-09 Thread Claude Warren, Jr via dev
th >>> the collection functions, so they can entirely replace MAXWRITE. >>> >>> However I wouldn't be against leaving the collection functions working >>> only on collections, as originally designed, and as they currently are on >>> trunk. The

Review requested: Add downgradesstables

2022-12-09 Thread Claude Warren, Jr via dev
https://github.com/apache/cassandra/pull/2045 https://issues.apache.org/jira/browse/CASSANDRA-8928 This is a work in progress and I am looking for some feedback. This fix appears to work correctly. But I think the placement of the v3 directory is probably not the best and perhaps should be moved

Re: [DISCUSSION] Cassandra's code style and source code analysis

2022-12-09 Thread Claude Warren, Jr via dev
StarImport rule on the documentation page [1]. > > > > [1] https://cassandra.apache.org/_/development/code_style.html > [2] https://issues.apache.org/jira/browse/CASSANDRA-17925 > [3] https://issues.apache.org/jira/browse/CASSANDRA-18089 > [4] https://github.com/apache/cassandra

Re: Aggregate functions on collections, collection functions and MAXWRITETIME

2022-12-08 Thread Claude Warren, Jr via dev
I think the semantics of the situation is important here. Let’s take MAX as our example aggregate function.. We all expect that in a DB context MAX(column) will return the value of the column with the maximum value. That is the expected semantics of MAX. The question here is that there are d

Re: [DISCUSSION] Cassandra's code style and source code analysis

2022-12-01 Thread Claude Warren, Jr via dev
cord, I was testing that same combo Claude mentioned and >>> it did not work out of the box but it is definitely possible to set up >>> successfully. I do not remember the details. >>> >> >>> >> To replay to Maxim, it all seems good to me, roughly, but I humbly &g

Re: [DISCUSSION] Cassandra's code style and source code analysis

2022-11-25 Thread Claude Warren, Jr via dev
+1 for the concept as a whole. I am certain I could find nits to pick if I looked deeply. @mck -- I did have a problem with Cassandra + Eclipse + Java11 (Classpath). I gave up and am spending time trying to learn IntelliJ. I also mentioned it in one of the discussion areas. Claude On Thu, Nov

Re: A proposal for refactoring the CircleCI config

2022-11-11 Thread Claude Warren, Jr via dev
I have been working on https://issues.apache.org/jira/projects/CASSANDRA/issues/CASSANDRA-18012 which modifies the generate.sh script for the circleci configurations. Perhaps all of this should be rolled into one change? On Fri, Nov 11, 2022 at 3:47 AM Ekaterina Dimitrova wrote: > Hey Derek, > T

Re: [DISSCUSS] Access to JDK internals only after dev mailing list consensus?

2022-11-07 Thread Claude Warren, Jr via dev
This change looks good to me. It is clear and concise. On Fri, Nov 4, 2022 at 9:50 PM Ekaterina Dimitrova wrote: > 👋 > > I finally got the chance to put down a proposal for a section at the end > of the Cassandra Code Style document. > Please help a fellow non-native speaker and definitely not

[DISCUSS] WIP pull request for CASSANDRA-17773

2022-10-26 Thread Claude Warren, Jr via dev
Greetings, I have created a pull request [1] as a work in progress for CASSANDRA-17773. The idea here is to move the boiler plate include of cassandra.in.sh and cassandra-env.sh into one sourced script that also provides a function (verify_env) to verify that required variables are set. verify_e

Re: CEP-23: Enhancement for Sparse Data Serialization

2022-10-25 Thread Claude Warren, Jr via dev
less it's > gone to a vote and/or a vote has passed. > > If someone else wants to pick it up later that's fine. > > On Tue, Oct 25, 2022, at 7:35 AM, Claude Warren, Jr via dev wrote: > > I would like to discard CEP-23. As I am the proposer, is a vote required? > > What is the process? > > Claude > > >

CEP-23: Enhancement for Sparse Data Serialization

2022-10-25 Thread Claude Warren, Jr via dev
I would like to discard CEP-23. As I am the proposer, is a vote required? What is the process? Claude

Re: CEP-21 and complete cluster replacement.

2022-10-20 Thread Claude Warren, Jr via dev
a good idea. > > Transactional metadata will make the bootstrapping (and decommissioning) > experience a whole lot more stable and predictable, so in the short term I > would expect the recommended rolling approach to upgrades would improve > significantly. > > > > On 20 Oct

CEP-21 and complete cluster replacement.

2022-10-20 Thread Claude Warren, Jr via dev
After CEP-21 would it be possible to take a cluster of 6 nodes, spin up 6 new nodes to duplicate the 6 existing nodes and then spin down the original 6 nodes. Basically, I am thinking of the case where a cluster is running version x.y.z and want to run x.y.z+1, can they spin up an equal number of

Re: [Discuss] CEP-24 Password validation and generation

2022-10-19 Thread Claude Warren, Jr via dev
Just to clarify, I have no objections to the current plan. On Thu, Oct 13, 2022 at 2:56 PM Claude Warren, Jr wrote: > I am not familiar with the Diagnostics framework but it sounds like it > would satisfy the need. Thanks for pointing it out. I will dive into it > to get an underst

Re: [Discuss] CEP-24 Password validation and generation

2022-10-13 Thread Claude Warren, Jr via dev
tics events would satisfy your needs? > > Regards > > ____ > From: Claude Warren, Jr via dev > Sent: Thursday, October 13, 2022 14:43 > To: dev@cassandra.apache.org > Subject: Re: [Discuss] CEP-24 Password validation and generation > > NetApp Security WARNING: This is an ex

Re: [Discuss] CEP-24 Password validation and generation

2022-10-13 Thread Claude Warren, Jr via dev
suggested would probably be sufficient." > > Yes, no problem with logging from Guardrail directly. > > (1) > https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-24%3A+Password+validation+and+generation#CEP24:Passwordvalidationandgeneration-Validationofanewpasswordagainstprev

Re: [Discuss] CEP-24 Password validation and generation

2022-10-13 Thread Claude Warren, Jr via dev
ctionaries. This might be included in > the CEP but I would keep it out for the very first implementation and it > can be finished afterwards in some other commit. I do not find it > absolutely necessary to do it right now. > > Regards, > > Stefan > > __

Fwd: [Discuss] CEP-24 Password validation and generation

2022-10-13 Thread Claude Warren, Jr via dev
I managed not to send this to the mailaing list... I don't know the govt spec. but there is a US govt security level where you are not allowed to inform the user why the login failed. It seems to me that there are 2 intertwined components being discussed. 1) A component to perform a user passw

Committer needed for Deprecate Throwables.propagate usage

2022-09-20 Thread Claude Warren via dev
I made the necessary fixes to remove the deprecated Throwables.propagate calls.  However, I need a committer to review. https://issues.apache.org/jira/browse/CASSANDRA-14218 Thank you, Claude

Re: [DISCUSS] CEP-23: Enhancement for Sparse Data Serialization

2022-09-07 Thread Claude Warren via dev
engage in discussion it’s unclear to me what advantage this encoding scheme brings. It might be worth outlining what algorithmic advantage you foresee for what data distributions in which collection types. On 6 Sep 2022, at 07:16, Claude Warren via dev wrote: I am just learning the ropes

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-09-07 Thread Claude Warren via dev
but no one impedes us to also have 3) if we get to have projected views. However, I think that projected views is a new general-purpose feature with its own complexities, so it would deserve its own CEP, if someone is willing to work on the implementation. On Wed, 31 Aug 2

Re: [DISCUSS] CEP-23: Enhancement for Sparse Data Serialization

2022-09-05 Thread Claude Warren via dev
efficiency improvement. The CEP guidelines say a CEP is encouraged “for significant user-facing or changes that cut across multiple subsystems”. Any reason why a Jira isn’t sufficient? Abe On Sep 5, 2022, at 1:57 AM, Claude Warren via dev wrote: I have just posted a CEP covering an Enhancement

[DISCUSS] CEP-23: Enhancement for Sparse Data Serialization

2022-09-05 Thread Claude Warren via dev
I have just posted a CEP  covering an Enhancement for Sparse Data Serialzation.  This is in response to CASSANDRA-8959 I look forward to responses.

Re: [DISCUSS] LWT UPDATE semantics with + and - when null

2022-08-31 Thread Claude Warren via dev
I like this approach.  However, in light of some of the discussions on view and the like perhaps the function is  (column value as returned by select ) + 42 So a null counter column becomes 0 before the update calculation is applied. Then any null can be considered null unless addressed by IfN

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-31 Thread Claude Warren via dev
Is there enough support here for VIEWS to be the implementation strategy for displaying masking functions? It seems to me the view would have to store the query and apply a where clause to it, so the same PK would be in play. It has data leaking properties. It has more use cases as it can be

Re: [DISCUSS] CEP-21: Transactional Cluster Metadata

2022-08-24 Thread Claude Warren, Jr via dev
Should (**) It may seem counterintuitive, that A is being written to even after > we've stopped reading from it. This is done in order to guarantee that by > the time we stop writing to the node giving up the range, there is no > coordinator that may attempt reading from it without learning about

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-24 Thread Claude Warren, Jr via dev
queries before they were returned. On Wed, Aug 24, 2022 at 9:29 AM Claude Warren, Jr wrote: > The PCI DSS Standard v4_0 > <https://docs-prv.pcisecuritystandards.org/PCI%20DSS/Standard/PCI-DSS-v4_0.pdf> > requires > that credit card numbers stored on the system must be &quo

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-24 Thread Claude Warren, Jr via dev
The PCI DSS Standard v4_0 requires that credit card numbers stored on the system must be "rendered unreadable", thus this proposal is _NOT_ a good way to protect credit card numbers. In fact, for any critically sensiti

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-24 Thread Claude Warren, Jr via dev
This seems to me to be a client display filter, applied at the last moment as data are streaming back to the client. It has no impact on any keys, queries or secondary internal index or materialized view. It simply prevents the display from showing the complete value. It does not preclude determ

  1   2   >