[
https://issues.apache.org/jira/browse/CASSANDRA-17021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yifan Cai updated CASSANDRA-17021:
----------------------------------
Authors: Yifan Cai (was: Yifan Cai)
Test and Documentation Plan: unit test
Status: Patch Available (was: In Progress)
PR: https://github.com/apache/cassandra/pull/4399
Patch Summary
- Core Implementation: ZstdDictionaryCompressor, CompressionDictionaryManager,
trainer, scheduler, and cache components
- Integration: Modified SSTable writers, compression metadata, and schema
handling to support dictionary-based compression
- Management: Added nodetool command TrainCompressionDictionary and JMX
interface for dictionary management
- Configuration: Extended CompressionParams and table schema to include
dictionary training parameters
- Test coverage: new unit tests and integration tests for compression and
dictionary management.
> Enhance Zstd support in Cassandra with dictionaries
> ---------------------------------------------------
>
> Key: CASSANDRA-17021
> URL: https://issues.apache.org/jira/browse/CASSANDRA-17021
> Project: Apache Cassandra
> Issue Type: Improvement
> Components: Feature/Compression
> Reporter: Dinesh Joshi
> Assignee: Yifan Cai
> Priority: Normal
>
> Currently Cassandra supports zstd compression. However, Zstd also supports
> dictionaries to enhance not only the compression ratio but also the speed.
> Dictionaries can show 3-4x savings. We should add support to train
> dictionaries, ideally per SSTable this will yield the maximum gains.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]