[jira] [Commented] (CASSANDRA-18673) Reduce size of per-SSTable index components

Mike Adamson (Jira) Tue, 01 Aug 2023 07:47:46 -0700


    [ 
https://issues.apache.org/jira/browse/CASSANDRA-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17749857#comment-17749857
 ]


Mike Adamson commented on CASSANDRA-18673:
------------------------------------------

[~maedhroz] I have attached a new PR to this ticket. This patch does the 
following:
 * Removes the primary key trie on-disk component
 * Adds a partition sizes on-disk component
 * Adds a partitionedSeekToTerm to SortedTermsReader.Cursor
 * Creates separate SkinnyRowAwarePrimaryKeyMap and WideRowAwarePrimaryKeyMap
   components

 

> Reduce size of per-SSTable index components
> -------------------------------------------
>
>                 Key: CASSANDRA-18673
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18673
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Feature/SAI
>            Reporter: Mike Adamson
>            Assignee: Mike Adamson
>            Priority: Urgent
>          Time Spent: 6.5h
>  Remaining Estimate: 0h
>
> The current per-SSTable index components are large because the primary keys
> stored in them include the token as part of the byte comparable. The byte
> comparable puts the token first, meaning that we get very little prefix
> compression from either the trie or the sorted terms store.
> We can fix this by removing the token from the primary key serialization,
> which would allow us to get prefix compression from the trie and the
> sorted terms store.
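
The prefix-compression argument in the description can be illustrated with a
rough sketch. It is not taken from the patch and does not model the real SAI
on-disk layout: fake_token, the "sensor-" partition keys, and the use of MD5 as
a stand-in for the partitioner hash are all assumptions made for illustration.
It only measures how many leading bytes adjacent byte-sorted keys share when a
hash-like token is placed first versus when it is left out.

```python
import hashlib


def shared_prefix_len(a: bytes, b: bytes) -> int:
    """Count the leading bytes that a and b have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def average_shared_prefix(keys) -> float:
    """Average shared-prefix length between neighbours in byte-sorted order."""
    ordered = sorted(keys)
    pairs = list(zip(ordered, ordered[1:]))
    return sum(shared_prefix_len(a, b) for a, b in pairs) / len(pairs)


def fake_token(partition_key: bytes) -> bytes:
    # Hypothetical stand-in for a partitioner token: 8 bytes of an MD5 digest.
    # This is NOT Cassandra's Murmur3 token, only something hash-like.
    return hashlib.md5(partition_key).digest()[:8]


# Hypothetical partition keys with a lot of natural common prefix.
partition_keys = [f"sensor-{i:06d}".encode() for i in range(1000)]

# Token-first serialization: the hash-like bytes lead every key.
with_token = [fake_token(k) + k for k in partition_keys]
# Token removed from the serialization: only the key bytes remain.
without_token = partition_keys

avg_with = average_shared_prefix(with_token)
avg_without = average_shared_prefix(without_token)

print(f"avg shared prefix with token first: {avg_with:.2f} bytes")
print(f"avg shared prefix without token:    {avg_without:.2f} bytes")
```

Because the leading token bytes are effectively random, neighbouring keys in
the token-first form tend to diverge within the first byte or two, so a trie or
a prefix-compressed sorted store has little to share. Without the token, the
common "sensor-" prefix is available for compression.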



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

