[
https://issues.apache.org/jira/browse/KUDU-2725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16783828#comment-16783828
]
Will Berkeley commented on KUDU-2725:
-------------------------------------
One can workaround this problem by increasing the target rowset size
{{--budgeted_compaction_target_rowset_size}}.
> RollingDiskRowSetWriter create rowsets that are bigger than the target rowset
> size
> ----------------------------------------------------------------------------------
>
> Key: KUDU-2725
> URL: https://issues.apache.org/jira/browse/KUDU-2725
> Project: Kudu
> Issue Type: Improvement
> Affects Versions: 1.9.0
> Reporter: Will Berkeley
> Priority: Major
>
> The diskrowset writer create rowsets that are bigger than the target rowset
> size, with the excess proportional to the number of columns that compress
> poorly. For example, modifying loadgen to create a table with 280 columns and
> then using the {{--use_random}} flag, I saw rowsets that were in excess of
> 80MB. This is a problem because the budget for compactions is 128MB, so
> rowsets that are that big can never participate in a compaction.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)