[ 
https://issues.apache.org/jira/browse/KUDU-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17649548#comment-17649548
 ] 

ASF subversion and git services commented on KUDU-3367:
-------------------------------------------------------

Commit ad920e69fcd67ceefa25ea81a38a10a27d9e3afc in kudu's branch 
refs/heads/master from kedeng
[ https://gitbox.apache.org/repos/asf?p=kudu.git;h=ad920e69f ]

KUDU-3367 [compaction] add supplement to gc algorithm

If we get a REDO delta full of delete ops, which means there is
not a single update operation in the delta. The current compaction
algorithm doesn't run GC on such deltamemstores. The accumulation
of deltamemstores like that negatively affects performance of scan
operations.

This patch as a supplement to KUDU-1625, we could  release storage
space for old tablet metadata that does not support the live count
function. See KUDU-3367 for details.

Change-Id: I8b26737dffecc17688b42188da959b2ba16351ed
Reviewed-on: http://gerrit.cloudera.org:8080/18503
Reviewed-by: Alexey Serbin <ale...@apache.org>
Tested-by: Alexey Serbin <ale...@apache.org>


> Delta file with full of delete op can not be schedule to compact
> ----------------------------------------------------------------
>
>                 Key: KUDU-3367
>                 URL: https://issues.apache.org/jira/browse/KUDU-3367
>             Project: Kudu
>          Issue Type: New Feature
>          Components: compaction
>            Reporter: dengke
>            Assignee: dengke
>            Priority: Major
>         Attachments: image-2022-05-09-14-13-16-525.png, 
> image-2022-05-09-14-16-31-828.png, image-2022-05-09-14-18-05-647.png, 
> image-2022-05-09-14-19-56-933.png, image-2022-05-09-14-21-47-374.png, 
> image-2022-05-09-14-23-43-973.png, image-2022-05-09-14-26-45-313.png, 
> image-2022-05-09-14-32-51-573.png, image-2022-11-14-11-02-33-685.png
>
>
> If we get a REDO delta with full of delete op, wich means there is no update 
> op in the file. The current compact algorithm will not schedule the file do 
> compact. If such files exist, after accumulating for a period of time, it 
> will greatly affect our scan speed. However, processing such files every time 
> compact reducesĀ  compact's performance.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to