[ 
https://issues.apache.org/jira/browse/HIVE-26685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17627846#comment-17627846
 ] 

James Petty commented on HIVE-26685:
------------------------------------

Since this is my first contribution to the Hive project, I'm not sure what the 
preferred process should be for targeting this change at the different 
applicable branches. Please let me know whether targeting the pull request at 
the master branch is sufficient or whether I need to take some extra steps to 
have this patch included in the other release branches.

> Improve Path name escaping / unescaping performance
> ---------------------------------------------------
>
>                 Key: HIVE-26685
>                 URL: https://issues.apache.org/jira/browse/HIVE-26685
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>    Affects Versions: All Versions
>            Reporter: James Petty
>            Priority: Minor
>              Labels: pull-request-available
>         Attachments: HIVE-26685.1.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When escaping / unescaping partition path part names, the existing logic 
> incurs significant avoidable overhead by copying each character sequentially 
> into a new StringBuilder even when no escaping/unescaping is necessary as 
> well as using String.format to escape characters inside of the inner loop.
>  
> The included patch to improve the performance of these operations refactors 
> two static method implementations, but requires no external API surface or 
> user-visible behavior changes. This change is applicable and portable to a 
> wide range of Hive versions from branch-0.6 onward when the initial method 
> implementations were added.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to