[ https://issues.apache.org/jira/browse/HIVE-14451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15454101#comment-15454101 ]
Matt McCline commented on HIVE-14451: ------------------------------------- This should improve performance for TEXT (LazySimple) and non-TEXT (BinarySortable) [~gopalv] [~ndembla] Thank you Gopal for the observing this improvement possibility. > Vectorization: Add byRef mode for borrowed Strings in VectorDeserializeRow > -------------------------------------------------------------------------- > > Key: HIVE-14451 > URL: https://issues.apache.org/jira/browse/HIVE-14451 > Project: Hive > Issue Type: Improvement > Components: Vectorization > Reporter: Gopal V > Assignee: Matt McCline > Attachments: HIVE-14451.01.patch, HIVE-14451.02.patch > > > In a majority of cases, when using the OptimizedHashMap, the references to > the byte[] are immutable. > The hashmap result always allocates on boundary conditions, but never mutates > a previous buffer. > Copying Strings out of the hashtable is entirely wasteful and it would be > easy to know when the currentBytes is a borrowed slice from the original > input. -- This message was sent by Atlassian JIRA (v6.3.4#6332)