[ https://issues.apache.org/jira/browse/HIVE-15664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15831527#comment-15831527 ]
Matt McCline commented on HIVE-15664:
-------------------------------------

So, it looks like you have a sparse-column input VRB from the table, but you need to cache the data ORC-style in non-sparse form so that you share columns with the destination (write) VRB.

> LLAP text cache: improve first query perf I
> -------------------------------------------
>
>                 Key: HIVE-15664
>                 URL: https://issues.apache.org/jira/browse/HIVE-15664
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>         Attachments: HIVE-15664.patch, HIVE-15664.WIP.patch
>
>
> 1) Don't use ORC dictionary.
> 2) Use VectorDeserialize.
> 3) Don't parse the columns that are not included (cannot avoid reading them).
> -4) Send VRB to the pipeline and write ORC in parallel (in background)-.
> HIVE-15672
> Also add an option to disable the encoding pipeline server-side.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)