[ https://issues.apache.org/jira/browse/HIVE-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136224#comment-15136224 ]
Matt McCline commented on HIVE-12878:
-------------------------------------

The submitted patch is an experiment. It (temporarily) changes the defaults of these configuration variables to force vectorization of many queries:

{code}
hive.fetch.task.conversion=none
hive.vectorized.execution.enabled=true
{code}

New configuration variables are set so that every vectorized query either uses the new vectorized deserialization for LazySimpleSerDe (i.e. TEXTFILE) and LazyBinarySerDe, or deserializes row by row to fill a VectorizedRowBatch:

{code}
hive.vectorized.use.vectorized.input.format=false
hive.vectorized.use.vector.serde.deserialize=true
hive.vectorized.use.row.serde.deserialize=true
{code}

So MapWork tasks should no longer fail to vectorize because of the input file format (except for ACID, which is only permitted with a vectorized input format...).

> Support Vectorization for TEXTFILE and other formats
> ----------------------------------------------------
>
> Key: HIVE-12878
> URL: https://issues.apache.org/jira/browse/HIVE-12878
> Project: Hive
> Issue Type: New Feature
> Components: Hive
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Critical
> Attachments: HIVE-12878.01.patch
>
>
> Support vectorizing when the input format is TEXTFILE and other formats for
> better Map Vertex performance.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
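
For anyone who wants to try the same combination in a single session instead of changing the defaults, the following is a minimal sketch that renders the settings listed in the comment as Hive `SET` statements. The helper name `render_set_statements` and the dictionary name are hypothetical, and the three `hive.vectorized.use.*` properties are assumed to exist only in a build that includes the HIVE-12878 patch.

```python
# Settings copied from the comment on HIVE-12878. The three
# hive.vectorized.use.* keys are assumed to be available only when the
# experimental patch is applied.
EXPERIMENT_SETTINGS = {
    "hive.fetch.task.conversion": "none",
    "hive.vectorized.execution.enabled": "true",
    "hive.vectorized.use.vectorized.input.format": "false",
    "hive.vectorized.use.vector.serde.deserialize": "true",
    "hive.vectorized.use.row.serde.deserialize": "true",
}


def render_set_statements(settings):
    """Return one Hive `SET key=value;` line per setting, in insertion order."""
    return ["SET {}={};".format(key, value) for key, value in settings.items()]


if __name__ == "__main__":
    # Print a block that could be pasted at the top of a Hive script.
    print("\n".join(render_set_statements(EXPERIMENT_SETTINGS)))
```

The output can be pasted at the top of a query script, which keeps the experiment scoped to one session.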