[ https://issues.apache.org/jira/browse/HIVE-17669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16205020#comment-16205020 ]
Lefty Leverenz commented on HIVE-17669: --------------------------------------- The new configuration parameter is documented in the wiki here (thanks, Mithun): * [hive.io.sarg.cache.max.weight.mb | https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.io.sarg.cache.max.weight.mb] But the fix versions should include 2.3.1. I've changed that in the wiki and made some trivial edits. > Cache to optimize SearchArgument deserialization > ------------------------------------------------ > > Key: HIVE-17669 > URL: https://issues.apache.org/jira/browse/HIVE-17669 > Project: Hive > Issue Type: Improvement > Components: ORC, Query Processor > Affects Versions: 2.2.0, 3.0.0 > Reporter: Mithun Radhakrishnan > Assignee: Mithun Radhakrishnan > Fix For: 3.0.0, 2.4.0, 2.2.1 > > Attachments: HIVE-17669.3.patch, HIVE-17669.4.patch, > HIVE-17699.1.patch, HIVE-17699.2.patch > > > And another, from [~selinazh] and [~cdrome]. (YHIVE-927) > When a mapper needs to process multiple ORC files, it might land up having > use essentially the same {{SearchArgument}} over several files. It would be > good not to have to deserialize from string, over and over again. Caching the > object against the string-form should speed things up. -- This message was sent by Atlassian JIRA (v6.4.14#64029)