[ https://issues.apache.org/jira/browse/HIVE-17669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16190622#comment-16190622 ]

Mithun Radhakrishnan commented on HIVE-17669:
---------------------------------------------

bq. weight based eviction could be a better approach (weight can be length of string).

Ah, that's an interesting suggestion. Shouldn't we also consider the cost of deserializing the sarg-string? On the one hand, perhaps the longer sarg-strings take longer to deserialize, and might benefit from caching. But on the other, they might dominate the cache. :/

I'll have to think this through. Any recommendation on the value for {{CacheBuilder.maximumWeight()}}? :]

> Cache to optimize SearchArgument deserialization
> ------------------------------------------------
>
>                 Key: HIVE-17669
>                 URL: https://issues.apache.org/jira/browse/HIVE-17669
>             Project: Hive
>          Issue Type: Improvement
>          Components: ORC, Query Processor
>    Affects Versions: 2.2.0, 3.0.0
>            Reporter: Mithun Radhakrishnan
>            Assignee: Mithun Radhakrishnan
>         Attachments: HIVE-17699.1.patch, HIVE-17699.2.patch
>
>
> And another, from [~selinazh] and [~cdrome]. (YHIVE-927)
> When a mapper needs to process multiple ORC files, it might land up having
> to use essentially the same {{SearchArgument}} over several files. It would be
> good not to have to deserialize from string, over and over again. Caching the
> object against the string-form should speed things up.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
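
To make the trade-off above concrete, here is a minimal, JDK-only sketch of weight-based eviction, with the length of the sarg-string as the weight. It is not the code in the attached patches, and it only approximates what Guava's {{CacheBuilder.maximumWeight()}} with a weigher would do. The class name {{WeightedSargCache}}, the {{deserializer}} function and the weight limit are all hypothetical, chosen for illustration.

```java
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

/**
 * Hypothetical sketch: an LRU cache bounded by the total length of its
 * keys (the serialized sarg-strings) rather than by the number of entries.
 */
class WeightedSargCache<V> {
    private final long maxWeight;
    // Stand-in for the real deserialization routine (an assumption here).
    private final Function<String, V> deserializer;
    // accessOrder=true keeps the least recently used entry first.
    private final LinkedHashMap<String, V> entries =
        new LinkedHashMap<>(16, 0.75f, true);
    private long totalWeight = 0;
    private int loads = 0;

    WeightedSargCache(long maxWeight, Function<String, V> deserializer) {
        this.maxWeight = maxWeight;
        this.deserializer = deserializer;
    }

    synchronized V get(String sargString) {
        V cached = entries.get(sargString); // a hit also refreshes recency
        if (cached != null) {
            return cached;
        }
        V value = deserializer.apply(sargString);
        loads++;
        long weight = sargString.length();
        if (weight > maxWeight) {
            // Heavier than the whole budget: return it without caching,
            // so a single huge string cannot flush everything else.
            return value;
        }
        // Evict least recently used entries until the new one fits.
        Iterator<Map.Entry<String, V>> it = entries.entrySet().iterator();
        while (totalWeight + weight > maxWeight && it.hasNext()) {
            Map.Entry<String, V> eldest = it.next();
            totalWeight -= eldest.getKey().length();
            it.remove();
        }
        entries.put(sargString, value);
        totalWeight += weight;
        return value;
    }

    synchronized int size() { return entries.size(); }
    synchronized long totalWeight() { return totalWeight; }
    synchronized int loadCount() { return loads; }
}
```

With a budget of 10 characters, two 4-character strings fit together, but inserting a 6-character string evicts the least recently used of them. That is the "long strings dominate the cache" effect in miniature: one long entry costs as much room as several short ones, even though it may also be the most expensive to deserialize again.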