[ 
https://issues.apache.org/jira/browse/HIVE-15129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajesh Balamohan updated HIVE-15129:
------------------------------------
    Description: 
When multiple queries are run in LLAP, stripe metadata cache misses were 
observed even though enough memory was available. 
https://github.com/apache/hive/blob/master/llap-server/src/java/org/apache/hadoop/hive/llap/io/encoded/OrcEncodedDataReader.java#L655.
 Even in cases when data was found in cache, it wasn't getting used as 
{{globalnc}} changed from query to query.  This 
would be lot more beneficial in cloud storage where opening and reading small 
of data can be expensive compared to HDFS. Creating a superset of existing 
indexes with {{globalInc}} would be helpful.

  was:
When multiple queries are run in LLAP, stripe metadata cache misses were 
observed even though enough memory was available. 
https://github.com/apache/hive/blob/master/llap-server/src/java/org/apache/hadoop/hive/llap/io/encoded/OrcEncodedDataReader.java#L655.
 Even though data would be found in cache, it wasn't getting used as 
{{globalnc}} changed from query to query.  This 
would be lot more beneficial in cloud storage where opening and reading small 
of data can be expensive compared to HDFS. Creating a superset of existing 
indexes with {{globalInc}} would be helpful.


> LLAP : Enhance cache hits for stripe metadata across queries
> ------------------------------------------------------------
>
>                 Key: HIVE-15129
>                 URL: https://issues.apache.org/jira/browse/HIVE-15129
>             Project: Hive
>          Issue Type: Improvement
>          Components: llap
>            Reporter: Rajesh Balamohan
>            Priority: Minor
>         Attachments: HIVE-15129.1.patch
>
>
> When multiple queries are run in LLAP, stripe metadata cache misses were 
> observed even though enough memory was available. 
> https://github.com/apache/hive/blob/master/llap-server/src/java/org/apache/hadoop/hive/llap/io/encoded/OrcEncodedDataReader.java#L655.
>  Even in cases when data was found in cache, it wasn't getting used as 
> {{globalnc}} changed from query to query.  This 
> would be lot more beneficial in cloud storage where opening and reading small 
> of data can be expensive compared to HDFS. Creating a superset of existing 
> indexes with {{globalInc}} would be helpful.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to