[ 
https://issues.apache.org/jira/browse/HIVE-21305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16774884#comment-16774884
 ] 

Peter Vary commented on HIVE-21305:
-----------------------------------

[~prasanth_j]: So if we decide that the query inserts into a table, we do not 
add entries to the cache, but we still use the existing cache elements?

What do you think about using the row count statistics for the input tables? 
ETL queries might still use and reuse some smaller tables in joins. We might be 
better off caching the small tables but skipping the big ones.
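
A rough sketch of that per-table decision, only to illustrate the idea. The 
class, method names and the row-count threshold are all hypothetical and are 
not existing Hive APIs or configs; the boolean stands in for the result of 
QBParseInfo.hasInsertTables(), and a negative row count is assumed to mean 
that statistics are missing.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch; none of these names exist in Hive. */
final class EtlCachePolicySketch {
    private EtlCachePolicySketch() {}

    /**
     * Decides whether reads of one input table may add entries to the cache.
     *
     * @param queryHasInsertTables stand-in for QBParseInfo.hasInsertTables()
     * @param rowCount             row count from table statistics, negative if unknown
     * @param maxRowsToCache       hypothetical threshold separating small from big tables
     */
    static boolean shouldPopulateCache(boolean queryHasInsertTables,
                                       long rowCount,
                                       long maxRowsToCache) {
        if (!queryHasInsertTables) {
            return true; // not an ETL query: cache as usual
        }
        if (rowCount < 0) {
            return false; // no statistics: assume the table is big (assumption)
        }
        return rowCount <= maxRowsToCache; // ETL query: cache only small inputs
    }

    /** Applies the decision to every input table, keyed by table name. */
    static Map<String, Boolean> plan(boolean queryHasInsertTables,
                                     Map<String, Long> rowCountsByTable,
                                     long maxRowsToCache) {
        Map<String, Boolean> result = new LinkedHashMap<>();
        for (Map.Entry<String, Long> e : rowCountsByTable.entrySet()) {
            result.put(e.getKey(),
                shouldPopulateCache(queryHasInsertTables, e.getValue(), maxRowsToCache));
        }
        return result;
    }
}
```

Existing cache entries would still be read in every case; the sketch only 
covers whether new entries get added.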

Thanks,

Peter 

 

> LLAP: Option to skip cache for ETL queries
> ------------------------------------------
>
>                 Key: HIVE-21305
>                 URL: https://issues.apache.org/jira/browse/HIVE-21305
>             Project: Hive
>          Issue Type: Improvement
>          Components: llap
>    Affects Versions: 4.0.0
>            Reporter: Prasanth Jayachandran
>            Priority: Major
>
> To keep ETL queries from polluting the cache, it would be good to detect such 
> queries at compile time and optionally skip LLAP IO for them. 
> org.apache.hadoop.hive.ql.parse.QBParseInfo.hasInsertTables() is the simplest 
> way to catch ETL queries.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
