hi, Raajay.

https://issues.apache.org/jira/browse/HIVE-7313 provides a potential
solutions to store intermediate data into Memory/SSD. But it relies on the
hdfs feature of multiple StorageType (
https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html

2015-12-02 5:17 GMT+08:00 Raajay <raaja...@gmail.com>:

> Hello,
>
> My setup is Hive on Tez.  I find that for most of my queries, the map
> stage takes the longest. Is it possible to use the Tez Shared Object
> Registry to cache the intermediate data to improve performance of recurring
> queries ?
>
> If yes, how would I do it ? Assuming that the nodes I run on have
> sufficient RAM to store all intermediate data.
>
> Raajay
>



-- 
Bing Jiang

Reply via email to