Please take a look at Hive with Tez as the execution engine on Hadoop 2.3. It may help you to compare that with what you want to achieve.
On Fri, Aug 1, 2014 at 4:13 PM, Uli Bethke <uli.bet...@sonra.io> wrote:

> Hi.
>
> in Hive can I make use of the centralized cache management introduced in
> Hadoop 2.3
> (http://hadoop.apache.org/docs/r2.3.0/hadoop-project-dist/hadoop-hdfs/CentralizedCacheManagement.html)?
> If not implemented yet, is this on the roadmap?
>
> My use case is that I want to pin a fact table that needs to be queried
> frequently into memory.
>
> Impala already supports this as per the Cloudera documentation
> http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Impala/ciiu_perf_hdfs_caching.html
>
> Thanks
> uli

--
Nitin Pawar