Depending on the query, Hive on Spark does implicitly cache datasets (not
necessarily the input tables) for performance benefits. Such queries
include multi-insert, self-join, self-union, etc. However, no caching
happens across queries at this time, which may be improved in the future.
Thanks,
Xue
onsibility.
From: Udit Mehta [mailto:ume...@groupon.com]
Sent: 02 December 2015 23:43
To: user@hive.apache.org
Subject: Re: Hive on spark table caching
Im using Spark 1.3 with Hive 1.2.1. I dont mind using a version of Spark higher
than that but I read somewhere that 1.3 is the version of
ed by Peridale Technology
> Ltd, its subsidiaries or their employees, unless expressly so stated. It is
> the responsibility of the recipient to ensure that this email is virus
> free, therefore neither Peridale Ltd, its subsidiaries nor their employees
> accept any responsibility.
>
&g
hive.apache.org
Subject: Hive on spark table caching
Hi,
I have started using Hive on Spark recently and am exploring the benefits it
offers. I was wondering if Hive on Spark has capabilities to cache table like
Spark SQL. Or does it do any form of implicit caching in the long running job
which i
Hi,
I have started using Hive on Spark recently and am exploring the benefits
it offers. I was wondering if Hive on Spark has capabilities to cache table
like Spark SQL. Or does it do any form of implicit caching in the long
running job which it starts after running the first query?
Thanks,
Udit