Re: Hive on spark table caching

2015-12-02 Thread Xuefu Zhang
Depending on the query, Hive on Spark does implicitly cache datasets (not necessarily the input tables) for performance benefits. Such queries include multi-insert, self-join, self-union, etc. However, no caching happens across queries at this time, which may be improved in the future. Thanks, Xue

RE: Hive on spark table caching

2015-12-02 Thread Mich Talebzadeh
onsibility. From: Udit Mehta [mailto:ume...@groupon.com] Sent: 02 December 2015 23:43 To: user@hive.apache.org Subject: Re: Hive on spark table caching Im using Spark 1.3 with Hive 1.2.1. I dont mind using a version of Spark higher than that but I read somewhere that 1.3 is the version of

Re: Hive on spark table caching

2015-12-02 Thread Udit Mehta
ed by Peridale Technology > Ltd, its subsidiaries or their employees, unless expressly so stated. It is > the responsibility of the recipient to ensure that this email is virus > free, therefore neither Peridale Ltd, its subsidiaries nor their employees > accept any responsibility. > &g

RE: Hive on spark table caching

2015-12-02 Thread Mich Talebzadeh
hive.apache.org Subject: Hive on spark table caching Hi, I have started using Hive on Spark recently and am exploring the benefits it offers. I was wondering if Hive on Spark has capabilities to cache table like Spark SQL. Or does it do any form of implicit caching in the long running job which i

Hive on spark table caching

2015-12-02 Thread Udit Mehta
Hi, I have started using Hive on Spark recently and am exploring the benefits it offers. I was wondering if Hive on Spark has capabilities to cache table like Spark SQL. Or does it do any form of implicit caching in the long running job which it starts after running the first query? Thanks, Udit