[jira] [Created] (HIVE-7292) Hive on Spark

Xuefu Zhang (JIRA) Wed, 25 Jun 2014 12:58:29 -0700

Xuefu Zhang created HIVE-7292:
---------------------------------

             Summary: Hive on Spark
                 Key: HIVE-7292
                 URL: https://issues.apache.org/jira/browse/HIVE-7292
             Project: Hive
          Issue Type: Improvement
            Reporter: Xuefu Zhang
            Assignee: Xuefu Zhang



Spark as an open-source data analytics cluster computing framework has gained 
significant momentum recently. Many Hive users already have Spark installed as 
their computing backbone. To take advantages of Hive, they still need to have 
either MapReduce or Tez on their cluster. This initiative will provide user a 
new alternative so that those user can consolidate their backend. 

Secondly, providing such an alternative further increases Hive's adoption as it 
exposes Spark users  to a viable, feature-rich de facto standard SQL tools on 
Hadoop.

Finally, allowing Hive to run on Spark also has performance benefits. Hive 
queries, especially those involving multiple reducer stages, will run faster, 
thus improving user experience as Tez does.

This is an umber JIRA which will cover many coming subtask. Design doc will be 
attached here shortly, and will be on the wiki as well. Feedback from the 
community is greatly appreciated!



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Created] (HIVE-7292) Hive on Spark

Reply via email to