Make Hive database data center aware
------------------------------------
Key: HIVE-1820
URL: https://issues.apache.org/jira/browse/HIVE-1820
Project: Hive
Issue Type: New Feature
Reporter: Ning Zhang
Assignee: Ning Zhang
In order to support multiple data centers (different DFS, MR clusters) for
hive, it is desirable to extend Hive database to be data center aware.
Currently Hive database is a logical concept and has no DFS or MR cluster info
associated with it. Database has the location property indicating the default
warehouse directory, but user cannot specify and change it. In order to make it
data center aware, the following info need to be maintained:
1) data warehouse root location which is the default HDFS location for newly
created tables (default=hive.metadata.warehouse.dir).
2) scratch dir which is the HDFS location where MR intermediate files are
created (default=hive.exec.scratch.dir)
3) MR job tracker URI that jobs should be submitted to
(default=mapred.job.tracker)
4) hadoop (bin) dir ($HADOOP_HOME/bin/hadoop)
These parameters should be saved in database.parameters (key, value) pair and
they overwrite the jobconf parameters (so if the default database has no
parameter it will get it from the hive-default.xml or hive-site.xml as it is
now).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.