Block merge for RCFile
----------------------
Key: HIVE-1950
URL: https://issues.apache.org/jira/browse/HIVE-1950
Project: Hive
Issue Type: New Feature
Reporter: He Yongqiang
Assignee: He Yongqiang
In our env, there are a lot of small files inside one partition/table. In order
to reduce the namenode load, we have one dedicated housekeeping job running to
merge these file. Right now the merge is an 'insert overwrite' in hive, and
requires decompress the data and compress it. This jira is to add a command in
Hive to do the merge without decompress and recompress the data.
Something like "alter table tbl_name [partition ()] merge files". In this jira
the new command will only support RCFile, since there need some new APIs to the
fileformat.
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira