[ 
https://issues.apache.org/jira/browse/HIVE-14511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15417732#comment-15417732
 ] 

Subramanyam Pattipaka commented on HIVE-14511:
----------------------------------------------

[~sershe], some users have their large data in structure with format 
data/partlevel=0/partlevel2=0/partlevel3=0/partleve4=0/..../partleveln=0/file1

Given this structure, using configs mapred.input.dir.recursive and 
hive.mapred.supports.subdirectories set to true, the expectation is that we can 
create partitions at any level and query data. 

Users can generate data considering various tools in mind. Asking them to 
reorganize data and create a copy for Hive may put hurdle for trying out Hive 
as data could be very huge and it may not always be possible.

This fix will ensure that we add appropriate partitions for above case when 
user tries to create partitions with any number of levels.

> Improve MSCK for partitioned table to deal with special cases
> -------------------------------------------------------------
>
>                 Key: HIVE-14511
>                 URL: https://issues.apache.org/jira/browse/HIVE-14511
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>         Attachments: HIVE-14511.01.patch
>
>
> Some users will have a folder rather than a file under the last partition 
> folder. However, msck is going to search for the leaf folder rather than the 
> last partition folder. We need to improve that.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to