[ 
https://issues.apache.org/jira/browse/HIVE-18603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Peter Vary reassigned HIVE-18603:
---------------------------------

    Assignee:     (was: Peter Vary)

> Use Hash For Partition HDFS File Path
> -------------------------------------
>
>                 Key: HIVE-18603
>                 URL: https://issues.apache.org/jira/browse/HIVE-18603
>             Project: Hive
>          Issue Type: Improvement
>          Components: HiveServer2
>    Affects Versions: 1.2.0, 2.3.0, 2.4.0, 3.0.0
>            Reporter: David Mollitor
>            Priority: Minor
>
> Currently, for partitioned tables, Hive uses the literal value of each 
> partition in the HDFS file path.  Instead, perhaps we can use a hash value so 
> that:
>  
>  # The partition values are obscured to a casual observer in HDFS
>  # There is no chance of a very long HDFS file name when faced with a 
> very long partition value
>  # There is no need to worry about special characters in the partition path 
> name, as the hash value would contain only hexadecimal characters.
>  
> The suggestion here is that we retain the partition values, just as is done 
> now, but the default HDFS location for each partition will use the hash of 
> the value instead of the value itself.
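
As a rough illustration of the proposal quoted above, the sketch below contrasts a literal-value directory name with a hashed one. It is only a sketch under stated assumptions: the issue does not name a hash function or say whether the partition column name stays in the path, so SHA-256, the `key=<hash>` layout, and the helper names `literal_partition_dir` and `hashed_partition_dir` are all hypothetical, and the percent-escaping only approximates what Hive does today.

```python
import hashlib
from urllib.parse import quote


def literal_partition_dir(key: str, value: str) -> str:
    """Directory name built from the literal value (approximation of the
    current scheme; percent-escaping stands in for Hive's own escaping)."""
    return f"{key}={quote(value, safe='')}"


def hashed_partition_dir(key: str, value: str) -> str:
    """Directory name built from a hash of the value (hypothetical scheme).

    Assumption: SHA-256 is used and the column name is kept as a prefix.
    The result has a fixed length and contains only hex characters after
    the '=' sign, regardless of what the partition value contains.
    """
    digest = hashlib.sha256(value.encode("utf-8")).hexdigest()
    return f"{key}={digest}"


if __name__ == "__main__":
    value = "2018/02/01 a:b"  # a value with characters that need escaping
    print(literal_partition_dir("dt", value))
    print(hashed_partition_dir("dt", value))
```

Because a hash cannot be reversed, a scheme like this only works if the real partition values remain stored elsewhere, which matches the suggestion to retain the partition values as is done now.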



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
