BELUGA BEHR created HIVE-18603:
----------------------------------

             Summary: Use Hash For Partition HDFS File Path
                 Key: HIVE-18603
                 URL: https://issues.apache.org/jira/browse/HIVE-18603
             Project: Hive
          Issue Type: Improvement
          Components: HiveServer2
    Affects Versions: 2.3.0, 1.2.0, 3.0.0, 2.4.0
            Reporter: BELUGA BEHR


Currently, for partitioned tables, Hive uses the literal value of each 
partition in the HDFS file path.  Instead, perhaps we can use a hash value so 
that:

 
 # The partitioned values are obscured to a casual observer in HDFS
 # Remove the chance of having a very long HDFS file name when faced with a 
very long partitioned value
 # Remove the needs to worry about special characters in the partitioned path 
name as the hash value would only be HEX string values.

 

The suggestion here is that we retain the partition values, just as is done 
now, but the default HDFS location for each partition will use the hash of the 
value instead of the value itself.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to