BELUGA BEHR created HIVE-18603: ---------------------------------- Summary: Use Hash For Partition HDFS File Path Key: HIVE-18603 URL: https://issues.apache.org/jira/browse/HIVE-18603 Project: Hive Issue Type: Improvement Components: HiveServer2 Affects Versions: 2.3.0, 1.2.0, 3.0.0, 2.4.0 Reporter: BELUGA BEHR
Currently, for partitioned tables, Hive uses the literal value of each partition in the HDFS file path. Instead, perhaps we can use a hash value so that: # The partitioned values are obscured to a casual observer in HDFS # Remove the chance of having a very long HDFS file name when faced with a very long partitioned value # Remove the needs to worry about special characters in the partitioned path name as the hash value would only be HEX string values. The suggestion here is that we retain the partition values, just as is done now, but the default HDFS location for each partition will use the hash of the value instead of the value itself. -- This message was sent by Atlassian JIRA (v7.6.3#76005)