Hello users,

(apache-hive-3.1.1-bin) I have a problem. The bucket files have been produced, but their contents do not follow hash_function(bucketing_column) mod num_buckets; rows appear to be allocated to the buckets at random. Below are the statements to create the table, the bucket files, and the query results.

+----------------------------------------------------+
|                   createtab_stmt                   |
+----------------------------------------------------+
| CREATE TABLE `students`(                           |
| `name` varchar(64),                                |
| `age` int,                                         |
| `gpa` decimal(3,2))                                |
| CLUSTERED BY (                                     |
| age)                                               |
| INTO 3 BUCKETS                                     |
| ROW FORMAT SERDE                                   |
| 'org.apache.hadoop.hive.ql.io.orc.OrcSerde'        |
| STORED AS INPUTFORMAT                              |
| 'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'  |
| OUTPUTFORMAT                                       |
| 'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat' |
| LOCATION                                           |
| 'hdfs://sicluster/hive/warehouse/inputdb.db/students' |
| TBLPROPERTIES (                                    |
| 'bucketing_version'='2',                           |
| 'transient_lastDdlTime'='1564647129')              |
+----------------------------------------------------+
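
To make the expectation concrete, here is a minimal sketch of the allocation I expected from hash_function(bucketing_column) mod num_buckets. It is only an illustration, not Hive's actual code: the function name expected_bucket and the sample ages are hypothetical, and it assumes that the hash of an int column is the integer value itself. Whether a table with 'bucketing_version'='2' uses that same hash is not something this sketch establishes.

```python
def expected_bucket(hash_value: int, num_buckets: int) -> int:
    """Bucket index under the rule hash_function(bucketing_column) mod num_buckets."""
    # Mask to a non-negative 31-bit value so the bucket index is never negative.
    return (hash_value & 0x7FFFFFFF) % num_buckets


# Hypothetical ages; ASSUMPTION: hash_function(age) == age for an int column.
for age in (18, 19, 20, 21):
    print(age, expected_bucket(age, 3))
```

Under that assumption, rows whose ages differ by a multiple of 3 should end up in the same bucket file, which is not what I see.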
My English is not very good; I hope you can understand it.

Best regards,
Li Wei