minihippo commented on pull request #4785:
URL: https://github.com/apache/hudi/pull/4785#issuecomment-1041505103


   > @minihippo yes, will do it, but I still have a question:
   > [BucketIdentifier.java](https://github.com/apache/hudi/pull/4785/files#diff-cec97cf4da631039da992f4b6d21424c18a39ced051e2c2faf3739b40b9ebec6)
   > already computes a hash value for the recordKey. Why don't we just use the
   > hash value of the recordKey?
   
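   Hudi is commonly used to store data warehouse summary tables at ByteDance.
A summary table is organized by dimensions and indexes, and the id and
dimension columns together form a composite primary key. The bucket key can be
one of the dimensions that data analytics frequently aggregates on, which
accelerates those queries. In this scenario the primary key is a complex type
(id plus dimension column), so hashing the whole record key would scatter rows
that share a dimension value across buckets, while hashing only the bucket key
keeps them together.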
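
To illustrate the difference, here is a minimal sketch, not the actual `BucketIdentifier` code in this PR. The class `BucketKeySketch`, the method `bucketId`, and the field names `id` and `region` are all hypothetical, and the hash function is just `List.hashCode()` chosen for brevity. It only shows that hashing a subset of the key fields puts records with the same dimension value in the same bucket.

```java
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class BucketKeySketch {

    // Hypothetical helper: hash only the selected fields of a composite
    // record key and map the result to one of numBuckets buckets.
    static int bucketId(Map<String, String> recordKey,
                        List<String> hashFields,
                        int numBuckets) {
        List<String> values = hashFields.stream()
                .map(recordKey::get)
                .collect(Collectors.toList());
        // Clear the sign bit so the modulus is never negative.
        return (values.hashCode() & Integer.MAX_VALUE) % numBuckets;
    }

    public static void main(String[] args) {
        // Two records with different ids but the same dimension value.
        Map<String, String> r1 = Map.of("id", "1", "region", "eu");
        Map<String, String> r2 = Map.of("id", "2", "region", "eu");

        // Bucket key = the dimension only: both records share a bucket.
        List<String> bucketKey = List.of("region");
        System.out.println(bucketId(r1, bucketKey, 8) == bucketId(r2, bucketKey, 8));

        // Bucket key = the full record key: the buckets may or may not match.
        List<String> fullKey = List.of("id", "region");
        System.out.println(bucketId(r1, fullKey, 8) + " vs " + bucketId(r2, fullKey, 8));
    }
}
```

With the dimension as the bucket key, an aggregation on `region` only has to read the buckets that hold that value; with the full record key, the same rows could be spread over all buckets.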



