minihippo edited a comment on pull request #4785: URL: https://github.com/apache/hudi/pull/4785#issuecomment-1041505103
> > @minihippo yes will do it, but i still have some problems, [BucketIdentifier.java](https://github.com/apache/hudi/pull/4785/files#diff-cec97cf4da631039da992f4b6d21424c18a39ced051e2c2faf3739b40b9ebec6) will give a hash value for recordKey, Why don't we just use the hash value of recordkey ? @xiarixiaoyao Hudi is commonly used to store data warehouse summary tables at ByteDance. Summary table is organized with dimensions and indexes. Id and dimension is composite primary key. Bucket key can be one of the dimension which is frequently aggregated by data analytics to accelerated query. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org