Can one use bucketing in Hive to emulate hash partitioning in a database? Is 
there also a way to segment data into buckets dynamically, based on the values 
in a column? For example:

Col1      Col2
Apple     1
Orange    2
Apple     2
Banana    1

If the file above were inserted into a table with Col1 as the bucket column, 
could all of the rows with “Apple” dynamically go into one file, all of the 
rows with “Orange” into another, and so on? Is there a way to do this without 
fixing the number of buckets at 3?
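
For reference, here is a rough sketch of the static setup I am trying to avoid. The table names fruit_bucketed and fruit_staging are made up for illustration, and the SET line is an assumption that only applies to older Hive releases where bucketing is not enforced by default:

```sql
-- Hypothetical table and column names, for illustration only.
CREATE TABLE fruit_bucketed (
  col1 STRING,
  col2 INT
)
CLUSTERED BY (col1) INTO 3 BUCKETS;  -- bucket count is fixed at table creation

-- Assumption: needed only on older Hive versions.
SET hive.enforce.bucketing = true;

-- fruit_staging is a hypothetical table holding the rows shown above.
INSERT OVERWRITE TABLE fruit_bucketed
SELECT col1, col2 FROM fruit_staging;
```

As far as I understand, rows are assigned to buckets by hashing Col1 modulo the bucket count, so even with 3 buckets, two different values could end up in the same file.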
Thank you,
Ranjith
