Hi, I have a question about Hive table *bucketing* on multiple columns (a *CLUSTERED BY* clause over a set of columns).
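To make the scenario concrete, here is a minimal sketch. The table name, columns, and bucket count are hypothetical, chosen only for illustration:

```sql
-- Hypothetical table, bucketed on three columns
CREATE TABLE orders_bucketed (
  customer_id BIGINT,
  region      STRING,
  order_date  STRING,
  amount      DOUBLE
)
CLUSTERED BY (customer_id, region, order_date) INTO 32 BUCKETS
STORED AS ORC;

-- Self-join on only two of the three bucketing columns
SELECT a.customer_id, a.region, a.amount, b.amount
FROM orders_bucketed a
JOIN orders_bucketed b
  ON a.customer_id = b.customer_id
 AND a.region = b.region;
```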
How would join performance be affected if I join this table to itself on only some of the columns listed in the *CLUSTERED BY* clause (not all of them)? Does the hashing differ when only a subset of the columns is used, compared with using all of the columns specified in the *CLUSTERED BY* clause?

Regards,
Krish