Hi,
      I have a question related to hive table *bucketing* based on multiple
columns(*Clustered by* on a common set of columns).

How would be the join performance if I am joining this table to itself
based on few columns that I have specified in *clustered by *condition(not
all)?

Will the hashing differs based on few columns vs using all columns that I
specified in the *Clustered by* clause on a table?

Regards
Krish

Reply via email to