hive table clustering - question

krish ws Thu, 24 Apr 2014 19:17:27 -0700

Hi,

I have a question about Hive table *bucketing* on multiple
columns (*Clustered by* on a common set of columns).


How will a join perform if I join this table to itself on only some
(not all) of the columns that I specified in the *Clustered by* clause?

Will the hashing differ when only some of the columns are used, compared
with using all the columns that I specified in the *Clustered by* clause
on the table?

Regards
Krish
