The table can be external. You should be able to use this data with other
tools, because all bucketing does is ensure that all occurrences for
records with a given key are written into the same block. This is why
clustered/blocked data can be joined on those keys using map-side joins;
Hive knows it
Thanks, Dean.
Does that mean, this bucketing is exclusively Hive feature and not
available to others like Java, Pig, etc?
And also, my final tables have to be managed tables; not external tables,
right?
.
Thank again for your time and help.
Sadu
On Fri, Mar 29, 2013 at 5:57 PM, Dean Wampler