RE: Optimization on bucketized/sorted tables

2012-03-23 Thread mdefoinplatel.ext
:mgro...@oanda.com] Envoyé : jeudi 22 mars 2012 17:58 À : user@hive.apache.org Objet : Re: Optimization on bucketized/sorted tables Hi Michael, This JIRA is along the lines of your questions: https://issues.apache.org/jira/browse/HIVE-2846 The following is based on my understanding so take it with a gra

Re: Optimization on bucketized/sorted tables

2012-03-22 Thread Mark Grover
foinplatel ext" To: user@hive.apache.org Sent: Tuesday, March 20, 2012 10:19:41 AM Subject: Optimization on bucketized/sorted tables Hi folks, I have several questions about optimization in Hive, they are mainly related to bucketized/sorted tables. Let say I have a table T bucketize

Optimization on bucketized/sorted tables

2012-03-20 Thread mdefoinplatel.ext
Hi folks, I have several questions about optimization in Hive, they are mainly related to bucketized/sorted tables. Let say I have a table T bucketized on user_id and sorted by user_id, time. CREATE TABLE T ( user_id BIGINT, time INT ) CLUSTERED BY(user_id) SORTED BY(user_id, time) INTO 64 BUC