Thanks Mark, Carl, and Ransom. I really appreciate the answers here. I am using 
Hive 0.7.1 and currently trying to create an index to help with performance 
associated to a particular where clause. I have not set any properties as 
mentioned below. I can try the options you listed below. Based on the feedback, 
I do have a few questions:

1. It seems like if you optimize for a groupby you cannot optimize for the 
where clause. I am reading this correctly?

2. How do you build the index? 

3. Does the build process create a dataset that contains the index keys along 
with an offset value associated with the row? 

4. And I am guessing that you need to use the columns in the same order as it 
is defined in the index?


Thanks,
Ranjith      

----- Original Message -----
From: Mark Grover [mailto:mgro...@oanda.com]
Sent: Wednesday, May 16, 2012 07:52 PM
To: user@hive.apache.org <user@hive.apache.org>
Cc: Zhaojun (Terry) <terry.zhao...@huawei.com>
Subject: Re: Indexing in hive

Ransom,
From this JIRA (https://issues.apache.org/jira/browse/HIVE-1644), it looks like 
automatic use of indexes using hive.optimize.index.filter was introduced in 
Hive 0.8. However, Ranjith seems to be using Hive 0.7.1 which doesn't support 
those properties.

Ranjith, you need to set the appropriate priorities before calling your query 
to make use of indexes. Are you setting any properties? If so, what?
I will try to dig up what those properties are, in the meanwhile.

Mark

----- Original Message -----
From: "Hezhiqiang (Ransom)" <ransom.hezhiqi...@huawei.com>
To: user@hive.apache.org
Cc: "Zhaojun (Terry)" <terry.zhao...@huawei.com>
Sent: Wednesday, May 16, 2012 8:32:55 PM
Subject: RE: Indexing in hive




“ hive.optimize.index.filter ” is the conf automatically use indexes 

If u set hive.optimize.index.groupby = true. 

It will set hive.optimize.index.filter =false. 

See your configurations. 



And you need to build index after create index. 




Best regards 

Ransom. 

Reply via email to