Re: Indexing in hive

2012-05-16 Thread Ranjith
Thanks Mark. I am going to give this a shot and get back with some results. Thanks, Ranjith On May 16, 2012, at 9:27 PM, Mark Grover wrote: > Hi Ranjith, > Here are the steps for using an index in Hive 0.7.1. > > 1) Create the index > CREATE INDEX x ON TABLE t(j) > AS 'org.apache.hadoop.hive.ql.i

Re: Indexing in hive

2012-05-16 Thread Mark Grover
Hi Ranjith, Here are the steps for using an index in Hive 0.7.1. 1) Create the index CREATE INDEX x ON TABLE t(j) AS 'org.apache.hadoop.hive.ql.index.compact.CompactIndexHandler' WITH DEFERRED REBUILD; 2) Build the index (since you specified the 'DEFERRED REBUILD' flag in the create index sta

Re: Indexing in hive

2012-05-16 Thread Raghunath, Ranjith
Thanks Mark, Carl, and Ransom. I really appreciate the answers here. I am using Hive 0.7.1 and am currently trying to create an index to improve the performance of a particular WHERE clause. I have not set any of the properties mentioned below. I can try the options you listed below. Based on

Re: Indexing in hive

2012-05-16 Thread Mark Grover
Ransom, From this JIRA (https://issues.apache.org/jira/browse/HIVE-1644), it looks like automatic use of indexes using hive.optimize.index.filter was introduced in Hive 0.8. However, Ranjith seems to be using Hive 0.7.1 which doesn't support those properties. Ranjith, you need to set the ap

Re: Indexing in hive

2012-05-16 Thread Carl Steinbach
Hi Ranjith, Hive 0.7 supports the ability to build indexes, but the query compiler in 0.7 doesn't know how to optimize queries with these indexes. Hive 0.8 was the first release to include some support for optimizing query plans with indexes, and that only applies to GROUP BY and WHERE clauses und

RE: Indexing in hive

2012-05-16 Thread Hezhiqiang (Ransom)
“hive.optimize.index.filter” is the configuration property that makes Hive use indexes automatically. If you set hive.optimize.index.groupby=true, it will set hive.optimize.index.filter=false, so check your configuration. You also need to build the index after creating it. Best regards, Ransom.
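
The points raised in this thread can be put together as a rough HiveQL sketch. It reuses the index x, table t, and column j from the original question; the filter value 5 is made up for illustration. The SET line assumes Hive 0.8 or later, since the thread notes that 0.7.1 does not support that property, and whether the plan actually uses the index depends on the release and the query.

```sql
-- Create a compact index on column j of table t (statement taken from the thread).
CREATE INDEX x ON TABLE t(j)
AS 'org.apache.hadoop.hive.ql.index.compact.CompactIndexHandler'
WITH DEFERRED REBUILD;

-- WITH DEFERRED REBUILD leaves the index empty, so build it explicitly.
ALTER INDEX x ON t REBUILD;

-- Assumption: Hive 0.8 or later. Ask the optimizer to use indexes for filters.
SET hive.optimize.index.filter=true;

-- Inspect the plan; the literal 5 is a hypothetical filter value.
EXPLAIN SELECT * FROM t WHERE j = 5;
```

On Hive 0.7.1 the first two statements should still apply, but the query compiler will not pick up the index on its own.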

Exception with datanucleus while running hive tests in eclipse

2012-05-16 Thread kulkarni.swar...@gmail.com
I installed datanucleus eclipse plugin as I realized that it is needed to run some of the hive tests in eclipse. While trying to run the enhancer tool, I keep getting this exception: "Exception occurred executing command line. Cannot run program "/System/Library/Java/JavaVirtualMachines/1.6.0.jdk/

Indexing in hive

2012-05-16 Thread Raghunath, Ranjith
I am currently using hive 0.7.1 and creating indexes based on columns in the where clause. However, when I run the explain plan I do not see the index being leveraged. The syntax that I am using to build the index is as follows: CREATE INDEX x ON TABLE t(j) AS 'org.apache.hadoop.hive.ql.index.c

Re: Order by Sort by partitioned columns

2012-05-16 Thread Tim Havens
I frequently sort by partitioned columns, without issues. Post your table schema and the query that's failing, and let's see what's going on. Tim On Mon, May 14, 2012 at 1:28 AM, Shin Chan wrote: > Hi All > > Just curious if it's possible to Order by or Sort by partitioned columns. > > I tried it