Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-09 Thread Edward Capriolo
-- >> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch >> Lucene ecosystem search :: http://search-lucene.com/ >> >> >> >> - Original Message >>> From: John Sichi >>> To: "" >>> Sent: Tue, March 8, 2011 1

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-09 Thread John Sichi
> > > - Original Message >> From: John Sichi >> To: "" >> Sent: Tue, March 8, 2011 1:05:34 AM >> Subject: Re: Performance between Hive queries vs. Hive over HBase queries >> >> Yes. >> >> JVS >> >> On M

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-09 Thread Otis Gospodnetic
http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: John Sichi > To: "" > Sent: Tue, March 8, 2011 1:05:34 AM > Subject: Re: Performance between Hive queries vs. Hive over HBase queries > > Yes.

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-09 Thread John Sichi
d watch? > > Thanks, > Otis > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosystem search :: http://search-lucene.com/ > > > > - Original Message >> From: John Sichi >> To: "" >> Sent: Tue, March 8, 2011 1

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-08 Thread Otis Gospodnetic
://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: John Sichi > To: "" > Sent: Tue, March 8, 2011 1:17:51 AM > Subject: Re: Performance between Hive queries vs. Hive over HBase queries > > F

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-07 Thread Vaibhav Aggarwal
If you are querying for particular key you should see better performance though. We have filter push-down for equals on hbase key column. On Mar 7, 2011 10:18 PM, "John Sichi" wrote:

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-07 Thread John Sichi
For native tables, Hive reads rows directly from HDFS. For HBase tables, it has to go through the HBase region servers, which reconstruct rows from column families (combining cache + HDFS). HBase makes it possible to keep your table up to date in real time, but you have to pay an overhead cost

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-07 Thread Biju Kaimal
Hi, Could you please explain the reason for the behavior? Regards, Biju On Tue, Mar 8, 2011 at 11:35 AM, John Sichi wrote: > Yes. > > JVS > > On Mar 7, 2011, at 9:59 PM, Biju Kaimal wrote: > > > Hi, > > > > I loaded a data set which has 1 million rows into both Hive and HBase > tables. For the

Re: Performance between Hive queries vs. Hive over HBase queries

2011-03-07 Thread John Sichi
Yes. JVS On Mar 7, 2011, at 9:59 PM, Biju Kaimal wrote: > Hi, > > I loaded a data set which has 1 million rows into both Hive and HBase tables. > For the HBase table, I created a corresponding Hive table so that the data in > HBase can be queried from Hive QL. Both tables have a key column an