That's right.  Hive can use an HBase table as an input format to the
hive query regardless of output format, and can also write the output
to an HBase table regardless of the input format.  You can also
supposedly do a join in Hive that uses 1 side of the join from an
HBase table, and the other side a text file, which is very powerful.
I haven't done it myself, but intend to shortly.

HTH,
Tim


On Wed, Oct 13, 2010 at 10:07 PM, Otis Gospodnetic
<otis_gospodne...@yahoo.com> wrote:
> Hi,
>
> I was wondering how I can query data stored in HBase and remembered Hive's 
> HBase
> integration:
> http://wiki.apache.org/hadoop/Hive/HBaseIntegration
>
> After watching John Sichi's video
> (http://developer.yahoo.com/blogs/hadoop/posts/2010/04/hundreds_of_hadoop_fans_at_the/
>  ) I have a better idea about what functionality this integration provides, 
> but
> I still have some questions.
>
> Would it be correct to say that Hive-HBase integration makes the following 
> data
> flow possible:
>
> 0) Hive or Files => Custom HQL statement that aggregates data  ==> HBase
> 1) HBase ==> Custom HQL statement that aggregates data  ==> HBase
> 2) HBase ==> Custom HQL statement that aggregates data  ==> output (console?)
>
> Of the above, 1) is what I'm wondering the most about right now.
>
> In other words, it seems to me that Hive may be able to look at *just* data
> stored in HBase *without* the typical data/files in HDFS that Hive normally 
> runs
> its MR jobs against.
>
> Is this correct?
>
> Thanks,
> Otis
> ----
> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> Hadoop ecosystem search :: http://search-hadoop.com/
>
>

Reply via email to