Hi, I was wondering how I can query data stored in HBase and remembered Hive's HBase
integration: http://wiki.apache.org/hadoop/Hive/HBaseIntegration After watching John Sichi's video (http://developer.yahoo.com/blogs/hadoop/posts/2010/04/hundreds_of_hadoop_fans_at_the/ ) I have a better idea about what functionality this integration provides, but I still have some questions. Would it be correct to say that Hive-HBase integration makes the following data flow possible: 0) Hive or Files => Custom HQL statement that aggregates data ==> HBase 1) HBase ==> Custom HQL statement that aggregates data ==> HBase 2) HBase ==> Custom HQL statement that aggregates data ==> output (console?) Of the above, 1) is what I'm wondering the most about right now. In other words, it seems to me that Hive may be able to look at *just* data stored in HBase *without* the typical data/files in HDFS that Hive normally runs its MR jobs against. Is this correct? Thanks, Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/