You can put a Phoenix view <http://phoenix.apache.org/views.html> on top of your existing HBase table, then use Phoenix's Spark module <http://phoenix.apache.org/phoenix_spark.html> to read the table into a DataFrame.
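
Roughly, the two steps could look like the sketch below. It is untested against your cluster and makes several assumptions: the table name EVENTS, the column family "d", the qualifier "user", and the ZooKeeper address zkhost:2181 are all placeholders I made up, and the helper functions are my own, not part of Phoenix. It also assumes the phoenix-spark jar is on the interpreter's classpath. The "table" and "zkUrl" options and the "org.apache.phoenix.spark" format name are the ones described on the Phoenix Spark page linked above.

```python
def phoenix_view_ddl(table, columns):
    """Build a CREATE VIEW statement over an existing HBase table.

    columns is a list of (family, qualifier, phoenix_type) tuples.
    Identifiers are double-quoted so Phoenix keeps their case; the
    "pk" column stands in for the HBase row key.
    """
    cols = ", ".join('"%s"."%s" %s' % (f, q, t) for f, q, t in columns)
    return 'CREATE VIEW "%s" (pk VARCHAR PRIMARY KEY, %s)' % (table, cols)


def phoenix_reader_options(table, zk_url):
    """Options passed to the phoenix-spark data source."""
    return {"table": table, "zkUrl": zk_url}


def load_phoenix_table(sqlContext, table, zk_url):
    """Read a Phoenix table or view into a Spark DataFrame."""
    return (sqlContext.read
            .format("org.apache.phoenix.spark")
            .options(**phoenix_reader_options(table, zk_url))
            .load())


# Hypothetical example: HBase table EVENTS with column d:user.
# Run the printed statement in Phoenix (for example via sqlline), not in Spark.
print(phoenix_view_ddl("EVENTS", [("d", "user", "VARCHAR")]))
```

In a Zeppelin %pyspark paragraph you would then call something like df = load_phoenix_table(sqlContext, "EVENTS", "zkhost:2181") and df.registerTempTable("events") so the data can be queried from a %sql paragraph. If your HBase table name is lower or mixed case, the name handed to the "table" option will probably need the same double quoting as in the view definition.
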

On Fri, Mar 25, 2016 at 12:44 PM, Benjamin Kim <[email protected]> wrote:

> The hbase-spark module is still a work in progress in terms of Spark SQL.
> All the RDD methods are complete and ready to use against the current
> version of HBase 1.0+, but the use of DataFrames will require the
> unreleased version of HBase 2.0. Fortunately, there is work in progress to
> back-port the hbase-spark module to not have these deep rooted dependencies
> on HBase 2.0 (HBASE-14160). For more information on this, you can refer
> to
> http://blog.cloudera.com/blog/2015/08/apache-spark-comes-to-apache-hbase-with-hbase-spark-module/
> to see what they are trying to accomplish.
>
> On Mar 25, 2016, at 9:17 AM, Silvio Fiorito <[email protected]>
> wrote:
>
> There’s also this, which seems more current:
> https://github.com/apache/hbase/tree/master/hbase-spark
>
> I haven’t used it, but I know Ted Malaska and others from Cloudera have
> worked heavily on it.
>
> From: Felix Cheung <[email protected]>
> Reply-To: "[email protected]" <
> [email protected]>
> Date: Friday, March 25, 2016 at 12:01 PM
> To: "[email protected]" <
> [email protected]>, "[email protected]"
> <[email protected]>
> Subject: Re: Importaing Hbase data
>
> You should be able to access that from Spark SQL through a package like
> http://spark-packages.org/package/Huawei-Spark/Spark-SQL-on-HBase
>
> This package seems like have not been updated for a while though.
>
> On Tue, Mar 22, 2016 at 11:06 AM -0700, "Kumiko Yada" <
> [email protected]> wrote:
>
> Hello,
>
> Is there a way to importing Hbase data to the Zeppelin notebook using the
> Spark SQL?
>
> Thanks
> Kumiko