I am trying to integrate SparkStreaming with HBase. I am calling following APIs to connect to HBase
HConnection hbaseConnection = HConnectionManager.createConnection(conf); hBaseTable = hbaseConnection.getTable(hbaseTable); Since I cannot get the connection and broadcast the connection each API call to get data from HBase is very expensive. I tried using JavaHBaseContext (JavaHBaseContext hbaseContext = new JavaHBaseContext(jsc, conf)) by using hbase-spark library in CDH 5.5 but I cannot import the library from maven. Has anyone been able to successfully resolve this issue. I am trying to use the latest APIs to connect HBase and SparkStreaming on Cloudera. Some of the JIRA items mentioned here. http://blog.cloudera.com/blog/2015/08/apache-spark-comes-to-apache-hbase-with-hbase-spark-module/ Question is how to get maven repository <dependency> <groupId>org.apache.hbase</groupId> <artifactId>hbase-spark</artifactId> <version>2.0.0-SNAPSHOT</version> </dependency>