I'm a beginner in big data. I don't have much knowledge of HBase or Hive.
What's the difference between HBase and Hive/HDFS for storing data for analytics?

Thanks,
Jared

________________________________
From: ayan guha <guha.a...@gmail.com>
Sent: Wednesday, July 20, 2016 9:34:24 PM
To: Rabin Banerjee
Cc: user; Yu Wei; Deepak Sharma
Subject: Re: Is it good choice to use DAO to store results generated by spark application?

Just as a sanity check: saving data to HBase for analytics may not be the best choice. Is there any specific reason for not using HDFS or Hive?

On 20 Jul 2016 20:57, "Rabin Banerjee" <dev.rabin.baner...@gmail.com> wrote:

Hi Wei,

You can do something like this:

    import org.apache.hadoop.hbase.{HBaseConfiguration, TableName}
    import org.apache.hadoop.hbase.client.ConnectionFactory
    import scala.collection.JavaConverters._

    // rdd is assumed to be an RDD[Put]
    rdd.foreachPartition((part) => {
      // One connection per partition, not per record
      val conn = ConnectionFactory.createConnection(HBaseConfiguration.create())
      val table = conn.getTable(TableName.valueOf(tablename))
      // part.foreach((inp) => { println(inp); table.put(inp) }) // this is a line-by-line put
      table.put(part.toList.asJava) // batch put for the whole partition
      table.close()
      conn.close()
    })

Now, if you want to wrap it inside a DAO, it's up to you. A DAO will abstract things, but ultimately it is going to use the same code.

Note: Always use the HBase ConnectionFactory to get a connection, and write data on a per-partition basis.

Regards,
Rabin Banerjee

On Wed, Jul 20, 2016 at 12:06 PM, Yu Wei <yu20...@hotmail.com> wrote:

I need to write all data received from MQTT into HBase for further processing. It is not the final result. I also need to read the data back from HBase for analysis.

Is a DAO a good choice in such a situation?

Thx,
Jared

________________________________
From: Deepak Sharma <deepakmc...@gmail.com>
Sent: Wednesday, July 20, 2016 12:34:07 PM
To: Yu Wei
Cc: spark users
Subject: Re: Is it good choice to use DAO to store results generated by spark application?

I am using a DAO in a Spark application to write the final computation to Cassandra, and it performs well.
What kinds of issues do you foresee in using a DAO for HBase?
Thanks
Deepak

On 19 Jul 2016 10:04 pm, "Yu Wei" <yu20...@hotmail.com> wrote:

Hi guys,

I am writing a Spark application and want to store the results it generates in HBase.
Do I need to access HBase via the Java API directly? Or is it a better choice to use a DAO, as with a traditional RDBMS?

I suspect that using a DAO causes a major performance degradation and other negative impacts. However, I have little knowledge in this field.

Any advice?

Thanks,
Jared
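
For reference, here is a minimal sketch of the point made in this thread: a DAO only wraps the same per-partition write, so its cost depends on how it is opened and batched, not on the abstraction itself. All names here (Reading, ReadingDao, InMemoryReadingDao, PartitionWriter, batchSize) are hypothetical, and the in-memory class is only a stand-in for a DAO that would hold an HBase connection and table.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical record type for data arriving from MQTT.
final case class Reading(topic: String, payload: String)

// The DAO hides the storage API behind a small interface.
trait ReadingDao extends AutoCloseable {
  def saveAll(batch: Seq[Reading]): Unit
}

// Stand-in for an HBase-backed DAO (assumption: a real one would wrap
// a Connection and a Table). It records how many batches were written.
final class InMemoryReadingDao(store: ArrayBuffer[Reading]) extends ReadingDao {
  var batches = 0
  def saveAll(batch: Seq[Reading]): Unit = { store ++= batch; batches += 1 }
  def close(): Unit = ()
}

object PartitionWriter {
  // Open one DAO per partition, write the records in batches, always close.
  def writePartition(part: Iterator[Reading],
                     open: () => ReadingDao,
                     batchSize: Int = 1000): Unit = {
    val dao = open()
    try part.grouped(batchSize).foreach(batch => dao.saveAll(batch))
    finally dao.close()
  }
}
```

In a Spark job this would presumably be called as rdd.foreachPartition(part => PartitionWriter.writePartition(part, () => someDao)), where someDao is built inside the closure so that the connection is created on the executor and not serialized from the driver.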