Why are you thinking of using HBase?

Just store the SCD versions in a normal Hive dimension table. In case you are worried about updates to columns such as 'valid to' and 'latest record indicator' you can calculate these on the fly using window functions. No need to create and update them physically. You can read more about it here https://sonra.io/2017/05/15/dimensional-modeling-and-kimball-data-marts-in-the-age-of-big-data-and-hadoop/



On 07/06/2017 11:13, Ramasubramanian Narayanan wrote:
Hi,

Can you please let us know Pro and Cons of using HBase table as an external table in HIVE.

Will there be any performance degrade when using Hive over HBase instead of using direct HIVE table.

The table that I am planning to use in HBase will be master table like account, customer. Wanting to achieve Slowly Changing Dimension. Please through some lights on that too if you have done any such implementations.

Thanks and Regards,
Rams

--
___________________________
Uli Bethke
CEO Sonra
p: +353 86 32 83 040
w: www.sonra.io
l: linkedin.com/in/ulibethke
t: twitter.com/ubethke
s: uli.bethke

Chair Hadoop User Group Ireland
www.hugireland.org
Associate President DAMA Ireland

Reply via email to