Hi, I notice that Impala is rarely mentioned these days. I may be missing something. However, I gather it is coming to end now as I don't recall many use cases for it (or customers asking for it). In contrast, Hive has hold its ground with the new addition of Spark and Tez as execution engines, support for ACID and ORC and new stuff in Hive 2. In addition provided a good choice for its metastore it scales well.
If Hive had the ability (organic) to have local variable and stored procedure support then it would be top notch Data Warehouse. Given its metastore, I don't see any technical reason why it cannot support these constructs. I was recently asked to comment on migration from commercial DWs to Big Data (primarily for TCO reason) and really could not recall any better candidate than Hive. Is HBase a viable alternative? Obviously whatever one decides there is still HDFS, a good engine for Hive (sounds like many prefer TEZ although I am a Spark fan) and the ubiquitous YARN. Let me know your thoughts. Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>* http://talebzadehmich.wordpress.com