Hi,

I notice that Impala is rarely mentioned these days.  I may be missing
something. However, I gather it is coming to end now as I don't recall many
use cases for it (or customers asking for it). In contrast, Hive has hold
its ground with the new addition of Spark and Tez as execution engines,
support for ACID and ORC and new stuff in Hive 2. In addition provided a
good choice for its metastore it scales well.

If Hive had the ability (organic) to have local variable and stored
procedure support then it would be top notch Data Warehouse. Given its
metastore, I don't see any technical reason why it cannot support these
constructs.

I was recently asked to comment on migration from commercial DWs to Big
Data (primarily for TCO reason) and really could not recall any better
candidate than Hive. Is HBase a viable alternative? Obviously whatever one
decides there is still HDFS, a good engine for Hive (sounds like many
prefer TEZ although I am a Spark fan) and the ubiquitous YARN.

Let me know your thoughts.


Dr Mich Talebzadeh



LinkedIn * 
https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*



http://talebzadehmich.wordpress.com

Reply via email to