Several things happened:

1) By far the biggest issue was running well in the cloud.  HBase can be
run in the cloud, but it’s hard to do it well in a cost efficient way.
While each cloud does provide a native key/value store, these don’t all
have the advanced functionality that we need (coprocessors, a transaction
manager).  And we would have had to write an intermediate layer to handle
all the different cloud key/value interfaces.  In constrast, the three
major clouds all offer an option for a database Hive already supports.

2) Getting the transaction manager integrated proved challenging.  We
worked with Omid, but it ate a lot of cycles on the machines we used in our
tests.  Given that Omid and Tephra are both continuing to grow and be
developed this issue could probably have been overcome in time.

3) Consistent feedback that we got from users as we talked to them about
using HBase as the metastore was that they did not know how to administer
HBase.  Most Hive users are already familiar with an RDBMS and feel
comfortable administering one.  It was clear if we were going to include
HBase it would have to be completely embedded and managed automatically by
Hive.  This is doable, but a lot of work.

Daniel’s caching work is showing promising numbers.  Given the above issues
it makes more sense to pursue caching rather than keep going down the HBase
route.  This is particularly true in the cloud where we don’t always get
great response times from the cloud native databases.

Caching will radically improve our query planning performance.  But it
won’t address the test matrix issue.  By supporting four different
databases in production (Derby is only for test) we have to run all the
Hive tests on four different systems to assure ourselves it works.  I’m
trying to mitigate this with the Dockerfile work I’m doing in the
standalone-metastore.  It will make it easier to do that testing, but the
testing will still be required.

Alan.

Alan.



On Tue, Nov 7, 2017 at 3:23 PM, Alexander Kolbasov <ak...@cloudera.com>
wrote:

> I am wondering what happened with HMS on HBase project? I saw a bunch of
> slides telling about the progress and good POC results, but then looks like
> it was abandoned for some reason. Were there any serious issues discovered
> or benefits turned out to be insignificant?
>
> - Alex
>

Reply via email to