Hi all,

We have written a blog article that compares Hive 4 on MR3 and Trino 453
(the latest release of Trino) using the 10TB TPC-DS benchmark. While it
does not evaluate vanilla Hive, the results might be useful to those
interested in the overall speed of Hive 4 in LLAP mode.

https://www.datamonad.com/post/2024-08-01-hive-4.0-performance-1.11/

Thanks,

--- Sungwoo


On Wed, Jul 31, 2024 at 12:00 AM David Engel <da...@istwok.net> wrote:

> Congratulations, Sungwoo.  I look forward to trying out Hive 4.0.0 on
> MR3 when my time allows.
>
> David
>
> On Tue, Jul 30, 2024 at 11:29:02PM +0900, Sungwoo Park wrote:
> > Hi all,
> >
> > We would like to announce the release of Hive 4.0.0 on MR3. It is based on
> > Hive 4.0.0 (together with 17 commits from hive-4.0.1-must) and replaces Tez
> > with a new execution engine, MR3 (version 1.11). We have also released Hive
> > 3.1.3 on MR3, which backports about 800 patches to branch-3.1.
> >
> > The git repo for Hive 4.0.0 on MR3 is available at:
> > https://github.com/mr3project/hive-mr3/tree/master4.0
> >
> > Compared with Apache Hive, Hive on MR3 (Hive-MR3) has several new features.
> >
> > - Hive-MR3 is easy to set up because it automatically achieves the speed of
> > LLAP without requiring any extra steps. LLAP IO cache is also easy to
> > enable.
> > - Hive-MR3 can run directly on Kubernetes. On public clouds, you can take
> > advantage of autoscaling and spot instances. We provide shell scripts, Helm
> > charts, and TypeScript code for executing Hive-MR3 on Kubernetes.
> > - Hive-MR3 can run in standalone mode (similarly to Spark standalone mode
> > and Presto/Trino).
> > - Hive-MR3 supports Java 17.
> > - A single master manages all jobs while containers are shared by all
> > running jobs. As a result, Hive-MR3 yields much higher throughput for
> > concurrent batch jobs.
> >
> > All the documentation on Hive-MR3 is at: https://mr3docs.datamonad.com/
> >
> > Thanks,
> >
> > --- Sungwoo
>
> --
> David Engel
> da...@istwok.net
>
