On Tue, Jun 20, 2017 at 10:12 AM, Edward Capriolo <edlinuxg...@gmail.com> wrote:
> It is whack that two optimized row columnar formats exists and each > respective project (hive/impala) has good support for one and lame/no > support for the other. > We have two similar formats because they were designed at roughly the same time by different teams with similar, but not identical goals. Part of having a thriving ecosystem is that there are competitors, which creates some user confusion, but makes the ecosystem stronger. (Can someone please make a competitor for Oozie? *grin*) Hive and LLAP do support Parquet precisely because the developers want to be able to process everyone's data. The Impala project is free to make their own choices about what to work on. .. Owen