Re: The build-in indexes in ORC file does not work.

Mich Talebzadeh Sat, 19 Mar 2016 00:33:17 -0700

Hi Gopal,

I am using Hive 2 on Spark 1.3.1 engine.


OK, This is only a test table. What would be the best way to create this
table in Hive as ORC format?

Thanks


Dr Mich Talebzadeh



LinkedIn * 
https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*



http://talebzadehmich.wordpress.com



On 16 March 2016 at 17:45, Gopal Vijayaraghavan <gop...@apache.org> wrote:

>
> > I love to see these ORC table optimization help but it is not obvious to
> >me under what circumstances they bare fruit.
>
> Are you using Tez or LLAP? Your explain plans are clearly missing the
> optimizations I've added as part of Stinger.next.
>
> https://github.com/apache/hive/blob/master/ql/src/test/results/clientpositi
> ve/tez/bucketpruning1.q.out#L64
>
>
> > I am trying to get the whole row for ID = 20. Pretty straight forward
> ...
> >                TableScan
> ...
> > is very misleading as soon as I see it, it gives the impression that it
> >is going through the whole table.
>
> Because it *is* a table-scan and the ORC PPD/bloom indexes have
> false-positives - a pure index lookup does not need a subsequent Filter.
>
> FYI, if you're looking at simple "select *" with bucketing, you should
> vote/watch - https://issues.apache.org/jira/browse/HIVE-12379
>
> Cheers,
> Gopal
>
>
>

Re: The build-in indexes in ORC file does not work.

Reply via email to