Hi Gopal,

I am using Hive 2 on the Spark 1.3.1 engine.
OK, this is only a test table. What would be the best way to create this table in Hive in ORC format?

Thanks

Dr Mich Talebzadeh

LinkedIn: https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw

http://talebzadehmich.wordpress.com

On 16 March 2016 at 17:45, Gopal Vijayaraghavan <gop...@apache.org> wrote:

> > I love to see these ORC table optimization help but it is not obvious to
> > me under what circumstances they bear fruit.
>
> Are you using Tez or LLAP? Your explain plans are clearly missing the
> optimizations I've added as part of Stinger.next.
>
> https://github.com/apache/hive/blob/master/ql/src/test/results/clientpositive/tez/bucketpruning1.q.out#L64
>
> > I am trying to get the whole row for ID = 20. Pretty straightforward
> ...
> > TableScan
> ...
> > is very misleading as soon as I see it, it gives the impression that it
> > is going through the whole table.
>
> Because it *is* a table-scan and the ORC PPD/bloom indexes have
> false-positives - a pure index lookup does not need a subsequent Filter.
>
> FYI, if you're looking at simple "select *" with bucketing, you should
> vote/watch - https://issues.apache.org/jira/browse/HIVE-12379
>
> Cheers,
> Gopal