Re: collecting lists in non-natural order / SORT BY on columns which are not part of the final result set

2015-06-13 Thread Robin Verlangen
Hi Michael, you can try using a WITH clause. Pseudocode: WITH input AS (SELECT colA, colB FROM table ORDER BY colA ASC) SELECT colB FROM input Best regards, Robin Verlangen

collecting lists in non-natural order / SORT BY on columns which are not part of the final result set

2015-06-13 Thread Michael Häusler
Hi there, imagine you have a table of time-series transactions, e.g.: CREATE TABLE foobar (actor BIGINT, ts BIGINT, action STRING); containing the rows: (1, 2000, bar), (1, 1000, foo), (2, 1500, foo). An interesting query would be to get a denormalized view on all actions of an
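The preview above is cut off, so the exact query being asked for is not visible here. Going by the thread subject, a plausible reading (an assumption, not the author's stated requirement) is: collect each actor's actions into a list ordered by ts, without ts appearing in the result. The sketch below illustrates that intended semantics in plain Python on the three example rows; the function name actions_by_actor is hypothetical, and this is not Hive code.

```python
from collections import defaultdict

# Example rows from the post, as (actor, ts, action).
rows = [
    (1, 2000, "bar"),
    (1, 1000, "foo"),
    (2, 1500, "foo"),
]


def actions_by_actor(rows):
    """Collect each actor's actions ordered by ts; ts is not returned.

    Assumption: this is the result the truncated post is after.
    """
    grouped = defaultdict(list)
    # Sort on (actor, ts): ts drives the order but is dropped from the output.
    for actor, _ts, action in sorted(rows, key=lambda r: (r[0], r[1])):
        grouped[actor].append(action)
    return dict(grouped)


result = actions_by_actor(rows)
print(result)
```

For actor 1 the list comes out in ts order (foo at 1000, then bar at 2000), not in the order the rows were listed, which is the "non-natural order" the subject line refers to.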

Re: Hive double issues while moving around RC files between clusters

2015-06-13 Thread Robin Verlangen
One thing I found in the changelog was https://issues.apache.org/jira/browse/HIVE-7041, which sounds like it might be related to this. I don't use any byte data types in the structure, so it would be hard to verify those. Best regards, Robin Verlangen

Hive double issues while moving around RC files between clusters

2015-06-13 Thread Robin Verlangen
Hi there, I was copying RC files from a CDH Hadoop 2.0 cluster to a new HDP Hadoop 2.6 cluster. After creating a new table with RC file storage and LOCATION pointing to the right place, I can query all columns except the ones that are double. I tried querying with Hiv

Re: nested join issue

2015-06-13 Thread Gautam
To clarify, HIVE-8435 introduced the regression. Turning that feature off fixes the issue, so we still need to fix that optimization so that it no longer produces this incorrect result. On Fri, Jun 12, 2015 at 11:31 PM, Gautam wrote: > Found that turning off hive.optimize.remove.identity.project (ref: HIVE-