[ https://issues.apache.org/jira/browse/HIVE-16029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15884347#comment-15884347 ]
Edward Capriolo commented on HIVE-16029:
----------------------------------------

I do not think you should change this now. It will change the behavior of many applications. I am -1 on this in its current form.

I suggest you do this instead:

CollectSet(a) <- original behavior
CollectSet(a,true) <- allow nulls

This way you get your feature and the result set you want, and existing applications are not affected.

> COLLECT_SET and COLLECT_LIST does not return NULL in the result
> ---------------------------------------------------------------
>
>                 Key: HIVE-16029
>                 URL: https://issues.apache.org/jira/browse/HIVE-16029
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 2.1.1
>            Reporter: Eric Lin
>            Assignee: Eric Lin
>            Priority: Minor
>         Attachments: HIVE-16029.patch
>
>
> See the test case below:
> {code}
> 0: jdbc:hive2://localhost:10000/default> select * from collect_set_test;
> +---------------------+
> | collect_set_test.a  |
> +---------------------+
> | 1                   |
> | 2                   |
> | NULL                |
> | 4                   |
> | NULL                |
> +---------------------+
> 0: jdbc:hive2://localhost:10000/default> select collect_set(a) from collect_set_test;
> +---------------+
> |      _c0      |
> +---------------+
> | [1,2,4]       |
> +---------------+
> {code}
> The correct result should be:
> {code}
> 0: jdbc:hive2://localhost:10000/default> select collect_set(a) from collect_set_test;
> +---------------+
> |      _c0      |
> +---------------+
> | [1,2,null,4]  |
> +---------------+
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
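
For illustration only, the backward-compatible variant suggested in the comment can be modeled outside Hive. The Python sketch below is not Hive code and is not taken from the attached patch. The function name `collect_set`, the parameter `allow_nulls`, and the first-seen ordering of the result are all assumptions made for this example. It only shows the intended semantics: the one-argument call keeps dropping NULLs, and the optional flag opts in to keeping them.

```python
from typing import Hashable, Iterable, List, Optional


def collect_set(
    values: Iterable[Optional[Hashable]],
    allow_nulls: bool = False,
) -> List[Optional[Hashable]]:
    """Return the distinct values in first-seen order (ordering is an assumption).

    allow_nulls=False models the behavior reported in the issue: NULLs are dropped.
    allow_nulls=True models the proposed opt-in: a single None is kept.
    """
    seen = set()
    result = []
    for value in values:
        if value is None and not allow_nulls:
            continue  # default path: skip NULLs so existing callers see no change
        if value not in seen:
            seen.add(value)
            result.append(value)
    return result


# Same rows as the collect_set_test table in the issue, with NULL modeled as None.
rows = [1, 2, None, 4, None]
print(collect_set(rows))        # [1, 2, 4]
print(collect_set(rows, True))  # [1, 2, None, 4]
```

Because the flag defaults to the existing behavior, callers that use the one-argument form get the same result as before, which is the point of the suggestion.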