[ 
https://issues.apache.org/jira/browse/HIVE-28302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-28302:
----------------------------------
    Labels: pull-request-available  (was: )

> Let SUM UDF return NULL when all rows have non-numeric texts
> ------------------------------------------------------------
>
>                 Key: HIVE-28302
>                 URL: https://issues.apache.org/jira/browse/HIVE-28302
>             Project: Hive
>          Issue Type: Improvement
>          Components: UDF
>    Affects Versions: 4.0.0
>            Reporter: Shohei Okumiya
>            Assignee: Shohei Okumiya
>            Priority: Major
>              Labels: pull-request-available
>
> The ANSI standard says UDAF should return NULL when all inputs are NULL.
> Hive is very generous and accepts non-numeric texts as input for SUM. To give 
> some consistency to the generous specification, we believe `SUM(string_col)` 
> should behave in the same way as `SUM(CAST(string_col AS DOUBLE))`.
> However, Hive's SUM returns 0.0 in that case.
> {code:java}
> > SELECT SUM(CAST(null AS STRING)), SUM('invalid num'), SUM(CAST('invalid 
> > num' AS DOUBLE));
> +-------+------+-------+
> |  _c0  | _c1  |  _c2  |
> +-------+------+-------+
> | NULL  | 0.0  | NULL  |
> +-------+------+-------+ {code}
> We see some more discussions in https://github.com/apache/hive/pull/5091.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to