Xuefu Zhang created HIVE-5878: --------------------------------- Summary: Hive standard avg UDAF returns double as the return type for some exact input types Key: HIVE-5878 URL: https://issues.apache.org/jira/browse/HIVE-5878 Project: Hive Issue Type: Bug Components: Types, UDF Affects Versions: 0.12.0 Reporter: Xuefu Zhang Assignee: Xuefu Zhang
For standard, no-partial avg result, hive currently returns double as the result type. {code} hive> desc test; OK d int None Time taken: 0.051 seconds, Fetched: 1 row(s) hive> explain select avg(`d`) from test; ... Reduce Operator Tree: Group By Operator aggregations: expr: avg(VALUE._col0) bucketGroup: false mode: mergepartial outputColumnNames: _col0 Select Operator expressions: expr: _col0 type: double {code} However, exact types including integers and decimal should yield exact type. Here is what MySQL does: {code} mysql> desc test; +-------+--------------+------+-----+---------+-------+ | Field | Type | Null | Key | Default | Extra | +-------+--------------+------+-----+---------+-------+ | i | int(11) | YES | | NULL | | | b | tinyint(1) | YES | | NULL | | | d | double | YES | | NULL | | | s | varchar(5) | YES | | NULL | | | dd | decimal(5,2) | YES | | NULL | | +-------+--------------+------+-----+---------+-------+ mysql> desc test62; +-------+---------------+------+-----+---------+-------+ | Field | Type | Null | Key | Default | Extra | +-------+---------------+------+-----+---------+-------+ | sum_t | decimal(14,4) | YES | | NULL | | +-------+---------------+------+-----+---------+-------+ 1 row in set (0.00 sec) {code} -- This message was sent by Atlassian JIRA (v6.1#6144)