[ https://issues.apache.org/jira/browse/HIVE-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13905905#comment-13905905 ]
Remus Rusanu commented on HIVE-6459: ------------------------------------ HIVE-6345 just got in, which adds the decimal support for vectorized aggregates, including AVG. Is probably going to conflict with your patch, as vectorized AVG must match the intermediate sum (p,s). If necessary, I will look at your patch tomorrow (I'm on UTC+2) and see how it needs to consider the vectorized aggregate code (it should be a minor change). > Change the precison/scale for intermediate sum result in the avg() udf > ----------------------------------------------------------------------- > > Key: HIVE-6459 > URL: https://issues.apache.org/jira/browse/HIVE-6459 > Project: Hive > Issue Type: Improvement > Components: UDF > Affects Versions: 0.13.0 > Reporter: Xuefu Zhang > Assignee: Xuefu Zhang > Attachments: HIVE-6459.patch > > > The avg() udf, when applied to a decimal column, selects the precision/scale > of the intermediate sum field as (p+4, s+4), which is the same for the > precision/scale of the avg() result. However, the additional scale increase > is unnecessary, and the problem of data overflow may occur. The requested > change is that for the intermediate sum result, the precsion/scale is set to > (p+10, s), which is consistent to sum() udf. The avg() result still keeps its > precision/scale. -- This message was sent by Atlassian JIRA (v6.1.5#6160)