[ 
https://issues.apache.org/jira/browse/HIVE-8486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14174292#comment-14174292
 ] 

Chao commented on HIVE-8486:
----------------------------

1) I discussed with [~szehon] about this, and seems we cannot change the 
calculation now, since it will affect the way how the buckets are calculated. 
2) In the Spark branch, we are not doing anything to estimate the number of 
reducers. In MR, in case this number is not set, it will estimate and set it at 
runtime. This is done in {{MapRedTask}}. Tez also uses "Auto Reducer 
Parallelism" (see HIVE-7158) to control this. As result, I think we should do 
something to resolve this. 

> TPC-DS Query 96 parallelism is not set correcly
> -----------------------------------------------
>
>                 Key: HIVE-8486
>                 URL: https://issues.apache.org/jira/browse/HIVE-8486
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Brock Noland
>            Assignee: Chao
>
> When we run the query on a 20B we only have a parallelism factor of 1.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to