[ 
https://issues.apache.org/jira/browse/HIVE-9128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249201#comment-14249201
 ] 

Jimmy Xiang commented on HIVE-9128:
-----------------------------------

Played with this setting on spark. With query 28, the total time (ms, sum up 
all the durations for the corresponding method):
|| || with the option off || with the option on ||
| writing the plan | 1124 | 156 |
| reading the plan | 299 | 51 |

It looks like this option does give us some benefit. However, the gain is not 
very significant, since the query itself takes about 80+ seconds.


> Evaluate hive.rpc.query.plan performance [Spark Branch]
> -------------------------------------------------------
>
>                 Key: HIVE-9128
>                 URL: https://issues.apache.org/jira/browse/HIVE-9128
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Brock Noland
>            Assignee: Jimmy Xiang
>
> Tez uses 
> [hive.rpc.query.plan|https://github.com/apache/hive/blob/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java#L1874]
>  which is used in {{Utilities.java}}. Basically instead of writing the query 
> plan to HDFS, the query plan is placed in the JobConf object and then 
> de-serialized form there.
> We should do some evaluation to see which is more performant for us. We might 
> need to place some timings in {{Utilities}} to understand this if the PerfLog 
> doesn't have enough information today.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to