Can you clarify what you mean by "used for all stages"? OutputCommitCoordinator RPCs should only be initiated through SparkHadoopMapRedUtil.commitTask(), so while the OutputCommitCoordinator doesn't make a distinction between ShuffleMapStages and ResultStages there still should not be a performance penalty for this because the extra rounds of RPCs should only be performed when necessary.

On 8/11/15 2:25 AM, Jeff Zhang wrote:
As my understanding, OutputCommitCoordinator should only be necessary for ResultStage (especially for ResultStage with hdfs write), but currently it is used for all the stages. Is there any reason for that ?

--
Best Regards

Jeff Zhang


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@spark.apache.org
For additional commands, e-mail: dev-h...@spark.apache.org

Reply via email to