Re: Spark scheduling, data locality

2015-03-23 Thread Hui WANG
Hello Zoltan, I'm a spark beginner but i think that the locality preferences should be prepared before the sending of tasks. One important element of a RDD is the metadata on the scheme and location of its partitions. Tasks created in the driver program should be based on this information. I'm al

Spark scheduling, data locality

2015-03-19 Thread Zoltán Zvara
I'm trying to understand the task scheduling mechanism of Spark, and I'm curious about where does locality preferences get evaluated? I'm trying to determine if locality preferences are fetchable before the task get serialized. A HintSet would be most appreciated! Have nice day! Zvara Zoltán m