ty.
Thanks for your help, guys.
--Ma Chong
- Original Message -
From: Mridul Muralidharan
To: Kay Ousterhout
Cc: MaChong , dev
Subject: Re: Problems with spark.locality.wait
Date: November 14, 2014, 04:53
In the specific example stated, the user had two tasksets, if I
understood right ... the first taskset reads off the db (dfs in your
example), does some filtering, etc., and caches the result.
The second works off the cached data (which now carries process-local
locality preferences) to do map, group, etc.
The ta
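For concreteness, here is a minimal Scala sketch of the two-taskset pattern described above. The input path, the filter, and the grouping are placeholders of mine, not the reporter's actual job:

import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.SparkContext._ // pair-RDD functions on pre-1.3 Spark

object LocalityExample {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("locality-example"))

    // First taskset: read from HDFS (preferred locations are NODE_LOCAL /
    // RACK_LOCAL), filter, and cache the surviving records.
    val cached = sc.textFile("hdfs:///path/to/input")
                   .filter(_.nonEmpty)
                   .cache()
    cached.count() // materialize the cache

    // Second taskset: operates on the cached blocks, so its preferred
    // locations are PROCESS_LOCAL (the executors holding the blocks).
    val counts = cached.map(line => (line.take(1), 1L))
                       .reduceByKey(_ + _)
    counts.collect().foreach(println)

    sc.stop()
  }
}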
This sounds like it may be exactly the problem we've been having (and about
which I recently posted on the user list).
Is there any way of monitoring its attempts to wait, giving up, and trying
another level?
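One crude way to watch this, as far as I can tell (not an official API): the TaskSetManager logs the locality level it assigned to each launched task, so raising just that logger's verbosity shows when a task set starts handing out NODE_LOCAL or ANY tasks instead of PROCESS_LOCAL ones. A sketch, assuming the stock log4j 1.x setup Spark ships with:

import org.apache.log4j.{Level, Logger}

// Surface the scheduler's per-task locality decisions without turning on
// verbose logging globally. The logger name assumes the class lives at
// org.apache.spark.scheduler.TaskSetManager.
Logger.getLogger("org.apache.spark.scheduler.TaskSetManager").setLevel(Level.INFO)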
In general, I'm trying to figure out why we can have repeated identical
jobs, the firs
Hi Mridul,
In the case Shivaram and I saw, and based on my understanding of Ma Chong's
description, I don't think that completely fixes the problem.
To be very concrete, suppose your job has two tasks, t1 and t2, and they
each have input data (in HDFS) on h1 and h2, respectively, and that h1 and
Instead of setting spark.locality.wait, try setting the individual
locality waits specifically.
Namely, set spark.locality.wait.PROCESS_LOCAL to a high value (so that
process-local tasks are always scheduled when the task set has
process-local tasks).
Set spark.locality.wait.NODE_LOCAL and spark.locality.wait.RACK_LOCAL to lower values.
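Something like the following sketch, say, on the SparkConf. Only the config keys come from the advice above; the app name and the numbers (milliseconds) are arbitrary placeholders to illustrate the shape, not tuned recommendations:

import org.apache.spark.SparkConf

val conf = new SparkConf()
  .setAppName("locality-wait-tuning")
  // Keep waiting a long time for PROCESS_LOCAL slots, so tasksets that
  // read cached data are not demoted to a lower locality level too early.
  .set("spark.locality.wait.PROCESS_LOCAL", "600000")
  // Fall through quickly for tasksets whose best level is only NODE_LOCAL
  // or RACK_LOCAL (e.g. the taskset reading from HDFS).
  .set("spark.locality.wait.NODE_LOCAL", "3000")
  .set("spark.locality.wait.RACK_LOCAL", "1000")

The same keys can also go in spark-defaults.conf or be passed with --conf on the command line.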
Hi,
Shivaram and I stumbled across this problem a few weeks ago, and AFAIK
there is no nice solution. We worked around it by avoiding jobs with tasks
that have two locality levels.
To fix this problem, we really need to fix the underlying problem in the
scheduling code, which currentl