I’m using the stock Spark interpreter that comes with Zeppelin 0.8.0. I’ve successfully run Spark jobs on a local standalone cluster.
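For reference, the paragraph that hangs is essentially the following two lines (the file path is the one that appears in the interpreter log quoted below):

```
// Minimal Zeppelin paragraph that reproduces the hang:
// read a text file and print the first 10 lines.
// sc is the SparkContext the Spark interpreter predefines.
val batchData = sc.textFile("/tmp/earthquake/GEM-GHEC-v1_2.txt")
batchData.take(10).foreach(println)
```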
Nabeel

> On Nov 24, 2018, at 00:40, JAIN, RAHUL <rj3...@att.com> wrote:
>
> Does the job work when your Spark master is configured as local in the
> Zeppelin Spark interpreter config? If that works well, then your real issue
> may be in your Spark/YARN cluster.
>
> Also, have you tried doing a spark-submit manually for a similar job on the
> command line, pointing to the cluster?
>
> -Rahul
>
> On 11/23/18, 2:28 AM, "Nabeel Imtiaz" <nimti...@gmail.com> wrote:
>
> Nothing much useful. The following are the interpreter logs I can see before
> the job just hangs:
>
> INFO [2018-11-23 14:25:58,517] ({pool-2-thread-3} SchedulerFactory.java[jobStarted]:109) - Job 20181123-112240_1827913615 started by scheduler interpreter_34089707
> INFO [2018-11-23 14:25:59,241] ({pool-2-thread-3} FileInputFormat.java[listStatus]:253) - Total input paths to process : 1
> INFO [2018-11-23 14:25:59,299] ({pool-2-thread-3} Logging.scala[logInfo]:54) - Starting job: take at <console>:28
> INFO [2018-11-23 14:25:59,317] ({dag-scheduler-event-loop} Logging.scala[logInfo]:54) - Got job 0 (take at <console>:28) with 1 output partitions
> INFO [2018-11-23 14:25:59,319] ({dag-scheduler-event-loop} Logging.scala[logInfo]:54) - Final stage: ResultStage 0 (take at <console>:28)
> INFO [2018-11-23 14:25:59,320] ({dag-scheduler-event-loop} Logging.scala[logInfo]:54) - Parents of final stage: List()
> INFO [2018-11-23 14:25:59,323] ({dag-scheduler-event-loop} Logging.scala[logInfo]:54) - Missing parents: List()
> INFO [2018-11-23 14:25:59,328] ({dag-scheduler-event-loop} Logging.scala[logInfo]:54) - Submitting ResultStage 0 (/tmp/earthquake/GEM-GHEC-v1_2.txt MapPartitionsRDD[1] at textFile at <console>:25), which has no missing parents
>
> Nabeel
>
>> On Nov 23, 2018, at 12:33 PM, 王刚 <zjuwa...@gmail.com> wrote:
>>
>> Is there any useful information in your local Spark process log?
>>
>>> On Nov 23, 2018, at 4:20 PM, Nabeel Imtiaz <nimti...@gmail.com> wrote:
>>>
>>> Hi,
>>>
>>> When I try to simply take the first 10 lines of a file (e.g.
>>> ```batchData.take(10).foreach(println _)```) from the SparkContext, the
>>> paragraph hangs.
>>>
>>> If I inspect the job in the Spark console, it shows the job in PENDING
>>> state. I checked that I have more than enough memory available in the
>>> system.
>>>
>>> Is it a known issue? Any fixes or workarounds?
>>>
>>> Nabeel
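P.S. To rule out the cluster itself, the manual check Rahul suggested would look roughly like this (a sketch; the master host and port are placeholders for my real values). Start a shell pointed at the standalone master with `spark-shell --master spark://<master-host>:7077`, then run:

```
// First, check whether any executors registered with this application.
// getExecutorMemoryStatus includes the driver, so a healthy standalone
// cluster should show more than one entry here; a size of 1 suggests
// no executors were ever granted to the app.
println(s"Executor entries: ${sc.getExecutorMemoryStatus.size}")

// Then run the same work the Zeppelin paragraph does. If this also sits
// in PENDING in the Spark UI, the problem is in the cluster, not Zeppelin.
val batchData = sc.textFile("/tmp/earthquake/GEM-GHEC-v1_2.txt")
batchData.take(10).foreach(println)
```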