Spark job's application tracking URL not accessible from docker container

2017-10-31 Thread Divya Narayan
We have streaming jobs and batch jobs running inside the docker containers with spark driver launched within the container Now when we open the Resource manager UI http://:8080, and try to access the application tracking URL of any running job, the page times out with error: HTTP ERROR 500 Probl

unsubscribe

2017-12-12 Thread Divya Narayan

Read Avro Data using Spark Streaming

2018-11-02 Thread Divya Narayan
Hi, I produced avro data to kafka topic using schema registry and now I want to use spark streaming to read that data and do some computation in real time. Can some one please give a sample code for doing that . I couldn't find any working code online. I am using spark version 2.2.0 and spark-stre

hadoop replication property from spark code not working

2019-06-26 Thread Divya Narayan
Hi, I have a use case for which I want to override the default hdfs replication factor from my spark code. For this I have set the hadoop replication like this: val sc = new SparkContext(conf) sc.hadoopConfiguration.set('dfs.replication','1'). Now my spark job runs as a cron job in some specific