Re: “Pyspark.zip does not exist” using Spark in cluster mode with Yarn

2020-07-16 Thread Hulio andres
ent: Thursday, July 16, 2020 at 6:54 PM > From: "Davide Curcio" > To: "user@spark.apache.org" > Subject: “Pyspark.zip does not exist” using Spark in cluster mode with Yarn > > I'm trying to run some Spark script in cluster mode using Yarn but I've >

“Pyspark.zip does not exist” using Spark in cluster mode with Yarn

2020-07-16 Thread Davide Curcio
I'm trying to run some Spark script in cluster mode using Yarn but I've always obtained this error. I read in other similar question that the cause can be: "Local" set up hard-coded as a master but I don't have it HADOOP_CONF_DIR environment variable that's wrong inside spark-env.sh but it seems

Re: Using spark in cluster mode

2015-10-21 Thread Jacek Laskowski
Hi, Start here -> http://spark.apache.org/docs/latest/programming-guide.html#resilient-distributed-datasets-rdds and then hop to http://spark.apache.org/docs/latest/spark-standalone.html. Once done, be back with your questions. I think it's gonna help a lot. Pozdrawiam, Jacek -- Jacek Laskowski

Using spark in cluster mode

2015-10-20 Thread masoom alam
Dear all I want to setup spark in cluster mode. The problem is that each worker node is looking for a file to process.in its local directory.is it possible to setup some thing hdfs so that each worker node take its part of a file from hdfsany good tutorials for this? Thanks