On Wed, Jul 15, 2015 at 5:36 AM, Jeskanen, Elina <elina.jeska...@cgi.com> wrote:
> I have Spark 1.4 on my local machine and I would like to connect to our > local 4 nodes Cloudera cluster. But how? > > > > In the example it says text_file = spark.textFile("hdfs://..."), but can > you advise me in where to get this "hdfs://..." -address? > In 99% of the cases, you shouldn't need it. Just set "HADOOP_CONF_DIR" to a directory containing your HDFS configuration, and just pass the path to the files to process (without the "hdfs:..." prefix) to that API. -- Marcelo