Re: Spark and HDFS

Marcelo Vanzin Wed, 15 Jul 2015 17:46:46 -0700

On Wed, Jul 15, 2015 at 5:36 AM, Jeskanen, Elina <elina.jeska...@cgi.com>
wrote:


>  I have Spark 1.4 on my local machine and I would like to connect to our
> local 4 nodes Cloudera cluster. But how?
>
>
>
> In the example it says text_file = spark.textFile("hdfs://..."), but can
> you advise me in where to get this "hdfs://..." -address?
>

In 99% of the cases, you shouldn't need it. Just set "HADOOP_CONF_DIR" to a
directory containing your HDFS configuration, and just pass the path to the
files to process (without the "hdfs:..." prefix) to that API.

-- 
Marcelo

Re: Spark and HDFS

Reply via email to