It seems You are missing HADOOP_HOME in the environment. As it says:
java.io.IOException: Could not locate executable *null*\bin\winutils.exe in
the Hadoop binaries.
That null is supposed to be your HADOOP_HOME.
Thanks
Best Regards
On Thu, Dec 18, 2014 at 7:10 PM, mj wrote:
>
> Hi,
>
> I'm try
Hi,
I'm trying to use pyspark to save a simple rdd to a text file (code below),
but it keeps throwing an error.
- Python Code -
items=["Hello", "world"]
items2 = sc.parallelize(items)
items2.coalesce(1).saveAsTextFile('c:/tmp/python_out.csv')
- Error --C:\Python27\py