Hi,
How do we save data to hdfs using pyspark in "right" format.
I use:
counts = counts.saveAsTextFile("hdfs://localhost:1234//foo")
But when I look into the data... It is always in tuple format
(1245,23)
(1235,99)How do i specify output format in pyspark. Thanks
