Can you please share the native support of data formats available with Spark.
Two i can see are parquet and textFile sc.parquetFile sc.textFile I see that Hadoop Input Formats (Avro) are having issues, that i faced in earlier threads and seems to be well known. https://issues.apache.org/jira/browse/SPARK-993 https://issues.apache.org/jira/browse/SPARK-1018 Hence i want to know which data formats have full support in Spark. I can think of moving to those data formats. -- Deepak