I'm pretty sure my problem is related to this unresolved bug regarding files with size zero:
https://issues.apache.org/jira/browse/SPARK-1960 -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/EOFException-when-I-list-all-files-in-hdfs-directory-tp10648p10649.html Sent from the Apache Spark User List mailing list archive at Nabble.com.