I want to read Snappy-compressed files in Spark. In the Spark shell, val df = spark.read.textFile("hdfs:// path") runs without error, but when I then call df.show(10, false) or something similar, the output is binary data mixed with real text. How do I read the decompressed file contents in Spark? I can build a custom DataFrame reader if someone can guide or nudge me in the right direction.
- read snappy compressed files in spark Ricky
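Binary data mixed with readable text usually suggests that the bytes were read without being decompressed. As far as I understand it, Spark's text reader picks a Hadoop codec from the file extension, so a file written with Hadoop's Snappy codec and named with a .snappy suffix may already decompress through spark.read.textFile. The sketch below covers two other cases and is only a starting point under stated assumptions: the object and method names are made up for illustration, the path is supplied by the caller, the first method assumes the files are really Parquet with Snappy-compressed pages (names such as part-00000.snappy.parquet), and the second assumes each file is a single raw Snappy block of UTF-8 text that fits in one executor's memory. It relies on the snappy-java library (org.xerial.snappy), which normally ships with Spark.

```scala
import java.nio.charset.StandardCharsets

import org.apache.spark.sql.{DataFrame, Dataset, SparkSession}
import org.xerial.snappy.Snappy

// Hypothetical helper object; none of these names come from the question.
object SnappyReadSketch {

  // Assumption: the files are Parquet with Snappy-compressed pages.
  // The Parquet reader handles the decompression itself.
  def readSnappyParquet(spark: SparkSession, path: String): DataFrame =
    spark.read.parquet(path)

  // Assumption: each file is one raw Snappy block containing UTF-8 text.
  // Every file is loaded whole into memory, so this suits small files only.
  def readRawSnappyText(spark: SparkSession, path: String): Dataset[String] = {
    import spark.implicits._
    spark.sparkContext
      .binaryFiles(path) // RDD of (file name, PortableDataStream)
      .flatMap { case (_, stream) =>
        // Decompress the whole file, then split it into lines.
        val decompressed = Snappy.uncompress(stream.toArray())
        new String(decompressed, StandardCharsets.UTF_8).split("\n")
      }
      .toDS()
  }
}
```

In the Spark shell this could be tried as SnappyReadSketch.readRawSnappyText(spark, path).show(10, false), with path pointing at the input. If Snappy.uncompress throws an exception, the file is probably in a different container format (for example the Snappy framing format or Hadoop's block format), and the raw-block assumption does not hold.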