1) What is the best way to convert data from Avro to Parquet so that it can be read and processed later?
2) Will processing performance (join, reduceByKey) be better if both datasets are in Parquet format, compared to Avro + Sequence files?

-- Deepak