it seems union should work for this scenario in part C, try to use: output_a union output_b
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-execution-plan-tp10482p10491.html Sent from the Apache Spark User List mailing list archive at Nabble.com.