Hi All I believe that there is no lineage between datasets. Consider this case:
val people = spark.read.parquet("...").as[Person] val ageGreatThan30 = people.filter("age > 30") Since the second DS can push down the condition, they are obviously different logical plans and hence are different physical plan. What I understanding is right? Thanks Chang