Lineage between Datasets

Chang Chen Wed, 12 Apr 2017 03:04:05 -0700

Hi All

I believe that there is no lineage between datasets. Consider this case:


val people = spark.read.parquet("...").as[Person]

val ageGreatThan30 = people.filter("age > 30")

Since the second DS can push down the condition, they are obviously
different logical plans and hence are different physical plan.

What I understanding is right?

Thanks
Chang

Lineage between Datasets

Reply via email to