Sandy Ryza created SPARK-51727:
----------------------------------

             Summary: SPIP: Declarative Pipelines
                 Key: SPARK-51727
                 URL: https://issues.apache.org/jira/browse/SPARK-51727
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 4.1.0
            Reporter: Sandy Ryza

The SPIP proposes a new abstraction that combines multiple transformations into a single declarative dataflow graph, to simplify the development and management of data pipelines.

The approach extends Spark's lazy, declarative execution model beyond single queries, to pipelines that keep multiple datasets up to date. This reduces the cognitive overhead of tracking dependencies between datasets and the need to orchestrate them manually.

Declarative pipelines can include both batch and streaming computations, leveraging Spark Streaming for stream processing and new materialized view syntax for batch processing.

SPIP doc: https://docs.google.com/document/d/1PsSTngFuRVEOvUGzp_25CQL1yfzFHFr02XdMfQ7jOM4/edit?tab=t.0
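
To make the "declarative dataflow graph" idea concrete, here is a minimal sketch in plain Python. It is not the API proposed in the SPIP and it does not use Spark at all. The `Pipeline` class, the `dataset` decorator, and the example dataset names are all hypothetical, chosen only for illustration. The sketch assumes that each dataset is declared as a function whose parameter names identify its upstream datasets, so the execution order can be derived from the declarations instead of being written out by hand.

```python
import inspect
from graphlib import TopologicalSorter


class Pipeline:
    """Toy declarative pipeline (hypothetical, not the SPIP's API)."""

    def __init__(self):
        self._defs = {}  # dataset name -> function that computes it

    def dataset(self, fn):
        # Register a dataset. Its parameter names are taken to be the
        # names of the datasets it reads from (an assumption of this sketch).
        self._defs[fn.__name__] = fn
        return fn

    def dependencies(self):
        # Build the dataflow graph: dataset name -> set of upstream names.
        return {
            name: set(inspect.signature(fn).parameters)
            for name, fn in self._defs.items()
        }

    def run(self):
        # Derive a valid execution order from the graph, then compute each
        # dataset once, passing in the results of its upstream datasets.
        order = list(TopologicalSorter(self.dependencies()).static_order())
        results = {}
        for name in order:
            fn = self._defs[name]  # KeyError if an upstream was never declared
            args = [results[p] for p in inspect.signature(fn).parameters]
            results[name] = fn(*args)
        return order, results


pipeline = Pipeline()


# Datasets are deliberately declared out of order: the graph, not the
# declaration order, decides what runs first.
@pipeline.dataset
def daily_totals(cleaned_orders):
    totals = {}
    for day, amount in cleaned_orders:
        totals[day] = totals.get(day, 0) + amount
    return totals


@pipeline.dataset
def cleaned_orders(raw_orders):
    return [(day, amount) for day, amount in raw_orders if amount > 0]


@pipeline.dataset
def raw_orders():
    return [("mon", 10), ("mon", -1), ("tue", 5)]


order, results = pipeline.run()
print(order)                    # ['raw_orders', 'cleaned_orders', 'daily_totals']
print(results["daily_totals"])  # {'mon': 10, 'tue': 5}
```

The point of the sketch is only the division of labor: the author states what each dataset is and what it reads, and the framework works out the order. How the SPIP itself expresses datasets, refreshes them incrementally, or mixes batch and streaming is described in the linked document, not here.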