I am Ok with +1.

Having said that there is a merit IMO to add a matrix highlighting the
differences between real time and Continuous Processing (Continuous Mode)
to SPIP. Unless the assumption is that spark has abandoned the Continuous
Mode) altogether

*Feature           Real-time Processing (via Micro-batching)
 Continuous Processing (Continuous Mode)*
Underlying Model  optimized micro-batching (small, frequent batches)
Truly continuous, event-at-a-time (or near event-at-a-time) processing via
long-running tasks.
Latency Goal      Low latency (seconds to hundreds of milliseconds)
  Ultra-low latency (single-digit or sub-millisecond) for true real-time.
                  for perceived real-time.

Fault Tolerance   Highly robust, exactly-once processing guarantees
At-least-once processing is standard. Exactly-once for stateful operations
is more complex and depends on external systems (e.g., transactional
               sinks) or careful custom
implementation for both stateless and stateful operations

 etc


Dr Mich Talebzadeh,
Architect | Data Science | Financial Crime | Forensic Analysis | GDPR

   view my Linkedin profile
<https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>





On Sun, 1 Jun 2025 at 20:01, L. C. Hsieh <vii...@gmail.com> wrote:

> Hi all,
>
> I would like to start a vote on the new real-time mode in Apache Spark
> Structured Streaming.
>
> Discussion thread:
> https://lists.apache.org/thread/ovmfbzfkc3t9odvv5gs75fhpvdckn90f
> SPIP:
> https://docs.google.com/document/d/1CvJvtlTGP6TwQIT4kW6GFT1JbdziAYOBvt60ybb7Dw8/edit?tab=t.0#heading=h.ulas5788cm9t
> JIRA: https://issues.apache.org/jira/browse/SPARK-52330
>
> Please vote on the SPIP for the next 72 hours:
>
> [ ] +1: Accept the proposal as an official SPIP
> [ ] +0
> [ ] -1: I don’t think this is a good idea because …
>
> ---------------------------------------------------------------------
> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org
>
>

Reply via email to