Hello Victor,
thanks for reaching out. I am not sure it would be easy to implement what
you propose. The problem is that records might be partially processed
when an error happens, and before we can figure out what the correct
offset to commit is, KS needs to flush; but after we hit an error,
flushing cleanly might not be possible any longer.
Besides the above issue, w/o EOS (i.e., Kafka transactions) there are all
kinds of other scenarios that could lead to duplicate output or to
reprocessing an input record a second time. So even if we can implement
what you propose (we would need to investigate in more detail whether it
is possible or not), it could only reduce duplicates, not fully
eliminate them. Just want to make sure you are aware of this.
So if you are really interested in contributing such a feature, and you
can figure out how to do this correctly, then yes, please write a KIP
about it :) -- We can also try to help you with this investigation (it
might require some POC PR...); however, I am not sure how much time we
can find atm to support you, TBH.
However, I am wondering what kind of performance you need and why EOS
would not be able to deliver it. There are many configs, so maybe there
is a way to tune your app to make it work with EOS?
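For example (just a rough sketch; the exact values are assumptions and
depend on your workload), the knobs that usually matter most for EOS
throughput are the commit interval and producer batching:

  import java.util.Properties;
  import org.apache.kafka.clients.producer.ProducerConfig;
  import org.apache.kafka.streams.StreamsConfig;

  Properties props = new Properties();
  props.put(StreamsConfig.PROCESSING_GUARANTEE_CONFIG, StreamsConfig.EXACTLY_ONCE_V2);
  // with EOS, commit.interval.ms defaults to 100ms; a larger value means larger
  // transactions and less commit overhead (but more reprocessing after a failure)
  props.put(StreamsConfig.COMMIT_INTERVAL_MS_CONFIG, 1000);
  // producer batching helps to amortize the per-transaction overhead
  props.put(StreamsConfig.producerPrefix(ProducerConfig.LINGER_MS_CONFIG), 50);
  props.put(StreamsConfig.producerPrefix(ProducerConfig.BATCH_SIZE_CONFIG), 128 * 1024);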
-Matthias
On 9/3/25 10:50 AM, Victor Osorio wrote:
Hello everyone,
We’re currently using Kafka Streams to process transactional data with
*exactly-once semantics (EOS)*. However, for some of our workloads, we
require higher throughput, which makes EOS impractical.
To ensure data integrity, we rely
on UncaughtExceptionHandler and ProductionExceptionHandler to halt
stream processing upon any exception. This prevents data loss but
introduces a new challenge: when a thread stops due to an exception, it
doesn’t commit the records that were already successfully processed. As
a result, when the stream restarts, those records are reprocessed,
leading to duplication.
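To illustrate the setup (a rough sketch only; application id, bootstrap
servers, and the topology itself are placeholders):

  import java.util.Properties;
  import org.apache.kafka.streams.KafkaStreams;
  import org.apache.kafka.streams.StreamsBuilder;
  import org.apache.kafka.streams.StreamsConfig;
  import org.apache.kafka.streams.errors.DefaultProductionExceptionHandler;
  import org.apache.kafka.streams.errors.StreamsUncaughtExceptionHandler.StreamThreadExceptionResponse;

  Properties props = new Properties();
  props.put(StreamsConfig.APPLICATION_ID_CONFIG, "tx-processing-app");   // placeholder
  props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");   // placeholder
  props.put(StreamsConfig.PROCESSING_GUARANTEE_CONFIG, StreamsConfig.AT_LEAST_ONCE);
  // the default production exception handler already responds with FAIL,
  // so a failed send stops processing instead of being skipped
  props.put(StreamsConfig.DEFAULT_PRODUCTION_EXCEPTION_HANDLER_CLASS_CONFIG,
            DefaultProductionExceptionHandler.class);

  StreamsBuilder builder = new StreamsBuilder();
  // ... topology elided ...

  KafkaStreams streams = new KafkaStreams(builder.build(), props);
  // stop the whole application on any uncaught processing exception
  streams.setUncaughtExceptionHandler(
          exception -> StreamThreadExceptionResponse.SHUTDOWN_APPLICATION);
  streams.start();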
While reviewing the discussion around KIP-1033, I noticed the suggestion
to avoid exposing commit functionality in the Kafka Streams API
(https://lists.apache.org/thread/k4v0737tqjdnq5vl3yp9rjr4qzqoo306).
That makes sense in many contexts, but I’d like to revisit a related idea:
*Could we introduce a new shutdown mechanism, perhaps a “Graceful
Shutdown” API, that commits all successfully processed records while
skipping the one that caused the failure?*
This would allow us to maintain data integrity without sacrificing
throughput or introducing duplicates. I’m curious to hear your thoughts:
* Would this be possible to implement with current Kafka Streams APIs?
* Would it be possible, or desirable, to add this as a Kafka Streams
feature in a future release? If yes, we can open a KIP.
Looking forward to your insights and feedback.
Best regards,
Victor Osório