Hi Manan,

Thanks for the KIP.
This is a good improvement.

Questions:
1. After reading the KIP, I still don't understand the difference between
"incremental mode" and "non-incremental mode".
From what I can see, they both run reassignment-batch-size reassignments at
a time.
What's the difference between them?
Could you explain more?
Some examples would help users understand the difference and how to choose
between them.


2. I see there are "INCREMENTAL_REASSIGNMENT_POLL_INTERVAL_MS" and
"reassignment-poll-interval-ms".
What's the difference between them?


Thank you,
Luke


On Mon, May 25, 2026 at 11:06 PM Manan Gupta <[email protected]> wrote:

> Hey TaiJuWu
>
> Thank you for reviewing the KIP, my responses are inline.
>
> > TJ00: If we have multiple batch requests, how do you handle a single
> batch failure?
> - If a submit step fails, the tool returns immediately with errors and does
> not enqueue the rest; partitions already submitted stay under the
> controller’s reassignment as they do today.
> - The process exits with a TerseException listing the failed partitions and
> the error message from the broker/controller (the same pattern as a
> single-shot execute when some alters fail).
>
> > TJ01: For a long-running operation, how can users tell that it is still
> running rather than hung?
> - Controller / cluster side: ongoing reassignments and replication are
> visible through metrics, kafka-reassign-partitions --list, and Admin / JMX.
> - Running --verify in another terminal shows progress toward the target.
> Batch wait is mostly quiet; incremental is a bit chattier. True progress is
> best observed from cluster state or --verify, not only from stdout during
> the wait loop.
>
> Thanks,
> Manan Gupta
>
> On Mon, May 25, 2026 at 6:06 PM TaiJu Wu <[email protected]> wrote:
>
> > Hi Manan,
> >
> > Thanks for the KIP. Just a few questions.
> >
> > TJ00: If we have multiple batch requests, how do you handle a single
> > batch failure?
> >
> > TJ01: For a long-running operation, how can users tell that it is still
> > running rather than hung?
> >
> > Thanks,
> > TaiJuWu
> >
> >
> >
> > On Mon, May 18, 2026 at 6:09 PM Manan Gupta <[email protected]> wrote:
> >
> > > Hey Kamal
> > >
> > > Thank you for your comments.
> > >
> > > > Should we have a configurable list poll interval?
> > > The current fixed interval of 500 ms should not degrade the controller,
> > > but I agree that operators should have an option to change this value.
> > > I updated the KIP to take another parameter,
> > > reassignment-poll-interval-ms, to override the default value of 500 ms.
> > >
> > > > Shall we extend the batching logic to the kafka-leader-election
> > > > script as well?
> > > Good point, I will pick this up in a separate KIP as a follow-up to
> > > this one.
> > >
> > > Thanks,
> > > Manan
> > >
> > > On Mon, May 18, 2026 at 2:52 PM Kamal Chandraprakash <
> > > [email protected]> wrote:
> > >
> > > > Hi Manan,
> > > >
> > > > Thanks for improving the user-facing tools! Overall LGTM. A few
> > > > questions:
> > > >
> > > > 1. Should we have a configurable list poll interval? With 500 ms, does
> > > > it poll the controller too often to list the currently running
> > > > reassignments for large partitions?
> > > > 2. Shall we extend the batching logic to the kafka-leader-election
> > > > script as well?
> > > > It will be useful when running with --all-topic-partitions.
> > > >
> > > > Thanks,
> > > > Kamal
> > > >
> > > >
> > > > On Mon, May 11, 2026 at 8:55 AM Manan Gupta <[email protected]> wrote:
> > > >
> > > > > Hello
> > > > >
> > > > > Gentle reminder to review the KIP.
> > > > >
> > > > > Thanks,
> > > > > Manan
> > > > >
> > > > > On Wed, May 6, 2026 at 7:52 PM Manan Gupta <[email protected]> wrote:
> > > > >
> > > > > > Hi all,
> > > > > >
> > > > > > This email starts the discussion thread for *KIP-1335: Bounded
> > > > > > concurrency for partition reassignment via
> > > > > > kafka-reassign-partitions.sh*.
> > > > > > The proposal adds optional reassignment-batch-size and
> > > > > > incremental parameters to kafka-reassign-partitions.sh so
> > > > > > operators can cap how many partition reassignments are submitted
> > > > > > or kept in flight at once using the existing Admin API.
> > > > > >
> > > > > > I would appreciate your initial thoughts and feedback on the
> > > > > > proposal.
> > > > > >
> > > > > > https://cwiki.apache.org/confluence/x/8ZAmGQ
> > > > > >
> > > > > > Thanks,
> > > > > > Manan
> > > > > >
> > > > >
> > > >
> > >
> >
>

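For readers following the TJ00 answer in the quoted thread, the fail-fast
behavior described there (stop at the first batch that reports errors, do not
enqueue the rest, leave already-submitted partitions alone) might be sketched
roughly as below. This is only an illustration, not the tool's code: the
names submit_in_batches, ReassignmentFailed, and the submit callback are
hypothetical, ReassignmentFailed merely stands in for the TerseException
mentioned in the thread, and the wait between batches is left out.

```python
from typing import Callable, Dict, List, Sequence


class ReassignmentFailed(Exception):
    """Hypothetical stand-in for the TerseException mentioned in the thread."""

    def __init__(self, errors: Dict[str, str], submitted: List[str]):
        self.errors = errors        # partition -> error message
        self.submitted = submitted  # partitions accepted before the failure
        details = "; ".join(f"{p}: {e}" for p, e in sorted(errors.items()))
        super().__init__(f"Reassignment failed for: {details}")


def submit_in_batches(
    partitions: Sequence[str],
    batch_size: int,
    submit: Callable[[List[str]], Dict[str, str]],
) -> List[str]:
    """Submit partitions in batches and stop at the first failing batch.

    `submit` is an assumed callback that sends one batch and returns a
    mapping of partition -> error message for the partitions that failed
    (empty when the whole batch was accepted).
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")

    submitted: List[str] = []
    for start in range(0, len(partitions), batch_size):
        batch = list(partitions[start:start + batch_size])
        errors = submit(batch)
        # Partitions that were accepted stay submitted; nothing is rolled back.
        submitted.extend(p for p in batch if p not in errors)
        if errors:
            # Fail fast: remaining batches are never sent.
            raise ReassignmentFailed(errors, submitted)
        # A real tool would wait here for the batch to finish before
        # continuing; that step is omitted from this sketch.
    return submitted
```

A caller would catch ReassignmentFailed, report the failed partitions from
its errors attribute, and use --verify or --list to follow the partitions
that were already accepted.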