Hi Manan,

LC1: Thanks for the explanation. It's clear to me now.
I think we should also put this example and the "How to choose" part in the
KIP.

Some more questions:
LC3. How does the batch mode know that all N partitions are completed and
then start the next batch?
It looks like we don't poll the status when in batch mode. How do we know
that?

LC4. What will it show when some partitions are still waiting to be
progressed?
Currently, the --verify only shows "is completed" or "is still in
progress".
Should we have an output for the partitions that are sitting in the batch
queue?

LC5. As you've pointed out, there could be a possibility that it will poll
indefinitely.
Why can't we set a timer for it?
Any concerns about it?

LC6. "reassignment-poll-interval-ms" default to 500ms is too aggressive.
I think from users' perspective, any interval < 3 seconds or 5 seconds is
considered acceptable.
So could we increase it to at least 1 second?

Thank you,
Luke

On Mon, Jun 1, 2026 at 3:50 PM Manan Gupta <[email protected]> wrote:

> Hey Luke
> Thank you for reviewing the proposal.
>
> LC1:
> Please excuse me if my explanation of the two different modes was unclear.
>
> In non-incremental mode the tool walks the plan in steps. Each step submits
> up to N partition reassignments, then waits until every partition in that
> step has finished before it opens the next step. The slowest partition in
> the current step holds up the entire next step.
>
> In incremental mode N is not “how big each step is.” It is how many
> partition reassignments from this plan may be active at the same time. The
> tool keeps refilling up to N: whenever any single partition completes, it
> can start the next one from the queue. There is no rule that the whole
> group of N must finish together before new work starts.
>
> Example: 10 partitions in sorted order P1 through P10, N equals 3.
>
> Non-incremental: Step one submits P1 P2 P3 and waits until all three are
> done. Step two submits P4 P5 P6 and waits until all three are done. Step
> three submits P7 P8 P9 and waits until all three are done. Step four
> submits P10 only. If P3 is slow, P4 cannot start until P3 finishes, even if
> P1 and P2 are already done.
>
> Incremental: The tool first submits P1 P2 P3 so three reasginemnts are
> active. If P2 finishes first, it can submit P4 while P1 and P3 are still
> running, still keeping three active when possible. It continues that way
> until every partition in the plan has been submitted and the in-flight work
> drains according to the tool semantics. If P3 is slow, P4 can still start
> as soon as some other slot frees up.
>
> How to choose: use non-incremental if you want clear steps and a strict
> “this whole batch finished before the next batch starts” story. Use
> incremental if you want steadier utilization when finish times differ and
> you do not want one slow partition to block starting unrelated partitions
> beyond the cap of N at once.
>
> LC2:
> Both these values are the same, I have updated the KIP to reflect that now.
>
> Regards
> Manan Gupta
>
>
> On Mon, Jun 1, 2026 at 9:52 AM Luke Chen <[email protected]> wrote:
>
> > Hi Manan,
> >
> > Thanks for the KIP.
> > This is a good improvement.
> >
> > Questions:
> > 1. After reading the KIP, I still don't understand the difference between
> > "incremental mode" and "non-incremental mode".
> > From what I can see is that they both run with reassignment-batch-size
> once
> > time.
> > What's the difference between them?
> > Could you explain more?
> > Maybe some examples would be helpful to help users know the difference
> and
> > how they choose them.
> >
> >
> > 2. I see there are "INCREMENTAL_REASSIGNMENT_POLL_INTERVAL_MS" and
> > "reassignment-poll-interval-ms".
> > What's the difference between them?
> >
> >
> > Thank you,
> > Luke
> >
> >
> > On Mon, May 25, 2026 at 11:06 PM Manan Gupta <[email protected]>
> wrote:
> >
> > > Hey TaiJuWu
> > >
> > > Thank you for reviewhing the KIP, my response is inline.
> > >
> > > > TJ00: If we have multiple batch requests, how do you handle single
> > batch
> > > failure?
> > > - If a submit step fails, the tool returns immediately with errors and
> > does
> > > not enqueue the rest; partitions already submitted stay under the
> > > controller’s reassignment as they do today.
> > > - The process exits with a TerseException listing the failed partitions
> > and
> > > the error message from the broker/controller (the same pattern as a
> > > single-shot execute when some alters fail).
> > >
> > > > TJ01: If there is a long time operation, how can the users know it
> > still
> > > running instead of hang?
> > > - Controller / cluster side: ongoing reassignments and replication
> > > (metrics, kafka-reassign-partitions --list, Admin / JMX).
> > > - verify in another terminal shows progress toward the target.
> > > Batch wait is mostly quiet; incremental is a bit chattier; true
> progress
> > is
> > > best observed from cluster state or --verify, not only from stdout
> during
> > > the wait loop.
> > >
> > > Thanks,
> > > Manan Gupta
> > >
> > > On Mon, May 25, 2026 at 6:06 PM TaiJu Wu <[email protected]> wrote:
> > >
> > > > Hi Manan,
> > > >
> > > > Thanks for the KIP, just for some question.
> > > >
> > > > TJ00: If we have multiple batch requests, how do you handle single
> > batch
> > > > failure?
> > > >
> > > > TJ01: If there is a long time operation, how can the users know it
> > still
> > > > running instead of hang?
> > > >
> > > > Thanks,
> > > > TaiJuWu
> > > >
> > > >
> > > >
> > > > Manan Gupta <[email protected]> 於 2026年5月18日週一 下午6:09寫道:
> > > >
> > > > > Hey Kamal
> > > > >
> > > > > Thank you for your comments.
> > > > >
> > > > > > Should we have a configurable list poll interval?
> > > > > The current fixed interval of 500ms should not degrade the
> controller
> > > > but I
> > > > > agree that operators should have an option to change this value,
> > > updated
> > > > > the KIP to also take another parameter
> reassignment-poll-interval-ms
> > to
> > > > > update the default value from 500 ms.
> > > > >
> > > > > > Shall we extend the batching logic to also kafka-leader-election
> > > > script?
> > > > > Good point, I will pick this up as a separate KIP as a followup to
> > this
> > > > > KIP.
> > > > >
> > > > > Thanks,
> > > > > Manan
> > > > >
> > > > > On Mon, May 18, 2026 at 2:52 PM Kamal Chandraprakash <
> > > > > [email protected]> wrote:
> > > > >
> > > > > > Hi Manan,
> > > > > >
> > > > > > Thanks for improving the user-facing tools! Overall LGTM. Few
> > > > questions:
> > > > > >
> > > > > > 1. Should we have a configurable list poll interval? With 500ms,
> > does
> > > > it
> > > > > > poll the controller often to list the currently running
> > reassignments
> > > > for
> > > > > > large partitions?
> > > > > > 2. Shall we extend the batching logic to also
> kafka-leader-election
> > > > > script?
> > > > > > It will be useful when running with --all-topic-partitions.
> > > > > >
> > > > > > Thanks,
> > > > > > Kamal
> > > > > >
> > > > > >
> > > > > > On Mon, May 11, 2026 at 8:55 AM Manan Gupta <
> [email protected]>
> > > > > wrote:
> > > > > >
> > > > > > > Hello
> > > > > > >
> > > > > > > Gentle reminder to review the KIP.
> > > > > > >
> > > > > > > Thanks,
> > > > > > > Manan
> > > > > > >
> > > > > > > On Wed, May 6, 2026 at 7:52 PM Manan Gupta <
> [email protected]
> > >
> > > > > wrote:
> > > > > > >
> > > > > > > > Hi all,
> > > > > > > >
> > > > > > > > This email starts the discussion thread for *KIP-1335:
> Bounded
> > > > > > > > concurrency for partition reassignment via
> > > > > > kafka-reassign-partitions.sh*.
> > > > > > > > The proposal adds optional reassignment-batch-size and
> > > incremental
> > > > > > > > parameters to kafka-reassign-partitions.sh so operators can
> cap
> > > how
> > > > > > many
> > > > > > > > partition reassignments are submitted or kept in flight at
> once
> > > > using
> > > > > > > > existing Admin API,
> > > > > > > >
> > > > > > > > I will appreciate your initial thoughts and feedback on the
> > > > proposal.
> > > > > > > >
> > > > > > > > https://cwiki.apache.org/confluence/x/8ZAmGQ
> > > > > > > >
> > > > > > > > Thanks,
> > > > > > > > Manan
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Reply via email to