Hey Brenden, thanks for the KIP! Great to get more visibility into the new
consumer.

LM1. +1 on Lucas's suggestion for including the unit in the name, seems
clearer and consistent (I do see several time metrics including ms)

LM2. What about a new metric for application-event-queue-time-ms. It would
be a complement to the application-event-queue-size you're proposing, and
it will tell us how long the events sit in the queue, waiting to be
processed (from the time the API call adds the event to the queue, to the
time it's processed in the background thread). I find it would be very
interesting.

LM3. Thinking about the actual usage of
"time-between-network-thread-poll-xxx" metric, I imagine it would be
helpful to know more about what could be impacting it. As I see it, the
network thread cadence could be mainly impacted by: 1- app event processing
(generate requests), 2- network client poll (actual send/receive). For 2,
the new consumer reuses the same component as the legacy one, but 1 is
specific to the new consumer, so what about a metric
for application-event-processing-time-ms (we could consider avg I would
say). It would be the time that the network thread takes to process all
available events on each run.

Cheers!
Lianet

On Fri, Jul 12, 2024 at 1:57 PM Lucas Brutschy
<lbruts...@confluent.io.invalid> wrote:

> Hey Brenden,
>
> thanks for the KIP! These will be great to observe and debug the
> background thread of the new consumer.
>
> LB1. `time-between-network-thread-poll-max` → I see several similar
> metrics including the unit in the metric name (ms or us). We could
> consider this, although it's probably not strictly required. However,
> at least the description should state the unit. Same for the `avg`
> version.
> LB2. `unsent-requests-size` → Naming sounds a bit like it's referring
> to the size of the request. How about `unsent-request-queue-size` or
> `unsent-request-count` or simply `unsent-requests`?
> LB3. "the proposed metrics below will be set to null or 0." → which
> one will be set to null and which ones will be set to 0, and why?
>
> nit: "The current number of unsent requests in the consumer network" →
> Seems to be missing something?
>
> Cheers,
> Lucas
>
> On Fri, Jul 12, 2024 at 7:28 PM Brenden Deluna
> <bdel...@confluent.io.invalid> wrote:
> >
> > Hi Andrew,
> > Thank you for the feedback and your question.
> >
> > AS1. Great idea, I will get that added.
> >
> > AS2. For unsent-events-age-max, age will be calculated once the event is
> > sent, so you are correct.
> >
> > AS3. I agree, I think that would be a helpful metric to add, thank you! I
> > will get that added.
> >
> > Please let me know if you have any additional feedback, suggestions, or
> > questions.
> >
> > Thanks,
> > Brenden
> >
> > On Fri, Jul 12, 2024 at 11:45 AM Andrew Schofield <
> andrew_schofi...@live.com>
> > wrote:
> >
> > > Hi Brenden,
> > > Thanks for the KIP. It fills a gap in the metrics for the new consumer
> > > nicely.
> > >
> > > AS1. If using the CLASSIC protocol, and thus the LegacyKafkaConsumer,
> > > I would expect that the metrics do not exist at all. Maybe say
> something
> > > like
> > > “These metrics are for the new consumer implementation using the
> > > CONSUMER protocol”.
> > >
> > > AS2. For unsent-events-age-max, when is the age calculated? For
> example,
> > > is it calculated at the time that the unsent event is removed from the
> > > list and sent, or does the metric reflect unsent events which are still
> > > enqueued? I suspect the former, but thought I’d check.
> > >
> > > AS3. I think that unsent-events-age-avg would also be interesting to
> > > get an idea of how long unsent events tend to sit around before
> sending.
> > > Of course, the question from AS2 would also apply to the average.
> > >
> > > Thanks,
> > > Andrew
> > >
> > > > On 10 Jul 2024, at 17:44, Philip Nee <p...@confluent.io.INVALID>
> wrote:
> > > >
> > > > Hi all,
> > > >
> > > > This is the link to the KIP document.
> > > >
> > >
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-1068%3A+New+JMX+metrics+for+the+new+KafkaConsumer
> > > >
> > > > Any comment is appreciated,
> > > >
> > > >
> > > > On Tue, Jul 9, 2024 at 10:14 AM Brenden Deluna
> > > <bdel...@confluent.io.invalid>
> > > > wrote:
> > > >
> > > >> Hello everyone,
> > > >>
> > > >> I would like to start the discussion thread for KIP-1068. This is a
> > > >> relatively small KIP, only proposing to add a couple of new metrics.
> > > >>
> > > >> If you have any suggestions or feedback, let me know, it will be
> much
> > > >> appreciated.
> > > >>
> > >
> > >
>

Reply via email to