Hi, Mickael,

Thanks for the explanation. The KIP looks to me now.

Jun

On Tue, May 31, 2022 at 6:44 AM Mickael Maison <mickael.mai...@gmail.com>
wrote:

> Hi Jun,
>
> Igor answered to your question.
> Users should rely on their host metrics to monitor disk usage. But
> with tooling and automation it's sometimes not ideal to retrieve
> values from metrics. So exposing disk usage via the Kafka API will
> simplify coordinating disk operations.
>
> I've updated the motivation section to make it clear why being able to
> retrieve these values via the API rather than metrics is interesting.
>
> Thanks,
> Mickael
>
> On Tue, May 31, 2022 at 3:38 PM Mickael Maison <mickael.mai...@gmail.com>
> wrote:
> >
> > Hi Raman,
> >
> > Unfortunately the replica size only includes the log files and it does
> > not include indexes or other metadata files. Obviously any extra non
> > Kafka files are also not included either.
> >
> > For these reasons, I decided to have a separate field with the actual
> > usable space reported by the volume.
> >
> > Thanks,
> > Mickael
> >
> >
> >
> > On Tue, May 31, 2022 at 11:57 AM Mickael Maison
> > <mickael.mai...@gmail.com> wrote:
> > >
> > > Hi Cong,
> > >
> > > Kafka does not expose disk utilization metrics. This is something you
> > > need to provide in your environment. You definitively should have a
> > > mechanism for exposing metrics from your Kafka broker hosts and you
> > > should absolutely monitor disk usage and have appropriate alerts.
> > >
> > > Thanks,
> > > Mickael
> > >
> > > On Thu, May 26, 2022 at 7:34 PM Jun Rao <j...@confluent.io.invalid>
> wrote:
> > > >
> > > > Hi, Igor,
> > > >
> > > > Thanks for the reply.
> > > >
> > > > I agree that this KIP could be useful for improving the tool for
> moving
> > > > data across disks. It would be useful to clarify on the main
> motivation of
> > > > the KIP. Also, DescribeLogDirsResponse already includes the size of
> each
> > > > partition on a disk. So, it seems that UsableBytes is redundant
> since it's
> > > > derivable.
> > > >
> > > > Thanks,
> > > >
> > > > Jun
> > > >
> > > > On Thu, May 26, 2022 at 3:30 AM Igor Soarez <i...@soarez.me> wrote:
> > > >
> > > > > Hi,
> > > > >
> > > > > This can also be quite useful to make better use of existing
> functionality
> > > > > in the Kafka API — moving replicas between log directories via
> > > > > ALTER_REPLICA_LOG_DIRS. If usable space information is also
> available the
> > > > > caller can make better decisions using the same API. It means a
> more
> > > > > consistent way of interacting with Kafka to manage replicas
> locations
> > > > > within a broker without having to correlate Kafka metrics with
> information
> > > > > from the Kafka API.
> > > > >
> > > > > --
> > > > > Igor
> > > > >
> > > > > On Wed, May 25, 2022, at 8:16 PM, Jun Rao wrote:
> > > > > > Hi, Mickael,
> > > > > >
> > > > > > Thanks for the KIP.  Since this is mostly for monitoring and
> alerting,
> > > > > > could we expose them as metrics instead of as part of the API?
> We already
> > > > > > have a size metric per log. Perhaps we could extend that to add
> > > > > used/total
> > > > > > metrics per disk?
> > > > > >
> > > > > > Thanks,
> > > > > >
> > > > > > Jun
> > > > > >
> > > > > > On Thu, May 19, 2022 at 10:21 PM Raman Verma
> <rve...@confluent.io.invalid
> > > > > >
> > > > > > wrote:
> > > > > >
> > > > > >> Hello Mikael,
> > > > > >>
> > > > > >> Thanks for the KIP.
> > > > > >>
> > > > > >> I see that the API response contains some information about each
> > > > > partition.
> > > > > >> ```
> > > > > >> { "name": "PartitionSize", "type": "int64", "versions": "0+",
> > > > > >>   "about": "The size of the log segments in this partition in
> bytes." }
> > > > > >> ```
> > > > > >> Can this be summed up to provide a used space in a `log.dir`
> > > > > >> This will also be specific to a `log.dir` (for the case where
> multiple
> > > > > >> log.dir are hosted on the same underlying device)
> > > > > >>
> > > > > >> On Thu, May 19, 2022 at 10:21 AM Cong Ding
> <cd...@confluent.io.invalid>
> > > > > >> wrote:
> > > > > >> >
> > > > > >> > Hey Mickael,
> > > > > >> >
> > > > > >> > Great KIP!
> > > > > >> >
> > > > > >> > I have one question:
> > > > > >> >
> > > > > >> > You mentioned "DescribeLogDirs is usually a low volume API.
> This
> > > > > change
> > > > > >> > should not
> > > > > >> > significantly affect the latency of this API." and "That
> would allow
> > > > > to
> > > > > >> > easily validate whether disk operations (like a resize), or
> topic
> > > > > >> deletion
> > > > > >> > (log deletion only happen after a short delay) have
> completed." I
> > > > > wonder
> > > > > >> if
> > > > > >> > there is an existing metric/API that can allow administrators
> to
> > > > > >> determine
> > > > > >> > whether we need to resize? If administrators use this API to
> determine
> > > > > >> > whether we need a resize, would this API become a high-volume
> API? I
> > > > > >> > understand we don't want this API to be a high-volume one
> because the
> > > > > API
> > > > > >> > is already costly by returning `"name": "Topics"`.
> > > > > >> >
> > > > > >> > Cong
> > > > > >> >
> > > > > >> > On Thu, Apr 7, 2022 at 2:17 AM Mickael Maison <
> > > > > mickael.mai...@gmail.com>
> > > > > >> > wrote:
> > > > > >> >
> > > > > >> > > Hi,
> > > > > >> > >
> > > > > >> > > I wrote a small KIP to expose the total and usable space of
> logdirs
> > > > > >> > > via the DescribeLogDirs API:
> > > > > >> > >
> > > > > >> > >
> > > > > >>
> > > > >
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-827%3A+Expose+logdirs+total+and+usable+space+via+Kafka+API
> > > > > >> > >
> > > > > >> > > Please take a look and let me know if you have any feedback.
> > > > > >> > >
> > > > > >> > > Thanks,
> > > > > >> > > Mickael
> > > > > >> > >
> > > > > >>
> > > > > >>
> > > > > >>
> > > > > >> --
> > > > > >> Best Regards,
> > > > > >> Raman Verma
> > > > > >>
> > > > >
>

Reply via email to