Thanks for reviving this thread, Reetika!
When I took a look at this last year there was only a way to expose
query-level metrics but the manifest file content cache is rather on a
process-level. Hence I brought up this topic but apparently there was no
interest coming from the community so I haven't investigated further.
However, if there is an interest now, I think I can take another look to
find a way to expose these metrics so that the clients could pick them up
and use them for their purposes.

I'd also appreciate to hear the opinion from committers/PMC to see where
the pitfalls are for instance.

Cheers,
Gabor


On Sun, Nov 5, 2023 at 7:51 PM Reetika Agrawal <agrawal.reetika...@gmail.com>
wrote:

> Hi Iceberg Community,
>
> I was also exploring Manifest file caching feature which was introduced in
> Iceberg https://github.com/apache/iceberg/pull/4518 for Presto execution
> engine. And as I checked Cache Metrics are getting computed and recorded in
> Iceberg library for same, but didn’t find a way to access these metrics on
> execution engine side. To validate the cache usage we would require atleast
> cache metrics access for this feature. Could you please let me know how
> could I do that?
>
> Thanks,
> Reetika
>
> On 2022/12/09 15:40:04 Gabor Kaszab wrote:
> > Hey Iceberg Community,
> >
> > Scan metrics support was introduced in 1.1.0 that is pretty nice to
> collect
> > some per-query stats like 'total planning duration' and such.
> > However, I haven't found a way to get process level metrics at the
> moment.
> > The particular thing I wanted to achieve is that there was a Manifest
> file
> > content cache <https://github.com/apache/iceberg/pull/4518> introduced
> > recently where the stats are logged, but aren't exposed through the API.
> >
> > Such an API could be used to expose stats for various kinds of caches
> (e.g.
> > number of hits and misses), we could count the number of specific API
> calls
> > (like planFiles()) and I guess many more other useful stuff.
> >
> > With this mail I'd like to initiate a discussion around this topic. If
> the
> > community is open for such a change I could start working on an initial
> > solution that would expose the Manifest file content cache stats but
> could
> > be extended later on with more metrics.
> >
> > Cheers,
> > Gabor
> >
>

Reply via email to