Hey, Thank you for proposing this! Sounds really useful - we have definitely seem some difficult to explain pauses in consumer activity and this metric will let us correlate those.
Few questions: 1. Did you consider adding both Max and Avg metrics? Many of our metrics have both (batch-size and message-size for example) and it helps put the max value in context. 2. You wrote: "Lengthening or shortening the 3 hour time window is up for discussion (default is 30sec)." and I'm not sure what default you are referring to? 3. Can you also give some background on why you are proposing 3h? I'm guessing it is because loading the state from the topic happens rarely enough that in 3h it will probably only happen once or not at all? Perhaps we need a rate metric to see how often it actually happens (if we have to reload offsets very often it is a different problem). Gwen On Tue, Jun 25, 2019 at 4:43 PM Anastasia Vela <av...@confluent.io> wrote: > > Hi all, > > I'd like to discuss KIP-484: > https://cwiki.apache.org/confluence/display/KAFKA/KIP-484%3A+Expose+metrics+for+group+and+transaction+metadata+loading+duration > > Let me know what you think! > > Thanks, > Anastasia -- Gwen Shapira Product Manager | Confluent 650.450.2760 | @gwenshap Follow us: Twitter | blog