Re: FLIP-484: Add custom metric variables to operators

Piotr Nowojski Fri, 06 Dec 2024 00:58:51 -0800

I hope my previous message has clarified the questions. If there are no
further comments,
I will open a voting thread later today.


Best,
Piotrek


wt., 3 gru 2024 o 12:18 Piotr Nowojski <pnowoj...@apache.org> napisał(a):

> Hi Romand and Hongshun,
>
> > 1. The scope of the variables added - is it only the last transformation?
> > Do I understand correctly, that chaining does NOT affect this scoping?
>
> > Could you please give a more detailed example how to do it? Currently,
> the
> > example only shows source.
>
> Setting the scope will work exactly the same way as setting `uid`, `name`,
> `parallelism` or `maxParallelism`. So it will be applied only to the last
> defined transformation.
> For example:
>
>         DataStream<Tuple2<String, Integer>> dataStream = env
>                 .socketTextStream("localhost", 9999)
>                 .addMetricVariable("foo1", "bar1")
>                 .flatMap(new Splitter())
>                 .addMetricVariable("foo2", "bar2")
>                 .keyBy(value -> value.f0)
>                 .window(TumblingProcessingTimeWindows.of(Time.seconds(5)))
>                 .sum(1)
>                 .addMetricVariable("foo3", "bar3");
>
> Chaining doesn't affect additional metric variables.
>
> > 2. Is Python API going to be supported as well?
>
> Unfortunately not initially.
>
> Best,
> Piotrek
>
> wt., 3 gru 2024 o 03:27 Hongshun Wang <loserwang1...@gmail.com>
> napisał(a):
>
>> Hi Piotr,
>>   thanks for the proposal.
>>
>> >  it would be helpful to group metrics from each source/sink instance
>> together,
>> Could you please give a more detailed example how to do it? Currently, the
>> example only shows source.
>>
>> Thanks,
>> Hongshun
>>
>> On Thu, Nov 14, 2024 at 8:36 PM Roman Khachatryan <ro...@apache.org>
>> wrote:
>>
>> > Hi Piotr, thanks for the proposal,
>> >
>> > Can you please clarify
>> > 1. The scope of the variables added - is it only the last
>> transformation?
>> > Do I understand correctly, that chaining does NOT affect this scoping?
>> >
>> > 2. Is Python API going to be supported as well?
>> >
>> > Thanks
>> >
>> > Regards,
>> > Roman
>> >
>> >
>> > On Thu, Nov 7, 2024 at 2:52 PM Piotr Nowojski <pnowoj...@apache.org>
>> > wrote:
>> >
>> > > Hi all!
>> > >
>> > > I would like to open up for discussion a new FLIP-484 [1].
>> > >
>> > > Motivation
>> > > When running a Flink job that reads data from multiple sources and
>> writes
>> > > to multiple sinks, it would be helpful to group metrics from each
>> > > source/sink instance together, for example based on the underlying
>> name
>> > of
>> > > the source/sink table/topic.
>> > >
>> > > Proposed Changes
>> > > This flip proposes to allow users to set custom metric variables to
>> > > operators/transformations, that will be later passed on to the metrics
>> > and
>> > > trace reporters.
>> > >
>> > > This will allow users to label all of the reported metrics from
>> operators
>> > > with for example the table name (SQL), allowing to easily report the
>> > number
>> > > of records/watermark values per each source/sink table in the same
>> Flink
>> > > Job.
>> > >
>> > > For more information please look into the FLIP-484 [1].
>> > >
>> > > I'm looking forward to your thoughts on this.
>> > >
>> > > Best,
>> > > Piotrek
>> > >
>> > > [1] https://cwiki.apache.org/confluence/x/44yMEw
>> > > <https://cwiki.apache.org/confluence/x/4IyMEw>
>> > >
>> >
>>
>

Re: FLIP-484: Add custom metric variables to operators

Reply via email to