OK, KIP updated:
 - added overloads taking `Materialized`
 - dropped overloads taking `ValueTransformerSupplier` in favour of the
`withKey` variants.
 - added more info around the limitations of the ProcessorContext passed in
to the transformer's init calls, i.e. no forward calls allowed or calls to
getStateStore where the store name matches the materialized result of the
call.

I'll sort out the PR next.

On 11 May 2018 at 10:26, Damian Guy <damian....@gmail.com> wrote:

> I'm a +1 for Guozhang's suggestion
>
> On Fri, 11 May 2018 at 10:20 Andy Coates <a...@confluent.io> wrote:
>
> > Makes sense to me.  What do others think?
> >
> > On 11 May 2018 at 10:13, Guozhang Wang <wangg...@gmail.com> wrote:
> >
> > > Hi folks,
> > >
> > > While looking into the overloaded functions, I'm wondering if we can
> save
> > > the transformers without key, i.e. only add two overloaded functions:
> > >
> > > <VR> KTable<K, VR> transformValues(final
> > ValueTransformerWithKeySupplier<?
> > > super K, ? super V, ? extends VR> valueTransformerSupplier,
> > >                                    final String... stateStoreNames);
> > >
> > > <VR> KTable<K, VR> transformValues(final
> > ValueTransformerWithKeySupplier<?
> > > super K, ? super V, ? extends VR> valueTransformerSupplier,
> > >                                    final Materialized<K, VR,
> > > KeyValueStore<Bytes, byte[]>> materialized,
> > >                                    final String... stateStoreNames);
> > >
> > > The reason is that, in KIP-149 we've added the overloaded functions
> > > `withKey`, which should be covering the case without key already
> because
> > if
> > > users do not really need the key, they can just take it as a dummy
> > > parameter. We did not deprecate the old ones since some of them have
> just
> > > been added one version back. But if we agree that by the end of the day
> > we
> > > would only maintain the overloaded value functions "with key" only,
> then
> > we
> > > should not add the ones without keys any more in new KIPs.
> > >
> > > WDYT?
> > >
> > >
> > > Guozhang
> > >
> > >
> > > On Fri, May 11, 2018 at 9:42 AM, Andy Coates <a...@confluent.io>
> wrote:
> > >
> > > > Sorry for my lack of response - I've been out of action with a bad
> back
> > > for
> > > > a few days!
> > > >
> > > > I originally had the `Materialized` overloads added to the API. I'll
> > > update
> > > > the KIP / PR with these again. In terms of semantics, as Matthias
> > > suggests,
> > > > these should be consistent with filter() and mapValues(), etc.
> > > >
> > > > On 8 May 2018 at 17:59, Guozhang Wang <wangg...@gmail.com> wrote:
> > > >
> > > > > To follow on Matthias and Damian's comments here:
> > > > >
> > > > > If we are going to add the overload functions as
> > > > >
> > > > > ```
> > > > > <VR> KTable<K, VR> transformValues(final ValueTransformerSupplier<?
> > > super
> > > > > V,
> > > > > ? extends VR> valueTransformerSupplier,
> > > > >                                    final String... stateStoreNames,
> > > > >                                    final Materialized<K,
> > > > > VR, KeyValueStore<Bytes, byte[]> materialized);
> > > > >
> > > > > <VR> KTable<K, VR> transformValues(final
> > ValueTransformerWithKeySupplie
> > > > r<?
> > > > > super K, ? super V, ? extends VR> valueTransformerSupplier,
> > > > >                                    final String... stateStoreNames,
> > > > >                                  final Materialized<K,
> > > > > VR, KeyValueStore<Bytes, byte[]> materialized);
> > > > > ```
> > > > >
> > > > > Then are we going to still only allow the valueTransofmer.init() /
> > > > > process() to be able to access N stores, with N stores specified
> with
> > > the
> > > > > stateStoreNames, but not the one specified in materialized.name()?
> > > > > Personally I think it should be the case as the materialized store
> > > should
> > > > > be managed by the Streams library itself, but we should probably
> help
> > > > users
> > > > > to understand if they have some stores used for the same purpose
> > > (storing
> > > > > the value that are going to be sent to the downstream changelog
> > stream
> > > of
> > > > > KTable), they should save that store and not creating by themselves
> > as
> > > it
> > > > > will be auto created by the Streams library.
> > > > >
> > > > >
> > > > > Guozhang
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > On Tue, May 8, 2018 at 7:45 AM, Damian Guy <damian....@gmail.com>
> > > wrote:
> > > > >
> > > > > > Initially i thought materializing a store would be overkill, but
> > > from a
> > > > > > consistency point of view it makes sense to add an overload that
> > > takes
> > > > a
> > > > > > `Materialized` and only create the store if that is supplied.
> > > > > >
> > > > > > On Sun, 6 May 2018 at 17:52 Matthias J. Sax <
> matth...@confluent.io
> > >
> > > > > wrote:
> > > > > >
> > > > > > > Andy,
> > > > > > >
> > > > > > > thanks for the KIP. I don't have any further comments.
> > > > > > >
> > > > > > > My 2cents about Guozhang's questions: as I like consistent
> > > behavior,
> > > > I
> > > > > > > think transfromValues() should behave the same way as filter()
> > and
> > > > > > > mapValues().
> > > > > > >
> > > > > > >
> > > > > > > -Matthias
> > > > > > >
> > > > > > > On 5/2/18 2:24 PM, Guozhang Wang wrote:
> > > > > > > > Hello Andy,
> > > > > > > >
> > > > > > > > Thanks for the KIP. The motivation and the general proposal
> > looks
> > > > > good
> > > > > > to
> > > > > > > > me. I think in KTable it is indeed valuable to add the
> > functions
> > > > that
> > > > > > > does
> > > > > > > > not change key, such as mapValues, transformValues, and
> filter.
> > > > > > > >
> > > > > > > > There are a few meta comments I have about the semantics of
> the
> > > > newly
> > > > > > > added
> > > > > > > > functions:
> > > > > > > >
> > > > > > > > 1) For the resulted KTable, how should its
> > "queryableStoreName()"
> > > > be
> > > > > > > > returning?
> > > > > > > >
> > > > > > > > 2) More specifically, how do we decide if the resulted KTable
> > is
> > > to
> > > > > be
> > > > > > > > materialized or not? E.g. if there is no store names provided
> > > then
> > > > it
> > > > > > is
> > > > > > > > likely that the resulted KTable is not materialized, or at
> > least
> > > > not
> > > > > > > > logically materialized and not be queryable. What if there is
> > at
> > > > > least
> > > > > > > one
> > > > > > > > state store provided? Will any of them be provided as the
> > > > > materialized
> > > > > > > > store, or should we still add a Materialized parameter for
> this
> > > > > > purpose?
> > > > > > > >
> > > > > > > > 3) For its internal implementations, how should the key/value
> > > > serde,
> > > > > > > > sendOldValues flag etc be inherited from its parent processor
> > > node?
> > > > > > > >
> > > > > > > >
> > > > > > > > Guozhang
> > > > > > > >
> > > > > > > >
> > > > > > > > On Wed, May 2, 2018 at 12:43 PM, Andy Coates <
> > a...@confluent.io>
> > > > > > wrote:
> > > > > > > >
> > > > > > > >> Hi everyone,
> > > > > > > >>
> > > > > > > >> I would like to start a discussion for KIP 292. I would
> > > appreciate
> > > > > it
> > > > > > if
> > > > > > > >> you could review and provide feedback.
> > > > > > > >>
> > > > > > > >> KIP: KIP-292: Add transformValues() method to KTable
> > > > > > > >> <https://cwiki.apache.org/confluence/display/KAFKA/KIP-
> > > > > > > >> 292%3A+Add+transformValues%28%29+method+to+KTable>
> > > > > > > >> Jira: KAFKA-6849 <https://issues.apache.org/
> > > > jira/browse/KAFKA-6849>
> > > > > > > >>
> > > > > > > >>    PR: #4959 <https://github.com/apache/kafka/pull/4959>
> > > > > > > >>
> > > > > > > >>
> > > > > > > >>
> > > > > > > >> Thanks,
> > > > > > > >>
> > > > > > > >> Andy
> > > > > > > >>
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > -- Guozhang
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > -- Guozhang
> > >
> >
>

Reply via email to