Hi Kyle,

Sorry for the delay in reviews; tomorrow is the feature freeze deadline (https://cwiki.apache.org/confluence/display/KAFKA/Release+Plan+0.11.0.0), so people are busier than usual. Stay tuned.

Eno
> On 15 May 2017, at 13:25, Kyle Winkelman <winkelman.k...@gmail.com> wrote:
>
> Damian Guy, could you let me know if you plan to review this further? There is no rush, but if you don't have any additional comments I could start the voting and finish my WIP PR.
>
> Thanks,
> Kyle
>
> On May 9, 2017 11:07 AM, "Kyle Winkelman" <winkelman.k...@gmail.com> wrote:
>
>> Eno, is there anyone else who is an expert in the Kafka Streams realm that I should reach out to for input?
>>
>> I believe Damian Guy is still planning on reviewing this more in depth, so I will wait for his input before continuing.
>>
>> On May 9, 2017 7:30 AM, "Eno Thereska" <eno.there...@gmail.com> wrote:
>>
>>> Thanks Kyle, good arguments.
>>>
>>> Eno
>>>
>>>> On May 7, 2017, at 5:06 PM, Kyle Winkelman <winkelman.k...@gmail.com> wrote:
>>>>
>>>> *- minor: could you add an exact example (similar to what Jay's example is, or like your Spark/Pig pointers had) to make this super concrete?*
>>>> I have added a more concrete example to the KIP.
>>>>
>>>> *- my main concern is that we're exposing this optimization to the DSL. In an ideal world, an optimizer would take the existing DSL and do the right thing under the covers (create just one state store, arrange the nodes etc). The original DSL had a bunch of small, composable pieces (group, aggregate, join) that this proposal groups together. I'd like to hear your thoughts on whether it's possible to do this optimization with the current DSL, at the topology builder level.*
>>>> You would have to make a lot of checks to understand if it is even possible to make this optimization:
>>>> 1. Make sure they are all KTableKTableOuterJoins.
>>>> 2. None of the intermediate KTables are used for anything else.
>>>> 3. None of the intermediate stores are used. (This may be impossible, especially if they use KafkaStreams#store after the topology has already been built.)
>>>> You would then need to make decisions during the optimization:
>>>> 1. Your new initializer would be the composite of all the individual initializers and the valueJoiners (see the sketch below).
>>>> 2. I am having a hard time thinking about how you would turn the aggregators and valueJoiners into an aggregator that would work on the final object, but this may be possible.
>>>> 3. Which state store would you use? The ones declared would be for the aggregate values. None of the declared ones would be guaranteed to hold the final object. This would mean you must create a new state store and not create any of the declared ones.
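(Illustrative sketch only, not from the KIP or the PR: decision 1 above would roughly amount to synthesizing a composed Initializer out of the user-supplied pieces. The names initializer1..3, joinerOneAndTwo, joinerOneTwoAndThree and CG are taken from the example further down in this thread; the composition itself is hypothetical.)

    // Hypothetical: the initializer an optimizer would have to synthesize if it
    // collapsed table1.outerJoin(table2, joinerOneAndTwo)
    //                 .outerJoin(table3, joinerOneTwoAndThree)
    // into a single store holding the final CG value.
    Initializer<CG> compositeInitializer = () ->
        joinerOneTwoAndThree.apply(
            joinerOneAndTwo.apply(initializer1.apply(), initializer2.apply()),
            initializer3.apply());
    // Decision 2 is the harder part: each per-stream aggregator produces an
    // intermediate value (V1, V2, V3), so rewriting them into a single
    // Aggregator<K, V, CG> over the final object is not a mechanical step.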
>>>> The main argument I have against it is that even if it could be done, I don't know that we would want to have this be an optimization in the background, because the user would still be required to think about all of the intermediate values that they shouldn't need to worry about if they only care about the final object.
>>>>
>>>> In my opinion cogroup is a common enough case that it should be part of the composable pieces (group, aggregate, join), because we want to allow people to join two or more streams in an easy way. Right now I don't think we give them ways of handling this use case easily.
>>>>
>>>> *- I think there will be scope for several such optimizations in the future and perhaps at some point we need to think about decoupling the 1:1 mapping from the DSL into the physical topology.*
>>>> I would argue that cogroup is not just an optimization; it is a new way for the users to look at accomplishing a problem that requires multiple streams. I may sound like a broken record, but I don't think users should have to build the N-1 intermediate tables and deal with their initializers, serdes and stores if all they care about is the final object.
>>>> Now if, for example, someone uses cogroup but doesn't supply additional streams and aggregators, this case is equivalent to a single grouped stream making an aggregate call. This case is what I view as an optimization: we could remove the KStreamCogroup and act as if there was just a call to KGroupedStream#aggregate instead of calling KGroupedStream#cogroup. (I would prefer to just write a warning saying that this is not how cogroup is to be used.)
>>>>
>>>> Thanks,
>>>> Kyle
>>>>
>>>> On Sun, May 7, 2017 at 5:41 AM, Eno Thereska <eno.there...@gmail.com> wrote:
>>>>
>>>>> Hi Kyle,
>>>>>
>>>>> Thanks for the KIP again. A couple of comments:
>>>>>
>>>>> - minor: could you add an exact example (similar to what Jay's example is, or like your Spark/Pig pointers had) to make this super concrete?
>>>>>
>>>>> - my main concern is that we're exposing this optimization to the DSL. In an ideal world, an optimizer would take the existing DSL and do the right thing under the covers (create just one state store, arrange the nodes etc). The original DSL had a bunch of small, composable pieces (group, aggregate, join) that this proposal groups together. I'd like to hear your thoughts on whether it's possible to do this optimization with the current DSL, at the topology builder level.
>>>>>
>>>>> I think there will be scope for several such optimizations in the future and perhaps at some point we need to think about decoupling the 1:1 mapping from the DSL into the physical topology.
>>>>>
>>>>> Thanks
>>>>> Eno
>>>>>
>>>>>> On May 5, 2017, at 4:39 PM, Jay Kreps <j...@confluent.io> wrote:
>>>>>>
>>>>>> I haven't digested the proposal, but the use case is pretty common. An example would be the "customer 360" or "unified customer profile" use case we often use. In that use case you have a dozen systems, each of which has some information about your customer (account details, settings, billing info, customer service contacts, purchase history, etc). Your goal is to join/munge these into a single profile record for each customer that has all the relevant info in a usable form and is up-to-date with all the source systems. If you implement that with KStreams as a sequence of joins, I think today we'd fully materialize N-1 intermediate tables. But clearly you only need a single stage to group all these things that are already co-partitioned. A distributed database would do this under the covers, which is arguably better (at least when it does the right thing), and perhaps we could do the same thing, but I'm not sure we know the partitioning, so we may need an explicit cogroup command that implies they are already co-partitioned.
>>>>>>
>>>>>> -Jay
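(Illustrative sketch, not from the KIP: to tie Jay's example to the proposal, the "customer 360" case might look roughly like this under the proposed cogroup API. The topic names, the Profile/Account/Billing/Purchase types, profileSerde and the per-source aggregators are made up for illustration, and the cogroup/aggregate signatures follow the KIP as currently proposed, so they may still change.)

    // One co-partitioned profile table built from several source systems,
    // with a single store, a single initializer and a single serde.
    KGroupedStream<String, Account> accounts = builder.stream("accounts").groupByKey();
    KGroupedStream<String, Billing> billing = builder.stream("billing").groupByKey();
    KGroupedStream<String, Purchase> purchases = builder.stream("purchases").groupByKey();

    KTable<String, Profile> profiles =
        accounts.cogroup(Profile::new,
                         (customerId, account, profile) -> profile.withAccount(account),
                         profileSerde, "customer-profile-store")
                .cogroup(billing, (customerId, bill, profile) -> profile.withBilling(bill))
                .cogroup(purchases, (customerId, purchase, profile) -> profile.withPurchase(purchase))
                .aggregate();
    // Each Profile.withXxx method is assumed to return the updated profile,
    // so the lambdas satisfy Aggregator<K, V, Profile>.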
>>>>>>
>>>>>> On Fri, May 5, 2017 at 5:56 AM, Kyle Winkelman <winkelman.k...@gmail.com> wrote:
>>>>>>
>>>>>>> Yeah, that's a good way to look at it.
>>>>>>> I have seen this type of functionality in a couple of other platforms like Spark and Pig:
>>>>>>> https://spark.apache.org/docs/0.6.2/api/core/spark/PairRDDFunctions.html
>>>>>>> https://www.tutorialspoint.com/apache_pig/apache_pig_cogroup_operator.htm
>>>>>>>
>>>>>>> On May 5, 2017 7:43 AM, "Damian Guy" <damian....@gmail.com> wrote:
>>>>>>>
>>>>>>>> Hi Kyle,
>>>>>>>>
>>>>>>>> If I'm reading this correctly it is like an N-way outer join? So an input on any stream will always produce a new aggregated value - is that correct? Effectively, each Aggregator just looks up the current value, aggregates and forwards the result.
>>>>>>>> I need to look into it and think about it a bit more, but it seems like it could be a useful optimization.
>>>>>>>>
>>>>>>>> On Thu, 4 May 2017 at 23:21 Kyle Winkelman <winkelman.k...@gmail.com> wrote:
>>>>>>>>
>>>>>>>>> I sure can. I have added the following description to my KIP. If this doesn't help, let me know and I will take some more time to build a diagram and make more of a step-by-step description:
>>>>>>>>>
>>>>>>>>> Example with Current API:
>>>>>>>>>
>>>>>>>>> KTable<K, V1> table1 = builder.stream("topic1").groupByKey().aggregate(initializer1, aggregator1, aggValueSerde1, storeName1);
>>>>>>>>> KTable<K, V2> table2 = builder.stream("topic2").groupByKey().aggregate(initializer2, aggregator2, aggValueSerde2, storeName2);
>>>>>>>>> KTable<K, V3> table3 = builder.stream("topic3").groupByKey().aggregate(initializer3, aggregator3, aggValueSerde3, storeName3);
>>>>>>>>> KTable<K, CG> cogrouped = table1.outerJoin(table2, joinerOneAndTwo).outerJoin(table3, joinerOneTwoAndThree);
>>>>>>>>>
>>>>>>>>> As you can see this creates 3 StateStores, requires 3 initializers, and 3 aggValueSerdes. This also puts pressure on the user to define what the intermediate values are going to be (V1, V2, V3). They are left with a couple of choices: first, make V1, V2, and V3 all the same as CG so the two joiners are more like mergers; or second, make them intermediate states such as Topic1Map, Topic2Map, and Topic3Map and have the joiners use those to build the final aggregate CG value. This is something the user could avoid thinking about with this KIP.
>>>>>>>>>
>>>>>>>>> When a new input arrives, let's say at "topic1", it will first go through a KStreamAggregate, grabbing the current aggregate from storeName1. It will produce this in the form of the first intermediate value, which gets sent through a KTableKTableOuterJoin where it will look up the current value of the key in storeName2. It will use the first joiner to calculate the second intermediate value, which will go through an additional KTableKTableOuterJoin. Here it will look up the current value of the key in storeName3 and use the second joiner to build the final aggregate value.
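(Illustrative sketch, not from the KIP: with the first choice described above, where V1 = V2 = V3 = CG, the user still has to write the two joiners as mergers themselves. CG and its mergeWith method are made up; the null checks are needed because an outer join can pass null on either side.)

    // Hypothetical merger-style joiners for the current-API example above.
    ValueJoiner<CG, CG, CG> joinerOneAndTwo = (left, right) -> {
        if (left == null) return right;   // only the right-hand table had a value
        if (right == null) return left;   // only the left-hand table had a value
        return left.mergeWith(right);     // combine the two partial aggregates
    };
    ValueJoiner<CG, CG, CG> joinerOneTwoAndThree = joinerOneAndTwo; // same merge logic again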
>>>>>>>>>
>>>>>>>>> If you think through all possibilities for incoming topics, you will see that no matter which topic a record comes in through, all three stores are queried and all of the joiners must get used.
>>>>>>>>>
>>>>>>>>> Topology-wise, for N incoming streams this creates N KStreamAggregates, 2*(N-1) KTableKTableOuterJoins, and N-1 KTableKTableJoinMergers.
>>>>>>>>>
>>>>>>>>> Example with Proposed API:
>>>>>>>>>
>>>>>>>>> KGroupedStream<K, V1> grouped1 = builder.stream("topic1").groupByKey();
>>>>>>>>> KGroupedStream<K, V2> grouped2 = builder.stream("topic2").groupByKey();
>>>>>>>>> KGroupedStream<K, V3> grouped3 = builder.stream("topic3").groupByKey();
>>>>>>>>> KTable<K, CG> cogrouped = grouped1.cogroup(initializer1, aggregator1, aggValueSerde1, storeName1)
>>>>>>>>>     .cogroup(grouped2, aggregator2)
>>>>>>>>>     .cogroup(grouped3, aggregator3)
>>>>>>>>>     .aggregate();
>>>>>>>>>
>>>>>>>>> As you can see this creates 1 StateStore, requires 1 initializer, and 1 aggValueSerde. The user no longer has to worry about the intermediate values and the joiners. All they have to think about is how each stream impacts the creation of the final CG object.
>>>>>>>>>
>>>>>>>>> When a new input arrives, let's say at "topic1", it will first go through a KStreamAggregate and grab the current aggregate from storeName1. It will add its incoming object to the aggregate, update the store and pass the new aggregate on. This new aggregate goes through the KStreamCogroup, which is pretty much just a pass-through processor, and you are done.
>>>>>>>>>
>>>>>>>>> Topology-wise, for N incoming streams the new API will only ever create N KStreamAggregates and 1 KStreamCogroup.
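(Illustrative sketch, not the code from the WIP PR: roughly what each KStreamAggregate in the proposed topology would do per record, as described above - look up the current aggregate, apply that stream's aggregator, write it back and forward it, with KStreamCogroup simply passing the result through.)

    import org.apache.kafka.streams.kstream.Aggregator;
    import org.apache.kafka.streams.kstream.Initializer;
    import org.apache.kafka.streams.processor.Processor;
    import org.apache.kafka.streams.processor.ProcessorContext;
    import org.apache.kafka.streams.state.KeyValueStore;

    // Hypothetical per-stream aggregate processor over the single shared store.
    public class CogroupAggregateSketch<K, V, CG> implements Processor<K, V> {
        private final Initializer<CG> initializer;     // the one initializer for CG
        private final Aggregator<K, V, CG> aggregator; // this stream's aggregator
        private final String storeName;                // the single shared store
        private ProcessorContext context;
        private KeyValueStore<K, CG> store;

        CogroupAggregateSketch(Initializer<CG> initializer, Aggregator<K, V, CG> aggregator, String storeName) {
            this.initializer = initializer;
            this.aggregator = aggregator;
            this.storeName = storeName;
        }

        @SuppressWarnings("unchecked")
        @Override
        public void init(ProcessorContext context) {
            this.context = context;
            this.store = (KeyValueStore<K, CG>) context.getStateStore(storeName);
        }

        @Override
        public void process(K key, V value) {
            CG aggregate = store.get(key);
            if (aggregate == null) {
                aggregate = initializer.apply();         // first record for this key
            }
            aggregate = aggregator.apply(key, value, aggregate);
            store.put(key, aggregate);
            context.forward(key, aggregate);             // KStreamCogroup just forwards this
        }

        @Override
        public void punctuate(long timestamp) { }

        @Override
        public void close() { }
    }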
>>>>>>>>>
>>>>>>>>> On Thu, May 4, 2017 at 4:42 PM, Matthias J. Sax <matth...@confluent.io> wrote:
>>>>>>>>>
>>>>>>>>>> Kyle,
>>>>>>>>>>
>>>>>>>>>> thanks a lot for the KIP. Maybe I am a little slow, but I could not follow completely. Could you maybe add a more concrete example, like 3 streams with 3 records each (plus expected result), and show the difference between the current way to implement it and the proposed API? This could also cover the internal processing to see what store calls would be required for both approaches etc.
>>>>>>>>>>
>>>>>>>>>> I think it's pretty advanced stuff you propose, and it would help to understand it better.
>>>>>>>>>>
>>>>>>>>>> Thanks a lot!
>>>>>>>>>>
>>>>>>>>>> -Matthias
>>>>>>>>>>
>>>>>>>>>> On 5/4/17 11:39 AM, Kyle Winkelman wrote:
>>>>>>>>>>> I have made a pull request. It can be found here:
>>>>>>>>>>>
>>>>>>>>>>> https://github.com/apache/kafka/pull/2975
>>>>>>>>>>>
>>>>>>>>>>> I plan to write some more unit tests for my classes and get around to writing documentation for the public API additions.
>>>>>>>>>>>
>>>>>>>>>>> One thing I was curious about: in the KCogroupedStreamImpl#aggregate method I pass null to the KGroupedStream#repartitionIfRequired method. I can't supply the store name because if more than one grouped stream repartitions, an error is thrown. Is there some name that someone can recommend, or should I leave the null and allow it to fall back to the KGroupedStream.name?
>>>>>>>>>>>
>>>>>>>>>>> Should this be expanded to handle grouped tables? This would be pretty easy for a normal aggregate, but one allowing session stores and windowed stores would require KTableSessionWindowAggregate and KTableWindowAggregate implementations.
>>>>>>>>>>>
>>>>>>>>>>> Thanks,
>>>>>>>>>>> Kyle
>>>>>>>>>>>
>>>>>>>>>>> On May 4, 2017 1:24 PM, "Eno Thereska" <eno.there...@gmail.com> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> I'll look as well ASAP, sorry, been swamped.
>>>>>>>>>>>>
>>>>>>>>>>>> Eno
>>>>>>>>>>>>> On May 4, 2017, at 6:17 PM, Damian Guy <damian....@gmail.com> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>> Hi Kyle,
>>>>>>>>>>>>>
>>>>>>>>>>>>> Thanks for the KIP. I apologize that I haven't had the chance to look at the KIP yet, but will schedule some time to look into it tomorrow. For the implementation, can you raise a PR against kafka trunk and mark it as WIP? It will be easier to review what you have done.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Thanks,
>>>>>>>>>>>>> Damian
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Thu, 4 May 2017 at 11:50 Kyle Winkelman <winkelman.k...@gmail.com> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>> I am replying to this in hopes it will draw some attention to my KIP, as I haven't heard from anyone in a couple of days. This is my first KIP and my first large contribution to the project, so I'm sure I did something wrong. ;)
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On May 1, 2017 4:18 PM, "Kyle Winkelman" <winkelman.k...@gmail.com> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Hello all,
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I have created KIP-150 to facilitate discussion about adding cogroup to the Streams DSL.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Please find the KIP here:
>>>>>>>>>>>>>>> https://cwiki.apache.org/confluence/display/KAFKA/KIP-150+-+Kafka-Streams+Cogroup
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Please find my initial implementation here:
>>>>>>>>>>>>>>> https://github.com/KyleWinkelman/kafka
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Thanks,
>>>>>>>>>>>>>>> Kyle Winkelman