Michael,
This is slick! I am still writing unit tests to verify it. My code
looks something like:
KTable<Windowed<String>, CountSumMinMaxAvgObj> oneMinuteWindowed =
    srcStream   // my val object isn't really called that, just wanted to
                // show a sample set of calculations the value can do!
        .groupByKey(Serdes.String(), Serdes.Double())
        .aggregate(/* initializer */, /* aggregator */,
                   TimeWindows.of(60*1000, 60*1000), "store1m");
        // I used an aggregate here so I could have a non-primitive value
        // object that does the calculations on each aggregation; the POJO
        // has an .add(Double) in it.

KTable<Tuple2<String, Long>, CountSumMinMaxAvgObj> fiveMinuteWindowed =
    oneMinuteWindowed   // I made my own Tuple2, will move window calc into it
        .groupBy( (windowedKey, value) -> new KeyValue<>(
                    new Tuple2<String, Long>(windowedKey.key(),
                        windowedKey.window().end() /1000/60/5 *1000*60*5),
                    value),
                keySerde, valSerde)
        // the above rounds time down to a timestamp divisible by 5 minutes
        .reduce(/* your adder */, /* your subtractor */, "store5m");
        // where your subtractor can be as simple as (val, agg) -> agg - val
        // for primitive types or as complex as you need,
        // just make sure you get the order right (lesson hard learnt ;) ),
        // subtraction is not commutative!
        // again my val object has an .add(Obj) and a .sub() to handle this,
        // so nice!

KTable<Tuple2<String, Long>, CountSumMinMaxAvgObj> fifteenMinuteWindowed =
    fiveMinuteWindowed
        .groupBy( (keyPair, value) -> new KeyValue<>(
                    new Tuple2<>(keyPair._1,
                        keyPair._2 /1000/60/15 *1000*60*15),
                    value),
                keySerde, valSerde)
        // the above rounds time down to a timestamp divisible by 15 minutes
        .reduce(/* your adder */, /* your subtractor */, "store15m");

KTable<Tuple2<String, Long>, CountSumMinMaxAvgObj> sixtyMinuteWindowed =
    fifteenMinuteWindowed
        .groupBy( (keyPair, value) -> new KeyValue<>(
                    new Tuple2<>(keyPair._1,
                        keyPair._2 /1000/60/60 *1000*60*60),
                    value),
                keySerde, valSerde)
        // the above rounds time down to a timestamp divisible by 60 minutes
        .reduce(/* your adder */, /* your subtractor */, "store60m");
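(Side note on the rounding trick in those comments: dividing and multiplying back in integer arithmetic truncates the remainder, which is exactly the round-down-to-window-boundary we want. A minimal sketch in plain Java, no Kafka needed; the method name roundDownTo is mine:)

```java
public class WindowRounding {
    // Round an epoch-millis timestamp down to the nearest multiple of
    // windowSizeMs. Integer division truncates toward zero, so
    // (ts / size) * size drops the remainder within the window.
    static long roundDownTo(long timestampMs, long windowSizeMs) {
        return timestampMs / windowSizeMs * windowSizeMs;
    }

    public static void main(String[] args) {
        long fiveMin = 5 * 60 * 1000L;               // 300000 ms
        long ts = 7 * 60 * 1000L + 30_000L;          // 00:07:30 after epoch
        System.out.println(roundDownTo(ts, fiveMin)); // 300000, i.e. 00:05:00
    }
}
```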
Notes thus far:
Doesn't look like I need to start the 5min with a windowed KTable return
object; it starts with the regular KTable<Tuple2<String, Long>> in this case.
I'm thinking about using windowedKey.window().start() instead of end(), as I
believe that is more consistent with what the windows themselves put out.
They go into the stores keyed by their start time, I believe.
Serdes get nuts, as does the generic typing on some of these classes
(yeah you, KeyValueMapper) -- makes for long code! I had to specify them
everywhere since the key/vals changed.
I didn't get enough time to mess with it today; I will wrap up the unit
tests tomorrow and run it against my real data to see how it performs. I
expect a huge reduction in resources (both streams and kafka storage) by
moving to this.
Thank you!
On Mon, May 8, 2017 at 5:26 PM, Matthias J. Sax <matth...@confluent.io>
wrote:
Michal,
that's an interesting idea. In an ideal world, Kafka Streams should have
an optimizer that is able to do this automatically under the hood. Too
bad we are not there yet.
@Garrett: did you try this out?
This seems to be a question that might affect many users, and it might
be worth documenting it somewhere as a recommended pattern.
-Matthias
On 5/8/17 1:43 AM, Michal Borowiecki wrote:
Apologies,
In the code snippet, of course, only the oneMinuteWindowed KTable will
have a Windowed key (KTable<Windowed<Key>, Value>); all others would be
just KTable<Tuple2<Key, Long>, Value>.
Michał
On 07/05/17 16:09, Michal Borowiecki wrote:
Hi Garrett,
I've encountered a similar challenge in a project I'm working on (it's
still work in progress, so please take my suggestions with a grain of
salt).
Yes, I believe KTable.groupBy lets you accomplish what you are aiming
for with something like the following (same snippet attached as a txt
file):
KTable<Windowed<Key>, Value> oneMinuteWindowed = yourStream
    // where Key and Value stand for your actual key and value types
    .groupByKey()
    .reduce(/* your adder */, TimeWindows.of(60*1000, 60*1000), "store1m");
    // where your adder can be as simple as (val, agg) -> agg + val
    // for primitive types or as complex as you need

KTable<Windowed<Tuple2<Key, Long>>, Value> fiveMinuteWindowed =
    oneMinuteWindowed   // Tuple2 for this example as defined by the
                        // javaslang library
    .groupBy( (windowedKey, value) -> new KeyValue<>(
                new Tuple2<>(windowedKey.key(),
                    windowedKey.window().end() /1000/60/5 *1000*60*5),
                value))
    // the above rounds time down to a timestamp divisible by 5 minutes
    .reduce(/* your adder */, /* your subtractor */, "store5m");
    // where your subtractor can be as simple as (val, agg) -> agg - val
    // for primitive types or as complex as you need,
    // just make sure you get the order right (lesson hard learnt ;) ),
    // subtraction is not commutative!

KTable<Windowed<Tuple2<Key, Long>>, Value> fifteenMinuteWindowed =
    fiveMinuteWindowed
    .groupBy( (keyPair, value) -> new KeyValue<>(
                new Tuple2<>(keyPair._1,
                    keyPair._2 /1000/60/15 *1000*60*15),
                value))
    // the above rounds time down to a timestamp divisible by 15 minutes
    .reduce(/* your adder */, /* your subtractor */, "store15m");

KTable<Windowed<Tuple2<Key, Long>>, Value> sixtyMinuteWindowed =
    fifteenMinuteWindowed
    .groupBy( (keyPair, value) -> new KeyValue<>(
                new Tuple2<>(keyPair._1,
                    keyPair._2 /1000/60/60 *1000*60*60),
                value))
    // the above rounds time down to a timestamp divisible by 60 minutes
    .reduce(/* your adder */, /* your subtractor */, "store60m");
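(To make the adder/subtractor pair concrete for a non-primitive value like Garrett's CountSumMinMaxAvgObj, here is a minimal count/sum sketch -- my own illustration, not code from this thread. Min and max are deliberately left out: they are not invertible, so a subtractor cannot maintain them without keeping extra state, and an aggregate over the grouped values would be needed instead:)

```java
public class CountSum {
    public final long count;
    public final double sum;

    public CountSum(long count, double sum) {
        this.count = count;
        this.sum = sum;
    }

    // adder: fold another aggregate (or a single value wrapped as
    // count=1) into this one
    public CountSum add(CountSum other) {
        return new CountSum(count + other.count, sum + other.sum);
    }

    // subtractor: remove a previous upstream value from the running
    // aggregate; order matters -- the old value is subtracted from
    // the aggregate, not the other way around
    public CountSum sub(CountSum other) {
        return new CountSum(count - other.count, sum - other.sum);
    }

    public double avg() {
        return count == 0 ? 0.0 : sum / count;
    }
}
```

Count, sum, and therefore avg survive subtraction cleanly, which is why reduce() with an adder/subtractor works for them.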
So, step by step:
  * You use a windowed aggregation only once; from there on you use the
    KTable abstraction only (which doesn't have windowed aggregations).
  * In each subsequent groupBy you map the key to a pair of
    (your-real-key, timestamp), where the timestamp is rounded down
    with the precision of the size of the new window.
  * reduce() on a KGroupedTable takes an adder and a subtractor, and it
    will correctly update the new aggregate by first subtracting the
    previous value of the upstream record before adding the new value
    (this way, just as you said, the downstream is aware of the
    statefulness of the upstream and correctly treats each record as
    an update).
  * If you want to reduce message volume further, you can break these
    into separate KafkaStreams instances and configure the downstream
    ones with a higher commit.interval.ms (unfortunately you can't have
    different values of this setting in different places of the same
    topology, I'm afraid).
  * TODO: look into retention policies; I haven't investigated that in
    any detail.
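(One extra gotcha worth flagging for anyone hand-rolling the pair key instead of using javaslang's Tuple2: grouping and store lookups rely on key equality, so equals() and hashCode() must be value-based. A minimal sketch, my own illustration:)

```java
import java.util.Objects;

public class Tuple2<A, B> {
    public final A _1;
    public final B _2;

    public Tuple2(A _1, B _2) {
        this._1 = _1;
        this._2 = _2;
    }

    // Value-based equality: without overriding these, two Tuple2
    // instances holding the same key and rounded timestamp would
    // compare by identity and never group together.
    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof Tuple2)) return false;
        Tuple2<?, ?> t = (Tuple2<?, ?>) o;
        return Objects.equals(_1, t._1) && Objects.equals(_2, t._2);
    }

    @Override
    public int hashCode() {
        return Objects.hash(_1, _2);
    }
}
```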
I haven't tested this exact code, so please excuse any typos.
Also, if someone with more experience could chip in and check that I'm
not talking nonsense here, or if there's an easier way to do this, that
would be great.
I don't know if the alternative approach is possible, where you
convert each resulting KTable back into a stream and just do a
windowed aggregation somehow. That would feel more natural, but I
haven't figured out how to correctly window over a changelog in the
KStream abstraction, feels impossible in the high-level DSL.
Hope that helps,
Michal
On 02/05/17 18:03, Garrett Barton wrote:
Let's say I want to sum values over increasing window sizes of 1, 5, 15,
and 60 minutes. Right now I have them running in parallel, meaning if I
am producing 1k/sec records I am consuming 4k/sec to feed each
calculation. In reality I am calculating far more than sum, and in this
pattern I'm looking at something like
(producing rate)*(calculations)*(windows) for a consumption rate.
So I had the idea: could I feed the 1 minute window into the 5 minute,
the 5 into the 15, and the 15 into the 60? Theoretically I would consume
a fraction of the records, not have to scale as huge, and be back to
something like (producing rate)*(calculations)+(updates).
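(Putting rough, purely illustrative numbers on those two formulas -- the calculation count, window count, and downstream update rate below are assumptions of mine, not figures from this thread:)

```java
public class ConsumptionRate {
    // parallel layout: every window re-reads the full source stream
    static long parallel(long producingRate, int calculations, int windows) {
        return producingRate * calculations * windows;
    }

    // cascaded layout: only the first window reads the source; the rest
    // consume the (much smaller) update stream from the level above
    static long cascaded(long producingRate, int calculations, long updateRate) {
        return producingRate * calculations + updateRate;
    }

    public static void main(String[] args) {
        long rate = 1_000;   // records/sec produced (from the email)
        int calcs = 4;       // assumed number of calculations
        int windows = 4;     // 1, 5, 15, 60 minute windows
        long updates = 200;  // assumed update rate downstream of the
                             // 1-minute window (depends on key cardinality
                             // and caching/commit interval)
        System.out.println(parallel(rate, calcs, windows)); // 16000
        System.out.println(cascaded(rate, calcs, updates)); // 4200
    }
}
```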
Thinking this was an awesome idea, I went to try and implement it and
got twisted around. These are windowed grouping operations that produce
KTables, which means instead of a raw stream I have an update stream. To
me this implies that downstream must be aware of this and consume
stateful information, knowing that each record is an update and not an
addition. Does the high level API handle that construct and let me do
that? For a simple sum it would have to hold each of the latest values
for, say, the five 1-minute sums in a given window, to perform the 5
minute sum. Reading the docs, which are awesome, I cannot determine if
KTable.groupBy() would work over a window, and whether reduce or
aggregate would thus do what I need.
Any ideas?
--
Signature
<http://www.openbet.com/> Michal Borowiecki
Senior Software Engineer L4
T: +44 208 742 1600
+44 203 249 8448
E: michal.borowie...@openbet.com
W: www.openbet.com <http://www.openbet.com/>
OpenBet Ltd
Chiswick Park Building 9
566 Chiswick High Rd
London
W4 5XT
UK
<https://www.openbet.com/email_promo>
This message is confidential and intended only for the addressee. If
you have received this message in error, please immediately
notify the
postmas...@openbet.com <mailto:postmas...@openbet.com> and delete it
from your system as well as any copies. The content of e-mails as
well
as traffic data may be monitored by OpenBet for employment and
security purposes. To protect the environment please do not print
this
e-mail unless necessary. OpenBet Ltd. Registered Office: Chiswick
Park
Building 9, 566 Chiswick High Road, London, W4 5XT, United
Kingdom. A
company registered in England and Wales. Registered no. 3134634. VAT
no. GB927523612