Hi Maxim,
I think Ken's approach is a good idea. However, you would need to a add a
stateful operator to join the results of the individual queries if that is
needed.
In order to join the results, you would need a unique id on which you can
keyBy() to collect all 20 records that originated from th
Hi Maxim,
If reducing latency is the goal, then option #1 seems better.
Though you’d need additional logic inside of your AsyncFunction to run all 20
queries in parallel.
I’d also consider a third option...
Use a FlatMapFunction to create 20 copies of the event (assuming it’s not
large), with
Hi everyone,
I'm writing streaming job which needs to query Cassandra for each event
multiple times, around 20. I would like to use Async IO for that but not
sure which option to choose:
1. Implement One AsyncFunction with 20 queries inside
2. Implement 20 AsyncFunctions, each with 1 query inside