Hi there,

I'm working on a clustering algorithm on a DataStream based on this paper
<http://www1.cse.wustl.edu/~ychen/public/sigproc-sp.pdf> and in order to
implement that, I need to maintain some global values (initially HashMap
but might be DataSet) and update it on every item arriving.

If this is even possible with Flink? (some kind of reverse broadcasting)
And if yes, what would be the right way to do it? or what are the
alternatives?

As a stream simulation I use source function based on the demo
<https://github.com/dataArtisans/flink-streaming-demo/blob/master/src/main/scala/com/dataartisans/flink_demo/sources/TaxiRideSource.scala>
.

Thanks,


-- 
Best regards,
Dima Arbuzin

Reply via email to