Github user kalmanchapman commented on the issue: https://github.com/apache/flink/pull/2735 @kateri1 - I agree that seeking a solution with Flink's data structures is valuable. I also think that Flink-ML is in a unique position to implement streaming-first, iterative implementations of this algorithm. They are fairly novel on the web, but in theory have been implemented in [Gensim's word2vec](https://github.com/RaRe-Technologies/gensim/pull/900). Having an initial, offline implementation of word2vec in flink could be considered as a foundation for an online word2vec that Flink would be in a unique position to implement and be of great use to the community looking for a scaling solution to this class of problem
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---