Restarting this thread since it is relevant to us. We are thinking of using HBase/Cassandra to store graph data and then load the data from here into Flink/Gelly. One of the issues we are concerned about is the read performance. So far we tried our tests with data residing on HDFS and that worked fine.
Is there any guidance on reading from HBase for batch jobs ? Wondering if any experience with this approach. Do's/Don'ts etc.. Thanks -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/