What is the equivalent of Spark RDD is Flink

Sourav Mazumder Thu, 24 Dec 2015 07:49:19 -0800

Hi,

I am new to Flink. Trying to understand some of the basics of Flink.


What is the equivalent of Spark's RDD in Flink ? In my understanding the
closes think is DataSet API. But wanted to reconfirm.

Also using DataSet API if I ingest a large volume of data (val lines :
DataSet[String] = env.readTextFile(<some file path and name>)), which may
not fit in single slave node, will that data get automatically distributed
in the memory of other slave nodes ?

Regards,
Sourav

What is the equivalent of Spark RDD is Flink

Reply via email to