Hi,

I am new to Flink. Trying to understand some of the basics of Flink.

What is the equivalent of Spark's RDD in Flink ? In my understanding the
closes think is DataSet API. But wanted to reconfirm.

Also using DataSet API if I ingest a large volume of data (val lines :
DataSet[String] = env.readTextFile(<some file path and name>)), which may
not fit in single slave node, will that data get automatically distributed
in the memory of other slave nodes ?

Regards,
Sourav

Reply via email to