This may be a simple question, but I am struggling to understand it.

A DStream is a sequence of RDDs created in a batch window. So how do I know
how many RDDs are created in a batch?

I am clear about how the number of partitions is determined:

Number of partitions = (batch interval / spark.streaming.blockInterval) *
number of receivers
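
To make the formula concrete, here is a small sketch of the arithmetic in
Python. The function name and the sample values are my own and are not part
of Spark; 200 ms is the default value of spark.streaming.blockInterval.

```python
def partitions_per_batch(batch_interval_ms, block_interval_ms, num_receivers):
    """Partitions per batch, following the formula above.

    Hypothetical helper for illustration only; it is not a Spark API.
    Each receiver produces one block per block interval, and each block
    becomes one partition of the batch.
    """
    blocks_per_receiver = batch_interval_ms // block_interval_ms
    return blocks_per_receiver * num_receivers


# 2 s batch, 200 ms block interval, 2 receivers -> 20 partitions
print(partitions_per_batch(2000, 200, 2))
```

So with these sample numbers I would expect 20 partitions per batch, but that
still does not tell me how many RDDs those partitions are spread across.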

Is it one RDD per receiver, or multiple RDDs per receiver? What is the
easiest way to find out?

Arun
