The implementation inside the Python API and Scala API for RDD is slightly
different, so the difference of RDD lineage you printed is expected.
On Tue, Aug 16, 2016 at 10:58 AM, DEEPAK SHARMA wrote:
> Hi All,
>
>
> Below is the small piece of code in scala and python REPL in Apache
> Spark.Howev
Hi All,
Below is the small piece of code in scala and python REPL in Apache
Spark.However I am getting different output in both the language when I execute
toDebugString.I am using cloudera quick start VM.
PYTHON
rdd2 =
sc.textFile('file:/home/training/training_materials/data/frostroad.txt')