"Second, one of the RDDs is cached in the first stage (denoted by the green
highlight). Since the enclosing operation involves reading from HDFS,
caching this RDD means future computations on this RDD can access at least
a subset of the original file from memory instead of from HDFS."

from
https://databricks.com/blog/2015/06/22/understanding-your-spark-application-through-visualization.html

On Thu, Nov 17, 2016 at 9:19 AM, Reynold Xin <r...@databricks.com> wrote:

> Ha funny. Never noticed that.
>
>
> On Thursday, November 17, 2016, Nicholas Chammas <
> nicholas.cham...@gmail.com> wrote:
>
>> Hmm... somehow the image didn't show up.
>>
>> How about now?
>>
>> [image: Screen Shot 2016-11-17 at 11.57.14 AM.png]
>>
>> On Thu, Nov 17, 2016 at 12:14 PM Herman van Hövell tot Westerflier <
>> hvanhov...@databricks.com> wrote:
>>
>>> Should I be able to see something?
>>>
>>> On Thu, Nov 17, 2016 at 9:10 AM, Nicholas Chammas <
>>> nicholas.cham...@gmail.com> wrote:
>>>
>>>> Some questions about this DAG visualization:
>>>>
>>>> [image: Screen Shot 2016-11-17 at 11.57.14 AM.png]
>>>>
>>>> 1. What's the meaning of the green dot?
>>>> 2. Should this be documented anywhere (if it isn't already)? Preferably
>>>> a tooltip or something directly in the UI would explain the significance.
>>>>
>>>> Nick
>>>>
>>>>
>>>

Reply via email to