Re: Checkpointed RDD still causing StackOverflow

2014-06-22 Thread dash
Hi Xiangrui, According to my knowledge, calling count is for materialize the RDD, does collect do the same thing since it also an action? I can not call count because for a Graph object, count does not materialize the RDD. I already send an issue on that. My question is, why there still have stac

Re: Checkpointed RDD still causing StackOverflow

2014-06-22 Thread Xiangrui Meng
After checkpoint(), please call count(). This is similar to cache(), the RDD is only marked as to be checked with checkpoint(). -Xiangrui On Sun, Jun 22, 2014 at 3:14 PM, dash wrote: > Hi, > > I'm doing iterative computing now, and due to lineage chain, we need to > checkpoint the RDD in order to

Checkpointed RDD still causing StackOverflow

2014-06-22 Thread dash
Hi, I'm doing iterative computing now, and due to lineage chain, we need to checkpoint the RDD in order to cut off lineage and prevent StackOverflow error. The following code still having StackOverflowError, I checked `isCheckpointed` and the result is true. Also, I write a function to count the

GraphX's VertexRDD can not be materialized by calling count()

2014-06-22 Thread dash
Hi there, Seems one can not materialize VertexRDD by simply calling count method, which is overridden by VertexRDD. But if you call RDD's count, it could materialize it. Is this a feature that designed to get the count without materialize VertexRDD? If so, do you guys think it is necessary to a

Re: Assorted project updates (tests, build, etc)

2014-06-22 Thread Mark Hamstra
Just a couple of FYI notes: With Zinc and the scala-maven-plugin, repl and incremental builds are also available to those doing day-to-day development using Maven. As long as you don't have to delve into the extra boilerplate and verbosity of Maven's POMs relative to an SBT build file, there is li

Assorted project updates (tests, build, etc)

2014-06-22 Thread Patrick Wendell
Hey All, 1. The original test infrastructure hosted by the AMPLab has been fully restored and also expanded with many more executor slots for tests. Thanks to Matt Massie at the Amplab for helping with this. 2. We now have a nightly build matrix across different Hadoop versions. It appears that t