GraphX's VertexRDD cannot be materialized by calling count()

2014-06-22 Thread dash
Hi there, It seems one cannot materialize a VertexRDD simply by calling its count method, which VertexRDD overrides. Calling RDD's count, however, does materialize it. Is this a feature designed to get the count without materializing the VertexRDD? If so, do you guys think it is necessary to a

Checkpointed RDD still causing StackOverflow

2014-06-22 Thread dash
Hi, I'm doing iterative computation, and because the lineage chain keeps growing, we need to checkpoint the RDD to cut off the lineage and prevent a StackOverflowError. The following code still throws a StackOverflowError, even though I checked `isCheckpointed` and the result is true. Also, I wrote a function to count the

Re: Checkpointed RDD still causing StackOverflow

2014-06-22 Thread dash
Hi Xiangrui, As far as I know, calling count is a way to materialize the RDD; does collect do the same thing, since it is also an action? I cannot call count because, for a Graph object, count does not materialize the RDD. I have already filed an issue on that. My question is, why is there still a stac

Re: Checkpointed RDD still causing StackOverflow

2014-06-24 Thread dash
Due to SPARK-2245, you cannot use count to materialize a VertexRDD. That actually materializes the PartitionRDD, so checkpointing the VertexRDD won't work. I'm trying to fix that right now.