Hi All,
I know the benefit of RDD caching, but abuse of using cache may cause memory leak. In graphx, we can cache a graph by using graph.cache(), and many transformation of graph create and cache new edges, like partitionBy() and subgraph(). However, I can not find an interface to unpersist edges. I wonder the purpose of design like this. It can indeed improve performance. But, it may lead to memory leak if not unpersist edges in some cases. For example, in Spark-1.0, the pregel can not unpersist edges of old graph effectively, and leads to memory leak. Much Thanks!