How to generate seeded random numbers in GraphX Pregel API vertex procedure?

2016-06-02 Thread Roman Pastukhov
As far as I understand, best way to generate seeded random numbers in Spark is to use mapPartititons with a seeded Random instance for each partition. But graph.pregel in GraphX does not have anything similar to mapPartitions. Can something like this be done in GraphX Pregel API?

Is shutting down of SparkContext optional?

2014-03-19 Thread Roman Pastukhov
Hi, After switching from Spark 0.8.0 to Spark 0.9.0 (and to Scala 2.10) one application started hanging after main thread is done (in 'local[2]' mode, without a cluster). Adding SparkContext.stop() at the end solves this. Is this behavior normal and shutting down of SparkContext is required?

Log analyzer and other Spark tools

2014-03-17 Thread Roman Pastukhov
Hi. We're thinking about writing a tool that would read Spark logs and output cache contents at some point in time (e.g. if you want to see what data fills the cache and whether some of it may be unpersisted to improve performance). Are there similar projects that already exist? Is there a list o