are you using job server or just reusing spark context? Mayur Rustagi Ph: +1 (760) 203 3257 http://www.sigmoidanalytics.com @mayur_rustagi <https://twitter.com/mayur_rustagi>
On Wed, Mar 5, 2014 at 10:30 PM, polkosity <polkos...@gmail.com> wrote: > After changing to reuse spark context and cache RDDs in memory, performance > is 4 times better. We didn't expect that much of an improvement! > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Job-initialization-performance-of-Spark-standalone-mode-vs-YARN-tp2016p2340.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. >