http://www.slideshare.net/cloudera/top-5-mistakes-to-avoid-when-writing-apache-spark-applications

There is a section in these slides that relates to your question.

On 23 February 2016 at 16:49, Alex Dzhagriev <dzh...@gmail.com> wrote:

> Hello all,
>
> Can someone please advise me on the pros and cons of how to allocate
> resources: many small-heap machines with 1 core each, or a few machines
> with big heaps and many cores? I'm sure that depends on the data flow and
> there is no best-practice solution. E.g. with a bigger heap I can perform
> a map-side join with a bigger table. What other considerations should I
> keep in mind in order to choose the right configuration?
>
> Thanks, Alex.