Our experience is that, unless you can benefit from Spark features such as
co-partitioning that allow for more efficient execution, Spark is
slightly slower for disk-to-disk processing.
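
To illustrate what co-partitioning buys you, here is a minimal sketch in plain Python. It is a toy model and does not use the Spark API; the function names `partition_by_key` and `join_copartitioned` and the sample data are made up for illustration. The idea is that when two datasets are hash-partitioned with the same function and the same number of partitions, matching keys always land in the same partition index, so a join can run partition by partition without moving records between partitions.

```python
from collections import defaultdict


def partition_by_key(records, num_partitions):
    """Hash-partition (key, value) pairs into num_partitions buckets."""
    partitions = [[] for _ in range(num_partitions)]
    for key, value in records:
        partitions[hash(key) % num_partitions].append((key, value))
    return partitions


def join_copartitioned(left_parts, right_parts):
    """Join two co-partitioned datasets one partition pair at a time.

    Assumes both sides were built with the same hash function and the
    same number of partitions, so no record has to cross partitions.
    """
    joined = []
    for left, right in zip(left_parts, right_parts):
        # Build a lookup table for the left side of this partition only.
        index = defaultdict(list)
        for key, value in left:
            index[key].append(value)
        # Probe it with the right side of the same partition.
        for key, value in right:
            for left_value in index.get(key, []):
                joined.append((key, (left_value, value)))
    return joined


# Hypothetical sample data.
users = [(1, "ann"), (2, "bob"), (3, "cy")]
orders = [(1, "book"), (3, "pen"), (3, "ink")]
num_partitions = 4

result = join_copartitioned(
    partition_by_key(users, num_partitions),
    partition_by_key(orders, num_partitions),
)
```

In a real cluster the saving comes from skipping the shuffle step that would otherwise repartition one or both datasets before the join.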
On Apr 27, 2015 10:34 PM, "bit1...@163.com" wrote:
> Hi,
>
> I am frequently asked why spark is also much faster than H
I believe the typical answer is that Spark is actually a bit slower.
On Mon, Apr 27, 2015 at 7:34 PM bit1...@163.com wrote:
Hi,
I am frequently asked why spark is also much faster
http://www.datascienceassn.org/content/making-sense-making-sense-performance-data-analytics-frameworks
From: "bit1...@163.com"
To: user
Sent: Monday, April 27, 2015 8:33 PM
Subject: Why Spark is much faster than Hadoop MapReduce even on disk
Is it? I learned elsewhere that Spark is 5 to 10 times faster than
Hadoop MapReduce.
bit1...@163.com
From: Ilya Ganelin
Date: 2015-04-28 10:55
To: bit1...@163.com; user
Subject: Re: Why Spark is much faster than Hadoop MapReduce even on disk
I believe the typical answer is that Spark is actually a bit slower.
On Mon, Apr 27, 2015 at 7:34 PM bit1...@163.com wrote:
> Hi,
>
> I am frequently asked why Spark is also much faster than Hadoop MapReduce
> on disk (without the use of memory cache). I have no convincing answer for
> this quest
Hi,
I am frequently asked why Spark is also much faster than Hadoop MapReduce on
disk (without the use of memory cache). I have no convincing answer for this
question. Could you guys elaborate on this? Thanks!