o there's one data point, if only for the obvious data point comparing
> computations in Scala to computations in pure Python.
>
>
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/Scala-vs-Python-performance-differences
ons in pure Python.
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Scala-vs-Python-performance-differences-tp4247p21190.html
Sent from the Apache Spark User List mailing list archive at
I was about to ask this question.
On Wed, Nov 12, 2014 at 3:42 PM, Andrew Ash wrote:
> Jeremy,
>
> Did you complete this benchmark in a way that's shareable with those
> interested here?
>
> Andrew
>
> On Tue, Apr 15, 2014 at 2:50 PM, Nicholas Chammas <
> nicholas.cham...@gmail.com> wrote:
>
>>
Jeremy,
Did you complete this benchmark in a way that's shareable with those
interested here?
Andrew
On Tue, Apr 15, 2014 at 2:50 PM, Nicholas Chammas <
nicholas.cham...@gmail.com> wrote:
> I'd also be interested in seeing such a benchmark.
>
>
> On Tue, Apr 15, 2014 at 9:25 AM, Ian Ferreira
>
I'd also be interested in seeing such a benchmark.
On Tue, Apr 15, 2014 at 9:25 AM, Ian Ferreira wrote:
> This would be super useful. Thanks.
>
> On 4/15/14, 1:30 AM, "Jeremy Freeman" wrote:
>
> >Hi Andrew,
> >
> >I'm putting together some benchmarks for PySpark vs Scala. I'm focusing on
> >ML
This would be super useful. Thanks.
On 4/15/14, 1:30 AM, "Jeremy Freeman" wrote:
>Hi Andrew,
>
>I'm putting together some benchmarks for PySpark vs Scala. I'm focusing on
>ML algorithms, as I'm particularly curious about the relative performance
>of
>MLlib in Scala vs the Python MLlib API vs pur
le.com/Scala-vs-Python-performance-differences-tp4247p4261.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
At least, Spark Streaming doesn't support Python at this moment, right?
On Mon, Apr 14, 2014 at 6:48 PM, Andrew Ash wrote:
> Hi Spark users,
>
> I've always done all my Spark work in Scala, but occasionally people ask
> about Python and its performance impact vs the same algorithm
> implementat
Hi Spark users,
I've always done all my Spark work in Scala, but occasionally people ask
about Python and its performance impact vs the same algorithm
implementation in Scala.
Has anyone done tests to measure the difference?
Anecdotally I've heard Python is a 40% slowdown but that's entirely hea