Hi all,
Please find attached the image of benchmark results. The table in the
previous mail got messed up. Thanks.
On Friday 19 September 2014 10:55 AM, Meethu Mathew wrote:
> Hi all,
> We have come up with an initial distributed implementation of Gaussian
> Mixture Model in pyspark where the par
Hi all,
We have come up with an initial distributed implementation of Gaussian
Mixture Model in pyspark where the parameters are estimated using the
Expectation-Maximization algorithm. Our current implementation considers
a diagonal covariance matrix for each component.
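For readers unfamiliar with the approach, one EM iteration for a mixture with diagonal covariances can be sketched in plain Python as below. This is only an illustration and not the implementation described in this mail: the function names (`log_gaussian_diag`, `responsibilities`, `em_step`) and the `min_var` variance floor are assumptions made for the example. The E-step reduces each point to per-component sums, which is why the step lends itself to a distributed map followed by a reduce.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian whose covariance is diagonal (var = the diagonal)."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def responsibilities(x, weights, means, variances):
    """E-step for one point: posterior probability of each component."""
    logs = [
        math.log(w) + log_gaussian_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(logs)  # shift before exponentiating (log-sum-exp) for stability
    exps = [math.exp(value - top) for value in logs]
    total = sum(exps)
    return [e / total for e in exps]


def em_step(points, weights, means, variances, min_var=1e-6):
    """One EM iteration; returns updated (weights, means, variances).

    min_var is a hypothetical floor that keeps variances strictly positive.
    Assumes every component receives non-zero total responsibility.
    """
    k, d = len(weights), len(points[0])
    # Sufficient statistics per component: sum r, sum r*x, sum r*x^2.
    s0 = [0.0] * k
    s1 = [[0.0] * d for _ in range(k)]
    s2 = [[0.0] * d for _ in range(k)]
    for x in points:  # in a distributed setting: map per point, then sum
        r = responsibilities(x, weights, means, variances)
        for j in range(k):
            s0[j] += r[j]
            for t in range(d):
                s1[j][t] += r[j] * x[t]
                s2[j][t] += r[j] * x[t] * x[t]
    n = len(points)
    # M-step: re-estimate parameters from the accumulated statistics.
    new_weights = [s0[j] / n for j in range(k)]
    new_means = [[s1[j][t] / s0[j] for t in range(d)] for j in range(k)]
    new_vars = [
        [max(s2[j][t] / s0[j] - new_means[j][t] ** 2, min_var) for t in range(d)]
        for j in range(k)
    ]
    return new_weights, new_means, new_vars
```

Calling `em_step` repeatedly until the parameters stop changing gives the usual EM loop. With diagonal covariances, each component stores only one variance per dimension, so the statistics exchanged per component grow linearly with the dimension.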
We did an initial benchmar
2014-09-16 22:32 GMT-07:00 Jun Feng Liu :
> I see. Thank you, it works for me. It looks confusing to have two ways
> to expose configuration though.
>
I agree. We're working on it. :)
> Best Regards
>
>
> *Jun Feng Liu*
> IBM China Systems & Technology Laboratory in Beijing
>
>
Yes - I believe we use the local dirs for spilling as well.
On Thu, Sep 18, 2014 at 7:57 AM, Tom Hubregtsen wrote:
> Hi all,
>
> Just one line of context, since the last post mentioned this would help:
> I'm currently writing my master's thesis (Computer Engineering) on storage
> and memory in both Sp
Hi all,
Just one line of context, since the last post mentioned this would help:
I'm currently writing my master's thesis (Computer Engineering) on storage
and memory in both Spark and Hadoop.
Right now I'm trying to analyze the spilling behavior of Spark, and I do not
see what I expect. Therefore, I w