Re: Gaussian Mixture Model clustering

2014-09-18 Thread Meethu Mathew
Hi all, Please find attached the image of benchmark results. The table in the previous mail got messed up. Thanks. On Friday 19 September 2014 10:55 AM, Meethu Mathew wrote: Hi all, We have come up with an initial distributed implementation of Gaussian Mixture Model in pyspark where the par

Gaussian Mixture Model clustering

2014-09-18 Thread Meethu Mathew
Hi all, We have come up with an initial distributed implementation of Gaussian Mixture Model in pyspark where the parameters are estimated using the Expectation-Maximization algorithm.Our current implementation considers diagonal covariance matrix for each component. We did an initial benchmar

Re: Spark authenticate enablement

2014-09-18 Thread Andrew Or
2014-09-16 22:32 GMT-07:00 Jun Feng Liu : > I see. Thank you, it works for me. It looks confusing to have two ways > expose configuration though. > I agree. We're working on it. :) > Best Regards > > > *Jun Feng Liu* > IBM China Systems & Technology Laboratory in Beijing > >

Re: Spark spilling location

2014-09-18 Thread Patrick Wendell
Yes - I believe we use the local dirs for spilling as well. On Thu, Sep 18, 2014 at 7:57 AM, Tom Hubregtsen wrote: > Hi all, > > Just one line of context, since last post mentioned this would help: > I'm currently writing my masters thesis (Computer Engineering) on storage > and memory in both Sp

Spark spilling location

2014-09-18 Thread Tom Hubregtsen
Hi all, Just one line of context, since last post mentioned this would help: I'm currently writing my masters thesis (Computer Engineering) on storage and memory in both Spark and Hadoop. Right now I'm trying to analyze the spilling behavior of Spark, and I do not see what I expect. Therefor, I w