Re: Clustering algorithms in Spark

2015-04-22 Thread Jeetendra Gangele
does anybody have any thought on this? On 21 April 2015 at 20:57, Jeetendra Gangele wrote: > The problem with k means is we have to define the no of cluster which I > dont want in this case > So thinking for something like hierarchical clustering any idea and > suggestions? > > > > On 21 April 2

Re: Clustering algorithms in Spark

2015-04-21 Thread Jeetendra Gangele
The problem with k means is we have to define the no of cluster which I dont want in this case So thinking for something like hierarchical clustering any idea and suggestions? On 21 April 2015 at 20:51, Jeetendra Gangele wrote: > I have a requirement in which I want to match the company name .

Clustering algorithms in Spark

2015-04-21 Thread Jeetendra Gangele
I have a requirement in which I want to match the company name .. and I am thinking to solve this using clustering technique. Can anybody suggest which algo I should Use in Spark and how to evaluate the running time and accuracy for this particular problem. I checked K means looks good. Any idea