Re: [R] hclust, does order of data matter?

Christian Hennig Mon, 15 Nov 2010 14:22:13 -0800

I don't know how the hclust function is implemented, but generally inhierarchical clustering the result can be ambiguous if there are severaldistances of identical value in the dataset (or identical between-clusterdistances occur when aggregating clusters). The role of the order of thedata depends on how these ambiguities are resolved. It may well be that insuch cases if at some point when building the hierarchy there are twodifferent possibilities to merge clusters at the same distance value whatis done by hclust is determined by the order.


Hope this helps,
Christian


On Mon, 15 Nov 2010, rchowdhury wrote:


Hello,

I am using the hclust function to cluster some data.  I have two separate
files with the same data.  The only difference is the order of the data in
the file.  For some reason, when I run the two files through the hclust
function, I get two completely different results.

Does anyone know why this is happening?  Does the order of the data matter?

Thanks,
RC
--
View this message in context: 
http://r.789695.n4.nabble.com/hclust-does-order-of-data-matter-tp3043896p3043896.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


*** --- ***
Christian Hennig
University College London, Department of Statistical Science
Gower St., London WC1E 6BT, phone +44 207 679 1698
chr...@stats.ucl.ac.uk, www.homepages.ucl.ac.uk/~ucakche

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] hclust, does order of data matter?

Reply via email to