Alexy Khrabrov wrote:

>I get tables with millions of rows.  For plotting to a screen-size  
>jpg, obviously just about 1000 points are enough.  Instead of feeding  
>plot() the original millions of rows, I'd rather shrink the original  
>dataframe, using some kind of the following interpolation:
>
>-- split dataframe into chunks of N rows each, e.g. 1000 rows each
>-- compute average for each column
>-- issue one new row of those averages into the shrunk result
>
>Is there any existing package to do that in R?  Otherwise, which R  
>idioms are most effective to achieve that?
>
>Cheers,
>Alexy
>
>
>  
>
Hi,

if you want to extract relevant information from such a table, splitting 
rows in arbitrary chuncks may not solve your problem. Ordinations in 
reduced space are designed for that kind of task, but hierachical 
clustering may also help. See Legendre & Legendre (1998, Numerical 
Ecology, Elsevier) for examples of such methods in Ecology, and the R 
packages ade4, vegan and hclust.

Regards,

Thibaut.

-- 
######################################
Thibaut JOMBART
CNRS UMR 5558 - Laboratoire de Biométrie et Biologie Evolutive
Universite Lyon 1
43 bd du 11 novembre 1918
69622 Villeurbanne Cedex
Tél. : 04.72.43.29.35
Fax : 04.72.43.13.88
[EMAIL PROTECTED]
http://lbbe.univ-lyon1.fr/-Jombart-Thibaut-.html?lang=en
http://pbil.univ-lyon1.fr/software/adegenet/

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to