Hello,
I would like to use aggregate with a function that
requires several argument that are columns of data.
As a simple example, suppose I have data in the following dataframe and I
would like to summarize the difference between columns obs and frc for each
site. How would I do this? (In reality, the function is slightly more
complicated, but still requires several columns of data.)
> DAT <- data.frame(site = rep(c("A","B"), 10), obs = rnorm(20), frc =
rnorm(20))
> DAT
site obs frc
1 A 1.27451057 -1.68995017
2 B 1.43942253 0.41672963
3 A -0.10875319 0.77108721
4 B -0.63198144 -0.21772356
5 A -0.42084163 0.50997647
....
I have tried variations on the following syntax with no success.
> F <- function(sub){
mean(sub[,"obs"] - sub[,"frc"] )
}
> aggregate(DAT[,c("obs", "frc")], by = list(DAT$site), F)
I have had partial success with the by command, but the by-class object is
awkward and I would like to use the aggregate command to be consistent with
other functions.
Thanks,
Matt
--
Matt Pocernich
National Center for Atmospheric Research
Research Applications Laboratory
(303) 497-8312
--
Matt Pocernich
National Center for Atmospheric Research
Research Applications Laboratory
(303) 497-8312
______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.