Re: [R] calculating the mean of a random matrix (by row) and some general questions

David Winsemius Tue, 19 Jul 2011 14:26:18 -0700


On Jul 19, 2011, at 4:18 PM, Peter Lomas wrote:

Hi Richard,
As others have said, try to use the "apply" functions rather than loops. There is also an apply function for lists, see ?lapply. This is much more
efficient.

Actually the "apply" functions are not "more efficient" in the usual meaning of time of execution, and sometimes they are rather inefficient. Prior discussions of this topic in the archives should be easy to find. The economy is in expression, and the advantage is in code creation and maintenance.


Doubters of this proposition should consider these results:

library(rbenchmark) # help page has a more compact version of these tests


# n means of m exponential draws via replicate()
means.rep <- function(n, m) {
  res1 <- vector(length=100, mode="numeric")
  res1 <- replicate(n, mean(rexp(m)))
}
# one m x n matrix, then column means
means.colMn <- function(n, m) {
  res2 <- vector(length=100, mode="numeric")
  res2 <- colMeans(matrix(rexp(n*m), c(m, n)))
}
# one long vector, grouped into n blocks of m
means.tapply <- function(n, m) {
  res3 <- vector(length=100, mode="numeric")
  res3 <- tapply(rexp(n*m), rep(1:n, each = m), mean)
}
# one n x m matrix, then row means via apply()
means.apply <- function(n, m) {
  res4 <- vector(length=100, mode="numeric")
  res4 <- apply(matrix(rexp(m*n), n, m), 1, mean)
}
# explicit for-loop (kept exactly as it was timed)
means.forloop <- function(n, m) {
  res5 <- vector(length=100, mode="numeric")
  for (i in n) {res5[i] <- mean(rexp(m))}
}

benchmark(
   repl = means.rep(100, 100),
   tappl = means.tapply(100, 100),
   appl = means.apply(100, 100),
   pat = means.colMn(100, 100),   # the colMeans version, labelled "pat" in the results
   forloop = means.forloop(100, 100),
   replications=100, columns=c("test","replications","relative","elapsed"),
   order='elapsed' )

###
Results:
     test replications relative elapsed
5 forloop          100     1.00   0.004
4     pat          100    20.25   0.081
1    repl          100    77.00   0.308
3    appl          100    89.75   0.359
2   tappl          100   264.50   1.058

I admit that I was rather surprised to see the for-loop beating colMeans by such a wide margin, and this is making me wonder if I reversed some index or coded the for-loop test wrong. So I would appreciate some auditing and improvement of this test. (But I don't see how I could have reversed the order, since n and m are both 100. And I tried adding assignments to see if there were only promises being made with no calculations. The relative efficiencies stay the same.)
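
One possible explanation for the timing, offered as a sketch rather than a verified re-run of the benchmark: in means.forloop the loop is written as for (i in n). With n = 100 that iterates over the single value 100, so the body runs once instead of 100 times, and the function also returns NULL because a for-loop has no value. The name means.forloop2 below is hypothetical and not from the thread; it iterates over seq_len(n) and returns the result.

```r
# `for (i in n)` loops over the elements of n; a single number has one element.
count <- 0
for (i in 100) count <- count + 1
count   # the body ran exactly once

# Hypothetical corrected version (name not from the original post).
means.forloop2 <- function(n, m) {
  res <- numeric(n)                 # preallocate n slots
  for (i in seq_len(n)) {           # iterate 1, 2, ..., n
    res[i] <- mean(rexp(m))         # mean of m exponential draws
  }
  res                               # return the vector explicitly
}

out <- means.forloop2(100, 100)
length(out)   # one mean per iteration
```

If this reading is right, the 0.004 figure measures a single call to mean(rexp(m)) per run and should not be taken as a for-loop beating colMeans.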


--
David.

 I also like writing my own functions.  For example:

f <- function(x) {
  x^2
}

Which can then be used by:
f(2)
[1] 4
This is very useful if you're getting into maximum likelihood programming,
or want to use the "optim" function (for multivariate functions) or
"optimize" (for univariate functions).
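
As a minimal sketch of that idea, the following writes a negative log-likelihood as an ordinary function and hands it to optimize. The simulated data, the function name negloglik, and the search interval are assumptions chosen for illustration; they are not from the thread.

```r
set.seed(1)
x <- rexp(1000, rate = 2)   # simulated data with a known rate (assumption)

# Negative log-likelihood of an exponential model; the first argument is
# the parameter that optimize() varies, `data` is passed through `...`.
negloglik <- function(rate, data) {
  -sum(dexp(data, rate = rate, log = TRUE))
}

fit <- optimize(negloglik, interval = c(0.01, 10), data = x)
fit$minimum    # numerical estimate of the rate
1 / mean(x)    # closed-form maximum likelihood estimate, for comparison
```

For a model with several parameters, the same function-writing pattern applies with optim, which takes a vector of starting values instead of an interval.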

Lastly, check out the R reference card.
http://cran.r-project.org/doc/contrib/Short-refcard.pdf

Regards,
Peter
On Tue, Jul 19, 2011 at 12:43, RichardLang <l...@zedat.fu-berlin.de> wrote:
Hi everyone!

I'm trying to teach myself R in order to do some data analysis. I'm a
mathematics student and (only) familiar with matlab and latex. I'm working through the "official" introduction to R at the moment, while simultaneously
solving some exercises I found on the web. Before I post my (probably
stupid) question, I'd like to ask you for some general advice. How do you work with R? Is it like in matlab, that you write your functions with a lot of loops etc. in a textfile and then run it? Or do you just prepare your data and then use the functions provided by R (plot, mean etc) to get some
analysis? I'd be very thankful for some of your thoughts about
"approaches".

Now the question: I'm trying to build a vector with n entries, each
consisting of the mean of m random numbers (exponentially distributed, for example). My approach was to construct an n x m random matrix and then to somehow take the mean of each row. But in the mean function there is no parameter to do this, so the intended approach of R is probably different...
any ideas? =)

Richard
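
The matrix approach described in the question can be sketched directly with base R, where rowMeans takes the mean of each row and apply(x, 1, mean) does the same thing in a more general form. The values of n and m and the seed are arbitrary choices for illustration.

```r
n <- 5      # number of means wanted (arbitrary)
m <- 1000   # random numbers per mean (arbitrary)

set.seed(42)
x <- matrix(rexp(n * m), nrow = n, ncol = m)   # n x m matrix of exponential draws

row_means <- rowMeans(x)                # one mean per row, length n
row_means_apply <- apply(x, 1, mean)    # same result via apply over rows

length(row_means)
all.equal(row_means, row_means_apply)
```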



David Winsemius, MD
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
