> It will of necessity be slower (because there's more machinery underlying
> the sqldf package); but I doubt whether it would be noticeably slower than
> the native R solution in most practical situations. The same would be true
> for plyR's implementation (it relies on the proto package, which slows
> things down a bit).

Plyr doesn't use proto at all (that's ggplot2).  Plyr is generally
faster than split + lapply etc for large datasets with many splits,
but slower with smaller datasets/fewer splits.

> The point is that the most important issue in almost all cases is the
> programmer's time to create and debug correct code, especially as the native
> machine speeds continue to increase. R gives you the option to choose
> whatever idiom you prefer to minimize this. The software implementation
> differences thereafter will rarely be important.

Totally agreed!  In my mind the advantage of learning plyr, is that
you learn one set of methods that work for lists, data frames and
arrays. And because all of the functions are designed with consistency
in mind, it hopefully takes less time to learn them all.

Hadley

-- 
http://had.co.nz/

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to