Dear Colleagues,

        I've been searching for a post or article or something which
explains why having na.rm=FALSE or na.action=na.fail as the default is a
better choice than TRUE or na.omit.

        I understand the basic argument: it does not make sense to average a
nonexistance into an aggregate, and removing them implicitly leads to
accidental pairwise deletion in some cases, and sum(x) / length(x) < mean(x)
(which many would find disturbing)...I'm just looking for a source to cite
on this issue to support mimicking R's behavior in a database system's
aggregating functions (sum, avg, var, etc.).

Cordially,
Adam Kramer
Ph.D. Candidate, Social Psychology
University of Oregon
adik at uoregon dot edu

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to