Hi,

I have a dataset with clustered data (observations within groups) and would 
like to make some descriptive plots.

Now, I am a little bit lost on how to present the dispersion of the data (what 
kind of residuals to plot).
I could compute the standard error of the mean (SEM) ignoring the clustering 
(very low values and misleading) or I could first aggregate the data by 
calculating th mean for each group and calculate the SEM for this means.
But I am not so sure what implication these two approaches have. In the end, I 
take the clustering into account by fitting a random-intercept regression model 
– however for plotting I would like to have a descriptive dispersion indicator 
of the data.

Now, I heard a lot about 'clustered' or 'robust' standard errors.
Is there some kind of correction I can apply to the simple SEM formula 
(sd(x)/sqrt(m)) to take care of correlated observations within clusters?
Or are there bootstrapping or jackknife approaches implemented in R (or cran 
package) which give me unbiased variance estimation for clustered data?

thanks for any suggestions!

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to