Hello R users,
I am working with a large dataset, including roughly 50 000 sequential
observations (variable "count") for 8000 individuals (variable "id"). The
dataset is very unbalanced, meaning that some individuals have few observations
and others have many. Because I plan on running Generalized Linear Models for
panel data using pglm and the package has file size restrictions, I want to
create 4 randomly selected subsets of 2500 individuals from the main dataset.
What functions and code would I use to do this?
Thanks in advance,
David Joubert
[[alternative HTML version deleted]]
______________________________________________
[email protected] mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.