My problem is I have data with both categorial and numerical data,
currently only the categorical number contains missing data, was wondering
do I make a new dataframe containing only the categorical columns?

How would you use Latent Class Model specifically poLCA to impute the
missing data?
http://www.sscnet.ucla.edu/polisci/faculty/lewis/pdf/poLCA-JSS-final.pdf

The reason why I chose not to use Multiple Imputation(MI) is because
according to
[http://blogs.iq.harvard.edu/sss/archives/2008/09/a_handy_trick_f.shtml]

"MI packages assume the Multivariate Normal Distribution which may not hold
for certain types of categorical and binary data. Yucel Recai, Yulei He,
and Alan Zaslavsky point out in their May 2008 article in The American
Statistician, naive rounding MI imputations can bias estimates,
particularly when the underlying data are asymmetric or multimodal."


However instead of using Yucel Recai, Yulei He, and Alan Zaslavsky's
rounding strategy [
http://amstat.tandfonline.com/doi/pdf/10.1198/000313008X300912] , I opted
for the Latent Class Model: http://spitswww.uvt.nl/~vermunt/ginkel2007.pdf
Who are the authors of the R package poLCA.

Thanks
Chris

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to