On Sat, Oct 1, 2011 at 4:59 AM, Farley, Robert <farl...@metro.net> wrote: > I'm about to add weights to a bus on-board survey dataset with ~150 variables > and ~28,000 records. My intention is to weight (for each bus "run") by > boarding stop and alighting stop. I've seen the Rake function of the Survey > package, but it seems that converting to a "svydesign" might be excessive for > my purpose. > > My dataset has a huge number of unique "Run-Boarding" and "Run-Alighting" > groups each with a small number of records to expand. Would it be easier to > manually implement Iterative-Proportional-Fitting/Raking/Fratar/Furness on > the data? Or are there benefits to converting the data to a svydesign that > would make it valuable? This "traditional" weighting expands what we call > unlinked (based on each boarding)trips. I'm thinking of also using > IPF/Raking to estimate linked (based on each individual) trips. Would this > change the consideration of using the svydesign process? >
If you're planning to do any analysis afterwards it would be useful to have the data in a svydesign object, or if you end up needing to do weight trimming or bounding, or other slightly more complicated weight adjustments. Otherwise it might well just be easier to do your own IPF algorithm. -thomas -- Thomas Lumley Professor of Biostatistics University of Auckland ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.