Re: [R] Advice on approach to weighting survey

Thomas Lumley Sun, 02 Oct 2011 12:16:48 -0700

On Sat, Oct 1, 2011 at 4:59 AM, Farley, Robert <farl...@metro.net> wrote:
> I'm about to add weights to a bus on-board survey dataset with ~150 variables 
> and ~28,000 records.  My intention is to weight (for each bus "run") by 
> boarding stop and alighting stop.  I've seen the Rake function of the Survey 
> package, but it seems that converting to a "svydesign" might be excessive for 
> my purpose.
>
> My dataset has a huge number of unique "Run-Boarding" and "Run-Alighting" 
> groups each with a small number of records to expand.  Would it be easier to 
> manually implement Iterative-Proportional-Fitting/Raking/Fratar/Furness on 
> the data?  Or are there benefits to converting the data to a svydesign that 
> would make it valuable?  This "traditional" weighting expands what we call 
> unlinked (based on each boarding)trips.  I'm thinking of also using 
> IPF/Raking to estimate linked (based on each individual) trips.  Would this 
> change the consideration of using the svydesign process?
>


If you're planning to do any analysis afterwards it would be useful to
have the data in a svydesign object, or if you end up needing to do
weight trimming or bounding, or other slightly more complicated weight
adjustments.  Otherwise it might well just be easier to do your own
IPF algorithm.

   -thomas

-- 
Thomas Lumley
Professor of Biostatistics
University of Auckland

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Advice on approach to weighting survey

Reply via email to