Re: [R] Newey West HAC for pooled cross-section data

Achim Zeileis Wed, 27 Mar 2013 03:50:58 -0700

On Tue, 26 Mar 2013, SHISHIR MATHUR wrote:

Thanks for the reply Achim. The reason I suspect autocorrelation isbecause I think that within the same neighborhood, homes sold a fewmonths back are likely to impact the price of homes sold subsequently.

This may well be spatial (auto)correlation rather than temporalautocorrelation.

In fact the DW test and Breusch-Pagan test come out to be significant.So even though the data is not time series (that is, I do not haverepeated observations for the same house), however, the houses soldclose in time to each other are in the data set.

If there is a unique ordering of all observations by time, then you couldin principle apply an autocorrelation correction for the data, e.g., viaNewey-West.

But from what you describe above, it seems to be more important to capturespatial effects in the data, e.g., by using a spatial lag model (seelagsarlm in "spdep") or by using an additive spatial effect (see e.g. gamin "mgcv").

Thanks,
Shish

On Tue, Mar 26, 2013 at 3:51 PM, Achim Zeileis <achim.zeil...@uibk.ac.at>
wrote:
      On Tue, 26 Mar 2013, SHISHIR MATHUR wrote:

            Hello:
            My dataset set contains several thousand rows of
            data, with each row
            containing information for a house. The variables
            include the sale price of
            the house, the quarter and year of sale, the
            attributes of the house, and
            the attributes of the neighborhood and the city in
            which the house is
            located. The data is for a 10-year period. No house
            is repeated in the
            dataset. In summary, the dataset can be termed
            pooled cross-section data.

            My question: Can I estimate Newey-West HAC standard
            errors for a model that
            estimates the effect of various independent
            variables on the sale price of
            the house?  My understanding is that Newey-West can
            be used for time series
            and panel data. However, I am not sure whether it
            can be used for pooled
            cross-section data.  If yes, can you refer me to a
            specific source, such as
            a paper or a book?


      The result of your aggregation is a cross-section data set.
      Thus, there should be no correlation between the different
      observations - or in other terms, the ordering of your
      observations is completely arbitrary.

      Consequently, there may be heteroskedasticity but not
      autocorrelation. So you may use HC standard errors but HAC
      should not be necessary. (Using HAC standard errors will still
      be consistent but less efficient.)


            --
            Best,
            Shish

                    [[alternative HTML version deleted]]

            ______________________________________________
            R-help@r-project.org mailing list
            https://stat.ethz.ch/mailman/listinfo/r-help
            PLEASE do read the posting guide
            http://www.R-project.org/posting-guide.html
            and provide commented, minimal, self-contained,
            reproducible code.




--
Best,
Shishir

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Newey West HAC for pooled cross-section data

Reply via email to