Re: [R] mgcv: inclusion of random intercept in model - based on p-value of smooth or anova?

Simon Wood Fri, 11 May 2012 08:45:41 -0700

Dear Martijn,

Thanks for the off line code and data: very helpful.

The answer to this is something of a 'can of worms'. Starting with the p-value inconsistency. The problem here really is that neither test is well justified in the case of s(...,"re") terms (and not having realised the extent of the problem it's not flagged properly).

In the case of the p-value from `summary', the p-value is computed as if the random effect were any other smooth. However the theory on which the p-values for smooths rests does not hold for "re" terms (basically the usual notion of smoothing bias is meaningless in the "re" case, and "re" terms can not usually be well approximated by truncated eigen approximations). The upshot is that you can get bias toward accepting the null. I'll revert to doing something more sensible for "re" terms for the next release, but it still won't be great, I guess.

The p-value from the comparison of models via 'anova' is equally suspect for "re" terms. Basically, this test is justified as a rough approximation in the case of usual smooth models, by the fact that we can approximate the model smooths by unpenalized reduced rank eigen approximations having degrees of freedom set to the effective degrees of freedom of the smooths. Again, however, such reduced rank approximations are generally not available for "re" terms, and I don't know if there is then a decent justification for the test in this case.

'AIC' might then be seen as the answer for model selection, but Greven and Kneib (2010, Biometrika) show that this is biased towards selecting the larger random effects model in this case (they provide a correction, but I'm not sure how easy it is to apply here).

You are left with a couple of sensible possibilities that are easy to use, if it's not clear from the estimates that the term is zero. Both involve using gam(...,method="REML") or gam(...,method="ML").

1. use gam.vcomp to get a confidence interval for the "re" variance component. If this is bounded well away from zero, then the result is clear.

2. Run a glrt test based on twice the difference in ML/REML score reported for the 2 models (c.f. chisq on 1 df for your case). This suffers from the usual problem of using a glrt test to test a variance component for equality to zero. (AIC based on this marginal likelihood doesn't fix the problem either --- see Greven and Kneib, again).
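
The two options above can be sketched roughly as follows. This is only an illustration on simulated data, not the code or data from this thread: the names dat, g, m0, m1, glrt and p_naive are made up for the example, and it assumes that the ML/REML score of a fitted gam is the value stored in its gcv.ubre component when method="REML" or method="ML" is used.

```r
library(mgcv)

## Simulated stand-in data (hypothetical): 20 groups with a real random intercept
set.seed(1)
n_groups <- 20
g <- factor(rep(seq_len(n_groups), each = 20))
n <- length(g)
x <- runif(n)
b <- rnorm(n_groups, sd = 1)                      # true group effects
y <- sin(2 * pi * x) + b[as.integer(g)] + rnorm(n, sd = 0.5)
dat <- data.frame(y = y, x = x, g = g)

## Both fits use REML and share the same fixed effects,
## so their REML scores are on a comparable footing
m0 <- gam(y ~ s(x), data = dat, method = "REML")
m1 <- gam(y ~ s(x) + s(g, bs = "re"), data = dat, method = "REML")

## Option 1: interval for the "re" variance component, reported as a std. dev.
vc <- gam.vcomp(m1)

## Option 2: twice the difference in REML score (mgcv minimises this score,
## so the smaller model's score minus the larger model's score is used)
glrt <- as.numeric(2 * (m0$gcv.ubre - m1$gcv.ubre))
p_naive <- pchisq(glrt, df = 1, lower.tail = FALSE)
cat("glrt statistic:", glrt, " naive p-value:", p_naive, "\n")
```

The chisq reference on 1 df is only a rough guide here, for the reason given in point 2, so the interval printed by gam.vcomp is probably the more useful output when the two disagree.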

The second issue, that adding a fixed effect can reduce the EDF, while improving the fit, is less of a problem, I think. If I'm happy to select the degree of smoothness of a model by GCV, REML or whatever, then I should also be happy to accept that the model with the fewer degrees of freedom, but more variables, is better than the one with more degrees of freedom and fewer variables. (The converse that I would ever reject the better fitting, less complex model is obviously perverse).

You can get similar effects in ordinary linear modelling: adding an important predictor gives such an improvement in fit that you can drop polynomial dependencies on other predictors, so a model with more degrees of freedom but fewer variables does worse than one with fewer degrees of freedom and more variables... the issue is just a bit more prominent when fitting GAMs because part of model selection is integrated with fitting in this case.
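
A small simulated illustration of this second point, again with made-up names (dat2, z, mA, mB) and data constructed so that the effect shows up: the response depends linearly on x and on a predictor z that itself varies smoothly with x. Without z the smooth of x has to absorb the wiggliness; with z in the model the smooth can relax towards a straight line, so the total EDF should go down while the fit improves.

```r
library(mgcv)

set.seed(2)
n <- 300
x <- runif(n)
z <- sin(6 * x) + rnorm(n, sd = 0.3)   # predictor that varies smoothly with x
y <- x + 2 * z + rnorm(n, sd = 0.3)    # truth: linear in both x and z
dat2 <- data.frame(y = y, x = x, z = z)

## Fewer variables: s(x) has to mimic the effect of the omitted z
mA <- gam(y ~ s(x), data = dat2, method = "REML")
## More variables: one extra parametric term for z
mB <- gam(y ~ s(x) + z, data = dat2, method = "REML")

## Total effective degrees of freedom and residual deviance of each fit
edf_total <- c(A = sum(mA$edf), B = sum(mB$edf))
dev <- c(A = deviance(mA), B = deviance(mB))
print(rbind(edf_total, dev))
```

How large the drop in EDF is depends entirely on the simulation settings chosen here, so this is a sketch of the phenomenon rather than a general recipe.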


best,
Simon




> 08/05/12 15:01, Martijn Wieling wrote:

Dear useRs,

I am using mgcv version 1.7-16. When I create a model with a few
non-linear terms and a random intercept for (in my case) country using
s(Country,bs="re"), the representative line in my model (i.e.
approximate significance of smooth terms) for the random intercept
reads:
               edf  Ref.df      F  p-value
s(Country)  36.127  58.551  0.644    0.982

Can I interpret this as there being no support for a random intercept
for country? However, when I compare the simpler model to the model
including the random intercept, the latter appears to be a significant
improvement.

anova(gam1,gam2,test="F")

Model 1: ....
Model 2: .... + s(BirthNation, bs="re")
   Resid. Df Resid. Dev     Df Deviance      F    Pr(>F)
1    789.44     416.54
2    753.15     373.54 36.292   43.003 2.3891 1.225e-05 ***

I hope somebody could help me in how I should proceed in these
situations. Do I include the random intercept or not?

I also have a related question. When I used to create a mixed-effects
regression model using lmer and included e.g., an interaction in the
fixed-effects structure, I would test if the inclusion of this
interaction was warranted using anova(lmer1,lmer2). It then would show
me that I invested 1 additional df and the resulting (possibly
significant) improvement in fit of my model.

This approach does not seem to work when using gam. In this case an
apparent investment of 1 degree of freedom for the interaction, might
result in an actual decrease of the degrees of freedom invested by the
total model (caused by a decrease of the edf's of splines in the model
with the interaction). In this case, how would I proceed in
determining if the model including the interaction term is better?

With kind regards,
Martijn Wieling

--
*******************************************
Martijn Wieling
http://www.martijnwieling.nl
wiel...@gmail.com
+31(0)614108622
*******************************************
University of Groningen
http://www.rug.nl/staff/m.b.wieling

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
Simon Wood, Mathematical Science, University of Bath BA2 7AY UK
+44 (0)1225 386603               http://people.bath.ac.uk/sw283
