Re: [R] Bestglm subset analysis

2016-06-29 Thread Jim Lemon
Hi Doug, To expand a bit on what Bert has written, all the the "best subset/best model" procedures use random variation in the dataset to produce a result. This means that you will almost certainly include variables in your "best model" that cannot be replicated. Sometimes you can see this as a var

Re: [R] Bestglm subset analysis

2016-06-29 Thread Bert Gunter
This is a statistics question, which is largely off topic on this list. However, I'll give you a very brief OT response: I would strongly suggest you consult a local statistician to explain to why what you are doing is likely to result in complete nonsense (best subset of 5 or 6 from 21 predictors

[R] Bestglm subset analysis

2016-06-29 Thread D Wolf via R-help
Hello All, I am working on a linear regression model and trying to find the best subset of variables for my dataset. I have 21 predictors, 1 response variable, and 79 observations. I need to find the best 5 or 6 predictors for my model. I've used leaps for lm() and I'm now trying bestglm for glm