Actually, the CI index and VIF are just a start.  It is best to look at what 
they call a matrix of "variance proportions" (found in SAS and a few other 
places...)--which hardly anyone understands (including the SAS folks).  It is a 
matrix of estimates of what the variences of the regression coefficients would 
be if you could figure them out in the first place.  It shows which factors 
dominate over others IN THE PARTICULAR SETUP you are analyzing.  The matrix is 
often calculated using eigenvalues, but is best done with Singular Value 
Decomposition techniques (you don't have to have a square matrix, and you 
maintain better precision).  Analysts will say that it can display an unstable 
system -- which is correct, but they generally say that, if its true, you have 
bad data and should throw it out--or collect more.  I suggest care, because it 
may be illustrating the nature of the system you are studying.

The only decent reference that I know of is a little book (hard to read) that I 
can't remember off the top of my head.  Have to look it up.

Timothy E. Paysen, Phd
Research Forester (ret.)




________________________________
From: John Sorkin <jsor...@grecc.umaryland.edu>
To: Alex Roy <alexroy2...@gmail.com>; r-help@r-project.org
Sent: Tuesday, July 21, 2009 4:19:11 AM
Subject: Re: [R] Collinearity in Linear Multiple Regression

I suggest you start by doing some reading about Condition index (CI) and 
variation inflation factor (VIF). Once you have reviewed the theory, a search 
of search.r-project.org (under the help menu in a windows-based R installation) 
for VIF will help you obtain values for VIF, c.f. 
http://finzi.psych.upenn.edu/R/library/HH/html/vif.html 
John

John David Sorkin M.D., Ph.D.
Chief, Biostatistics and Informatics
University of Maryland School of Medicine Division of Gerontology
Baltimore VA Medical Center
10 North Greene Street
GRECC (BT/18/GR)
Baltimore, MD 21201-1524
(Phone) 410-605-7119
(Fax) 410-605-7913 (Please call phone number above prior to faxing)

>>> Alex Roy <alexroy2...@gmail.com> 7/21/2009 7:01 AM >>>
Dear all,
                  How can I test for collinearity in the predictor data set
for multiple linear regression.

Thanks

Alex

    [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help 
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html 
and provide commented, minimal, self-contained, reproducible code.

Confidentiality Statement:
This email message, including any attachments, is for th...{{dropped:11}}

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to