Re: [R] survival::predict.coxph

Bernhard Reinhardt Fri, 27 Feb 2009 03:15:50 -0800

Hello Therry,

it´s really great to receive some feedback from a "pro". I´m not sure ifI´ve got the point right:You suppose that the cox-model isn´t good at forecasting an expectedsurvival time because of the issues with the prediction of thesurvival-function at the right tail and one should better use parametricmodels like an exponential model? Or what do you mean by "smoothparametric estimate"?Anyways I just ordered your book at the library. Hopefully I´ll get somemore insights by the lecture of it.


Maybe I should point out why I even tried to do such forecasts.

Following the article "Quantifying climate-related risks anduncertainties using Cox regression models" by Maia and Meinke I try todeduce winter-precipitation from lagged Sea-Surface-Temperatures (SSTs).So precipitation is my survival-time and and the SST-Observations atdifferent lags are my covariates.The sample size is only 55 and I´ve got 11 covariates (Lag=0 months toLag=10 months) to choose from.My first goal is to identify the optimal time-lag(s) betweenSST-Anomaly-Observation and Precipitation-Observation.

Expectation was that the lag should be some months.

I thought a cox-model would easily provide such a selection. At first Iused the covariates individually. Coefficients for lags between 0 and 5months were all quite big and then decreasing from 6 to 10 months. So Ithink 5 months could be the lag of the process and high persistence ofthe SST accounts for the big coefficients for 0-4 months.

As the next step I used all 11 covariates at once. I hoped to gainsimilar results. Instead the sign of the coefficients "randomly" jumpsfrom plus to minus and the magnitude as well is randomly distributed.

I also tried to using sets of three covariates e.g. with lag 4,5,6. Buteven then the sign of the coefficients is varying.

So my thought was that maybe I overfitted the model. But in fact I didnot find any literature if that´s even possible. As far as my limitedknowledge goes, overfitted models should reproduce the training-periodvery good but other periods very poor. So I first tried to reproduce thetraining-period. But so far with no success - as well with using 11covariates or just 1.


Regards

Bernhard R.

Terry Therneau wrote:

You are mostly correct.
Because of the censoring issue, there is no good estimate of the mean survivaltime. The survival curve either does not go to zero, or gets very noisy nearthe right hand tail (large standard error); a smooth parametric estimate is whatis really needed to deal with this.For this reason the mean survival, though computed (but see thesurvfit.print.mean option, help(print.survfit)) is not highly regarded. It isnot an option in predict.coxph.Terry T.
        
 ----begin included message --------------
Hi,
if I got it right then the survival-time we expect for a subject is theintegral over the specific survival-function of the subject from 0 to t_max.
If I have a trained cox-model and want to make a prediction of thesurvival-time for a new subject I could usesurvfit(coxmodel, newdata=newSubject) to estimate a newsurvival-function which I have to integrate thereafter.
Actually I thought predict(coxmodel, newSubject) would do this for me,but I?m confused which type I have to declare. If I understand thelittle pieces of documentation right then none of the available types isexactly the predicted survival-time.I think I have to use the mean survival-time of the baseline-functiontimes exp(the result of type linear predictor).
Am I right?


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] survival::predict.coxph

Reply via email to