Bioinformatics Advance Access originally published online on February 15, 2005
Bioinformatics 2005 21(10):2403-2409; doi:10.1093/bioinformatics/bti324
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Boosting proportional hazards models using smoothing splines, with applications to high-dimensional microarray data
1Rowe Program in Human Genetics, University of California Davis, CA 95616, USA
2School of Mathematics and Systematic Sciences, Shandong University Jinan, Shandong 250100, PRC
*To whom correspondence should be addressed.
Motivation: An important area of research in the postgenomics era is to relate high-dimensional genetic or genomic data to various clinical phenotypes of patients. Due to large variability in time to certain clinical events among patients, studying possibly censored survival phenotypes can be more informative than treating the phenotypes as categorical variables. Due to high dimensionality and censoring, building a predictive model for time to event is more difficult than the classification/linear regression problem. We propose to develop a boosting procedure using smoothing splines for estimating the general proportional hazards models. Such a procedure can potentially be used for identifying non-linear effects of genes on the risk of developing an event.
Results: Our empirical simulation studies showed that the procedure can indeed recover the true functional forms of the covariates and can identify important variables that are related to the risk of an event. Results from predicting survival after chemotherapy for patients with diffuse large B-cell lymphoma demonstrate that the proposed method can be used for identifying important genes that are related to time to death due to cancer and for building a parsimonious model for predicting the survival of future patients. In addition, there is clear evidence of non-linear effects of some genes on survival time.
Contact: hli{at}ucdavis.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. Lu and L. Li Boosting method for nonlinear transformation models with censored survival data Biostat., October 1, 2008; 9(4): 658 - 667. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Wei and H. Li Nonparametric pathway-based regression models for analysis of genomic data Biostat., April 1, 2007; 8(2): 265 - 284. [Abstract] [Full Text] [PDF] |
||||
