Bioinformatics 20(Suppl. 1) © Oxford University Press 2004; all rights reserved.
Partial Cox regression analysis for high-dimensional microarray gene expression data
Rowe Program in Human Genetics and Department of Statistics, University of California, Davis, CA 95616, USA
Received on January 15, 2004; accepted on March 1, 2004
Motivation: An important application of microarray technology is to predict various clinical phenotypes based on the gene expression profile. Success has been demonstrated in molecular classification of cancer in which different types of cancer serve as categorical outcome variable. However, there has been less research in linking gene expression profile to censored survival outcome such as patients' overall survival time or time to cancer relapse. In this paper, we develop a partial Cox regression method for constructing mutually uncorrelated components based on microarray gene expression data for predicting the survival of future patients.
Results: The proposed partial Cox regression method involves constructing predictive components by repeated least square fitting of residuals and Cox regression fitting. The key difference from the standard principal components of Cox regression analysis is that in constructing the predictive components, our method utilizes the observed survival/censoring information. We also propose to apply the time-dependent receiver operating characteristic curve analysis to evaluate the results. We applied our methods to a publicly available dataset of diffuse large B-cell lymphoma. The outcomes indicated that combining the partial Cox regression method with principal components analysis results in parsimonious model with fewer components and better predictive performance. We conclude that the proposed partial Cox regression method can be very useful in building a parsimonious predictive model that can accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients.
Availability: R codes are available upon request.
Contact: hli{at}ucdavis.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Kaderali, T. Zander, U. Faigle, J. Wolf, J. L. Schultze, and R. Schrader CASPAR: a hierarchical bayesian approach to predict survival times in cancer from gene expression data Bioinformatics, June 15, 2006; 22(12): 1495 - 1502. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Segal Microarray gene expression data with linked survival phenotypes: diffuse large-B-cell lymphoma revisited Biostat., April 1, 2006; 7(2): 268 - 285. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Li Survival prediction of diffuse large-B-cell lymphoma based on both clinical and gene expression information Bioinformatics, February 15, 2006; 22(4): 466 - 471. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gui and H. Li Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data Bioinformatics, July 1, 2005; 21(13): 3001 - 3008. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li and Y. Luan Boosting proportional hazards models using smoothing splines, with applications to high-dimensional microarray data Bioinformatics, May 15, 2005; 21(10): 2403 - 2409. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Reverter, S. M. McWilliam, W. Barris, and B. P. Dalrymple A rapid method for computationally inferring transcriptome coverage and microarray sensitivity Bioinformatics, January 1, 2005; 21(1): 80 - 89. [Abstract] [Full Text] [PDF] |
||||

