Bioinformatics Advance Access published online on November 26, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl543
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL 32611; Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611
* To whom correspondence should be addressed.
Motivation: Accurate prognosis of breast cancer can spare a significant number of breast cancer patients from receiving unnecessary adjuvant systemic treatment and its related expensive medical costs. Recent studies have demonstrated the potential value of gene expression signatures in assessing the risk of post-surgical disease recurrence. However, these studies all attempt to develop genetic marker-based prognostic systems to replace the existing clinical criteria, while ignoring the rich information contained in established clinical markers. Given the complexity of breast cancer prognosis, a more practical strategy would be to utilize both clinical and genetic marker information that may be complementary. Methods: A computational study is performed on publicly available microarray data which has spawned a 70-gene prognostic signature. The recently proposed I-RELIEF algorithm is used to identify a hybrid signature through the combination of both genetic and clinical markers. A rigorous experimental protocol is used to estimate the prognostic performance of the hybrid signature and other prognostic approaches. Survival data analyses is performed to compare different prognostic approaches. Results: The hybrid signature performs significantly better than other methods, including the 70-gene signature, clinical makers alone and the St. Gallen consensus criterion. At the 90% sensitivity level, the hybrid signature achieves 67% specificity, as compared to 47% for the 70-gene signature and 48% for the clinical makers. The odds ratio of the hybrid signature for developing distant metastases within five years between the patients with a good prognosis signature and the patients with a bad prognosis is 21.0 (95% CI: 6.5 - 68.3), far higher than either genetic or clinical markers alone. Availability: The breast cancer dataset is available at www.nature.com and Matlab codes are available upon request.
Received July 3, 2006
Revised October 11, 2006
Accepted October 15, 2006
Article
Improved breast cancer prognosis through the combination of clinical and genetic markers
Yijun Sun 1 *, Steve Goodison 2, Jian Li 3, Li Liu 4, and William Farmerie 4
2 Department of Surgery, University of Florida, Gainesville, FL 32611
3 Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611
4 Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL 32611
Yijun Sun, E-mail: sun{at}dsp.ufl.edu
![]()
Abstract
Associate Editor: David Rocke
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Barla, G. Jurman, S. Riccadonna, S. Merler, M. Chierici, and C. Furlanello Machine learning methods for predictive proteomics Brief Bioinform, March 1, 2008; 9(2): 119 - 128. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. L. Moldawer Opening the Window on Genome-wide Expression Analyses in Sepsis Am. J. Respir. Crit. Care Med., October 1, 2007; 176(7): 631 - 632. [Full Text] [PDF] |
||||

