Bioinformatics Advance Access published online on August 30, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti647
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 The Whitaker Biomedical Engineering Institute and Center for Cardiovascular Bioinformatics and Modeling, The Johns Hopkins University, Baltimore MD 21218 USA
* To whom correspondence should be addressed.
Motivation: DNA Microarray data analysis has been used previously to identify marker genes which discriminate cancer from normal samples. However, due to the limited sample size of each study, there are few common markers among different studies of the same cancer. With the rapid accumulation of microarray data, it is of great interest to integrate inter-study microarray data to increase sample size, which could lead to the discovery of more reliable markers. Results: We present a novel, simple method of integrating different microarray data sets to identify marker genes and apply the method to prostate cancer data sets. In this study, by applying a new statistical method, referred to as the top-scoring pair (TSP) classifier, we have identified a pair of robust marker genes (HPN and STAT6) by integrating microarray data sets from three different prostate cancer studies. Cross-platform validation shows that the TSP classifier built from the marker gene pair, which simply compares relative expression values, achieves high accuracy, sensitivity, and specificity on independent data sets generated using various array platforms. Our findings suggest a new model for the discovery of marker genes from accumulated microarray data and demonstrate how the great wealth of microarray data can be exploited to increase the power of statistical analysis.
Received May 19, 2005
Revised June 21, 2005
Accepted August 25, 2005
Article
Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data
2 The Whitaker Biomedical Engineering Institute and Center for Cardiovascular Bioinformatics and Modeling, The Johns Hopkins University, Baltimore MD 21218 USA; Department of Applied Mathematics and Statistics and Center for Imaging Sciences, The Johns Hopkins University, Baltimore MD 21218 USA
Lei Xu, E-mail: leixu{at}jhu.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Cheng, K. Shen, C. Song, J. Luo, and G. C. Tseng Ratio adjustment and calibration scheme for gene-wise normalization to enhance microarray inter-study prediction Bioinformatics, July 1, 2009; 25(13): 1655 - 1661. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. T. Leek The tspair package for finding top scoring pair classifiers in R Bioinformatics, May 1, 2009; 25(9): 1203 - 1204. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. G. Yulug and B. Gur-Dedeoglu Functional genomics in translational cancer research: focus on breast cancer Brief Funct Genomic Proteomic, March 7, 2008; (2008) eln009v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Lai, B.-l. Adam, R. Podolsky, and J.-X. She A mixture model approach to the tests of concordance and discordance between two large-scale experiments with two-sample groups Bioinformatics, May 15, 2007; 23(10): 1243 - 1250. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Stafford and M. Brun Three methods for optimization of cross-laboratory and cross-platform microarray expression data Nucleic Acids Res., May 11, 2007; 35(10): e72 - e72. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Price, J. Trent, A. K. El-Naggar, D. Cogdell, E. Taylor, K. K. Hunt, R. E. Pollock, L. Hood, I. Shmulevich, and W. Zhang Highly accurate two-gene classifier for differentiating gastrointestinal stromal tumors and leiomyosarcomas PNAS, February 27, 2007; 104(9): 3414 - 3419. [Abstract] [Full Text] [PDF] |
||||



