Bioinformatics Advance Access originally published online on May 6, 2005
Bioinformatics 2005 21(14):3114-3121; doi:10.1093/bioinformatics/bti483
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Ensemble dependence model for classification and prediction of cancer and normal gene expression data
1Department of Electrical and Computer Engineering, University of Maryland College Park, MD 20742, USA
2Department of Electrical and Computer Engineering, University of British Columbia Canada
*To whom all correspondence should be addressed.
Motivation: DNA microarray technologies make it possible to simultaneously monitor thousands of genes' expression levels. A topic of great interest is to study the different expression profiles between microarray samples from cancer patients and normal subjects, by classifying them at gene expression levels. Currently, various clustering methods have been proposed in the literature to classify cancer and normal samples based on microarray data, and they are predominantly data-driven approaches. In this paper, we propose an alternative approach, a model-driven approach, which can reveal the relationship between the global gene expression profile and the subject's health status, and thus is promising in predicting the early development of cancer.
Results: In this work, we propose an ensemble dependence model, aimed at exploring the group dependence relationship of gene clusters. Under the framework of hypothesis-testing, we employ genes' dependence relationship as a feature to model and classify cancer and normal samples. The proposed classification scheme is applied to several real cancer datasets, including cDNA, Affymetrix microarray and proteomic data. It is noted that the proposed method yields very promising performance. We further investigate the eigenvalue pattern of the proposed method, and we discover different patterns between cancer and normal samples. Moreover, the transition between cancer and normal patterns suggests that the eigenvalue pattern of the proposed models may have potential to predict the early stage of cancer development. In addition, we examine the effects of possible model mismatch on the proposed scheme.
Availability: see Supplemental website at http://dsplab.eng.umd.edu/~genomics/edm
Contact: qiupeng{at}umd.edu
Received on October 29, 2004; revised on April 5, 2005; accepted on May 1, 2005
This article has been cited by other articles:
![]() |
B. Zhang, H. Li, R. B. Riggins, M. Zhan, J. Xuan, Z. Zhang, E. P. Hoffman, R. Clarke, and Y. Wang Differential dependency network analysis to identify condition-specific topological changes in biological networks Bioinformatics, February 15, 2009; 25(4): 526 - 532. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Q. Tang, L. Y. Han, H. H. Lin, J. Cui, J. Jia, B. C. Low, B. W. Li, and Y. Z. Chen Derivation of Stable Microarray Cancer-Differentiating Signatures Using Consensus Scoring of Multiple Random Sampling and Gene-Ranking Consistency Evaluation Cancer Res., October 15, 2007; 67(20): 9996 - 10003. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Qiu, Z. J. Wang, K. J. R. Liu, Z.-Z. Hu, and C. H. Wu Dependence network modeling for biomarker identification Bioinformatics, January 15, 2007; 23(2): 198 - 206. [Abstract] [Full Text] [PDF] |
||||

