Bioinformatics Advance Access published online on April 6, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti419
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Tularik Inc., South San Francisco, California, 94080, USA
* To whom correspondence should be addressed.
Motivation: The development of microarray-based high throughput gene profiling has led to the hope that this technology could provide an efficient and accurate means of diagnosing and classifying tumors, as well as predicting prognoses and effective treatments. However, the large amount of data generated by microarrays requires the effective reduction of discriminant gene features into reliable sets of tumor biomarkers for such multiclass tumor discrimination. The availability of reliable sets of biomarkers, especially serum biomarkers, should have a major impact on our understanding and treatment of cancer. Results: We have combined genetic algorithm (GA) and all paired (AP) support vector machine (SVM) methods for multiclass cancer categorization. Predictive features can be automatically determined through iterative GA/SVM, leading to very compact sets of non-redundant cancer-relevant genes with the best classification performance reported to date. Interestingly, these different classifier sets harbor only modest overlapping gene features but have similar levels of accuracy in leave-one-out cross-validations (LOOCV). Further characterization of these optimal tumor discriminant features, including the use of Nearest Shrunken Centroids (NSC), analysis of annotations, and literature text mining, reveals previously unappreciated tumor subclasses and a series of genes that could be used as cancer biomarkers. With this approach, we believe that microarray-based multiclass molecular analysis can be an effective tool for cancer biomarker discovery and subsequent molecular cancer diagnosis. Availability: http://www.fishgenome.org/publication/Liu/bioinformatics/.
Received August 29, 2004
Revised March 11, 2005
Accepted March 30, 2005
Article
Multiclass cancer classification and biomarker discovery using GA-based algorithms
2 Zhejiang University, Hangzhou, 310027, China
3 Institute of Genetics and Developmental Biology, the Chinese Academy of Sciences, Beijing, 100101, China
Xuefeng Bruce Ling, E-mail: xuefeng_ling{at}yahoo.com
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. F. Basil, Y. Zhao, K. Zavaglia, P. Jin, M. C. Panelli, S. Voiculescu, S. Mandruzzato, H. M. Lee, B. Seliger, R. S. Freedman, et al. Common cancer biomarkers. Cancer Res., March 15, 2006; 66(6): 2953 - 2961. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Tan, L. Shi, S. M. Hussain, J. Xu, W. Tong, J. M. Frazier, and C. Wang Integrating time-course microarray gene expression profiles with cytotoxicity for identification of biomarkers in primary rat hepatocytes exposed to cadmium Bioinformatics, January 1, 2006; 22(1): 77 - 87. [Abstract] [Full Text] [PDF] |
||||

